Ethical Considerations in Using LLMs

The Good, the Bad, and the Ugly

Alex van Vorstenbosch

2025-02-01

Overview

__Disclaimer__
I don’t claim to have the answers.
Be aware of these topics.
Form your own opinions and openly discuss issues.

Overview

  • Biases and Misinformation
  • The Dark Side of LLMs
  • Group discussion

Biases and Misinformation

Biases

  • LLMs can reinforce negative stereotypes.
  • LLMs can strengthen the views of users via confirmation bias:
    • LLMs have a tendency to agree with the user
    • People have a tendency to think that models are ‘objective’ and speak the ‘truth’
  • LLMs are ‘skewed’ towards the training-set majority:
    • English-language, Western views, for example

Hallucinating false information

  • What are hallucinations?
    • An LLM generates incorrect, nonsensical, or unverifiable information and presents it as fact.
    • Hallucinations can also be answers that are not supported by the provided context.
    • They can be hard to spot, as the model is a great ‘bluffer’:
      • The model doesn’t know that the information is wrong
      • Asking the model for self-reflection does help (see the sketch below)
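
One way to use that self-reflection tendency is a simple two-pass prompt: first ask the question, then ask the model to critique its own answer. A minimal sketch, assuming an OpenAI-compatible chat API; the model name, the ask helper, and the prompts are illustrative, not part of the original material:

```python
# Two-pass self-reflection sketch: the model double-checks its own answer.
# Assumes the `openai` Python package (v1+) and OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()

def ask(prompt: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative choice; any chat model works
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

question = "Which Dutch law regulates the use of personal data?"
answer = ask(question)

# Second pass: ask the model to flag unsupported or unverifiable claims.
review = ask(
    f"Question: {question}\nAnswer: {answer}\n"
    "List any claims in this answer that may be incorrect or unverifiable."
)
print(answer, review, sep="\n---\n")
```

This does not eliminate hallucinations, but it often surfaces claims that are worth verifying by hand.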

Who is responsible when AI makes a mistake?

Stories of dangerous behaviour by AI chatbots

Screenshots of Business Insider’s disturbing conversation with “Eliza,” a chatbot from Chai Research.

Importance of human-in-the-loop

  • Quality control: because of their generative design, LLMs are prone to errors. Require human checks and feedback to ensure the accuracy of the output (a minimal sketch follows this list).
  • Human (ethical) judgement: some decisions require human (ethical) judgement, especially in complex, nuanced situations where context matters.
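
As a concrete illustration, a human-in-the-loop gate can be as simple as refusing to release any model output that a reviewer has not explicitly approved. A minimal sketch with hypothetical names (Draft, human_review, publish); a real system would use a review UI or ticket queue rather than a terminal prompt:

```python
# Human-in-the-loop sketch: nothing is published without explicit approval.
# All names here are illustrative, not a real library.
from dataclasses import dataclass

@dataclass
class Draft:
    text: str
    approved: bool = False

def human_review(draft: Draft) -> Draft:
    # A terminal prompt stands in for a real review workflow here.
    print(f"Proposed output:\n{draft.text}")
    draft.approved = input("Approve for release? [y/N] ").strip().lower() == "y"
    return draft

def publish(draft: Draft) -> None:
    if not draft.approved:
        raise RuntimeError("Blocked: not approved by a human reviewer.")
    print("Published:", draft.text)

publish(human_review(Draft(text="LLM-generated summary goes here.")))
```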

My personal beliefs:

  • Generative AI is an amazing, transformative tool, but not an autonomous agent.
  • You are responsible for the mistakes you make when using generative AI:
    • Make sure the risks are known
    • Make sure the risks are manageable
    • If that is not possible, make sure the risks are acceptable

What constitutes appropriate content?

Model                   Source          Restrictions
ChatGPT by OpenAI       Closed source   Strongly moderated and curated
Grok by xAI             Closed source   Fewer restrictions
Llama models by Meta    Open source     Can be fine-tuned for any purpose

Keep in mind: nobody is sharing the most important part: HIGH-QUALITY DATA

A Reddit poll on the most annoying ChatGPT responses

Overview

  • Biases and Misinformation
  • The Dark Side of LLMs
  • Group discussion

The Dark Side of LLMs

Privacy issues and LLMs

  • Can companies train LLMs on (scraped) private data without consent?
    • What if LLMs memorise private data?
  • How can we mitigate inference of private information by LLMs? (one partial mitigation is sketched below)
  • How can we trust third parties with our proprietary/private information?
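
One partial mitigation is to scrub obvious personal data before any text leaves your own environment. A minimal regex-based sketch; the patterns and the redact helper are illustrative, and production systems should rely on dedicated PII-detection tooling instead:

```python
# Redact obvious personal data before sending text to a third-party LLM API.
# The regexes below are illustrative and far from exhaustive.
import re

PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\+?\d[\d \-]{7,}\d"),
    "BSN": re.compile(r"\b\d{9}\b"),  # Dutch citizen service number (9 digits)
}

def redact(text: str) -> str:
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

print(redact("Mail j.jansen@example.nl or call +31 6 12345678."))
# -> "Mail [EMAIL] or call [PHONE]."
```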

Current view of the Dutch government

(Translated from Dutch:) “Non-contracted generative AI applications generally cannot demonstrably comply with the applicable privacy and copyright legislation. Consequently, their use by central government organisations (or on their behalf) is not permitted in cases where there is a risk that legislation is violated, unless the provider and the user demonstrably comply with the applicable laws and regulations.”

Transparency issues of LLMs

  • How can we trust models that are “black boxes”?
    • Especially if we aren’t even sure what these models look like or how they were trained?
  • How can these models be used if they can generate ‘hallucinations’ at any point?
  • How can we prevent the use of LLMs for unsuitable use cases?

Misuse of LLMs

  • How can we prevent the automated generation of misinformation at scale?
  • How can we prevent the use of these techniques for spam, identity fraud, and worse?
  • Who should decide what constitutes misuse of LLMs?

Are these AI developments safe?

Business Insider: Google Brain co-founder says Big Tech companies are inflating fears about the risks of AI wiping out humanity because they want to dominate the market

Should there be governmental oversight of AI?

EU AI Act

Passed in 2024, main effects:

  • Unacceptable-risk AI systems will be banned:
    • Real-time biometric identification
    • Behavioural manipulation
    • Social scoring systems
  • Limited-risk AI needs to be transparent:
    • You must know if you are interacting with AI
    • Companies must disclose if content was generated with AI
    • Chatbots are classified as limited risk
  • A new European AI Office will be set up to coordinate compliance, implementation, and enforcement of the AI Act
    • Tasked with oversight of general-purpose AI models across Europe

The economic impact of AI

  • Will it take (many of) our jobs?
  • Will it create jobs?
  • Or will it just make us more efficient at our current job?

Climate impact of large language models

Training the model via RLHF

  • Low-wage workers in Kenya were paid to help collect data for the ‘moderation’ tool:
    • Traumatising work

Use of LLMs for essays, homework, etc. cannot be reliably detected.

  • AI detectors don’t work, which creates serious issues for students.
  • AI detectors don’t work, which disrupts how homework is assigned and completed.

Overview

  • Biases and Misinformation
  • The Dark Side of LLMs
  • Group discussion

Group discussion

  1. Can LLMs be used if they are trained on copyrighted and/or GDPR-protected (AVG) data?

  2. Should LLM usage be constrained by ethical guidelines and content filters?

  3. Can LLMs be trusted if hallucinations are an inherent part of these systems?

Discuss within your group for 5-10 minutes, and then we will discuss the results in a plenary session.