Bias in Training Data
How historical inequities get baked into AI models, and what we can do about it
What it is
LLMs learn from internet-scale data that reflects historical human biases: racial, gender, cultural, and socioeconomic. Models trained on this data inherit these biases, generating more positive associations for some demographic groups than others, underrepresenting minority perspectives, and performing worse on tasks involving underrepresented languages or cultures.
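One way to see how such biases become measurable is a simple co-occurrence audit over a corpus. The sketch below is illustrative only: the tiny corpus, the occupation words, and the pronoun-ratio metric are all stand-ins for the far larger datasets and statistical tests (e.g., WEAT-style association scores) used in practice.

```python
from collections import Counter

# Toy stand-in for internet-scale training text. Real audits run the
# same kind of statistic over billions of tokens.
corpus = [
    "the doctor said he would call back",
    "the doctor said he was busy",
    "the nurse said she would help",
    "the nurse said she was kind",
]

def pronoun_counts(corpus, occupation):
    """Count gendered pronouns in sentences that mention an occupation."""
    counts = Counter()
    for sentence in corpus:
        tokens = sentence.split()
        if occupation in tokens:
            counts["he"] += tokens.count("he")
            counts["she"] += tokens.count("she")
    return counts

def association_skew(corpus, occupation):
    """Fraction of gendered pronouns that are 'he' near an occupation.
    0.5 is balanced; values near 0.0 or 1.0 indicate strong skew."""
    c = pronoun_counts(corpus, occupation)
    total = c["he"] + c["she"]
    return c["he"] / total if total else 0.5

print(association_skew(corpus, "doctor"))  # 1.0: fully male-associated
print(association_skew(corpus, "nurse"))   # 0.0: fully female-associated
```

A model trained on text with these skewed co-occurrence statistics tends to reproduce them, which is the mechanism the paragraph above describes.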
Post-training (e.g., RLHF) can partially correct these biases, but the rater pools that produce preference data are themselves non-representative, often skewed toward English-speaking, Western, educated populations.
Mitigations include diverse training-data curation, bias-auditing benchmarks, adversarial testing, and RLHF objectives that explicitly target bias reduction; none of these fully solves the problem.
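As one concrete example of data-curation mitigation, counterfactual data augmentation pairs each training sentence with a gender-swapped copy so both variants appear equally often. The sketch below uses a deliberately small swap lexicon and ignores case and grammatical ambiguity (English "her" maps to both "him" and "his"), which real implementations must handle.

```python
import re

# Illustrative subset of a swap lexicon; a production list is much
# larger and disambiguates words like "her" by part of speech.
SWAPS = {"he": "she", "she": "he", "him": "her",
         "his": "her", "man": "woman", "woman": "man"}

def counterfactual(sentence):
    """Return the sentence with each gendered term swapped for its pair."""
    def swap(match):
        word = match.group(0)
        return SWAPS.get(word.lower(), word)
    return re.sub(r"\b\w+\b", swap, sentence)

def augment(corpus):
    """Pair every sentence with its gender-swapped counterfactual."""
    return [variant for s in corpus for variant in (s, counterfactual(s))]

print(augment(["the doctor said he was busy"]))
# ['the doctor said he was busy', 'the doctor said she was busy']
```

Augmentation like this rebalances surface co-occurrence statistics, but it cannot fix biases that live in what the data talks about rather than how, which is one reason no single mitigation fully solves the problem.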