SOLID STATE PRESS
← Back to catalog
Simpson's Paradox cover
Buy on Amazon
US list price $2.99
Mathematics

Simpson's Paradox

Reversed Trends, Lurking Variables, and the Berkeley Admissions Case — A TLDR Primer

You're staring at a data set that seems to say one thing — then someone splits it by group and it says the exact opposite. Which answer do you trust? That unsettling situation has a name: **Simpson's Paradox**, and it has derailed medical studies, court cases, and careers in data science. If you've hit it in an AP Statistics course, a college intro class, or a data-analysis project and found the textbook explanation more confusing than the paradox itself, this guide is for you.

This TLDR primer walks you through Simpson's Paradox from the ground up — no filler, no multi-chapter detour into unrelated probability theory. You'll see a clean numerical example that makes the reversal undeniable, then get the weighted-average arithmetic that explains exactly *why* it happens. From there the guide introduces the causal language you need — lurking variables, confounding, aggregation bias — so you can reason about the paradox, not just recognize it.

The centerpiece is the famous 1973 UC Berkeley admissions case: a real-world instance where pooled data suggested gender discrimination, but disaggregated data told a more complicated story. Understanding what researchers actually found — and what it means for causal inference — is the kind of statistical reasoning that separates strong data thinkers from people who just run numbers.

The guide closes with practical decision rules for when to pool versus split data, and a quick tour of where Simpson's Paradox appears in medicine, sports analytics, public policy, and machine learning.

Short by design, built for the student who needs to understand this concept — not skim it. Grab your copy and close the gap.

What you'll learn
  • Recognize Simpson's Paradox in two-way tables and rate comparisons
  • Compute and compare conditional vs. marginal rates correctly
  • Identify lurking (confounding) variables that drive the reversal
  • Decide when to pool data and when to keep it disaggregated
  • Apply the paradox to real cases like UC Berkeley admissions and medical treatments
What's inside
  1. 1. The Paradox in One Picture
    Introduces Simpson's Paradox with a small, concrete numerical example showing a trend that reverses when data is split into groups.
  2. 2. Rates, Weights, and Why the Reversal Happens
    Unpacks the arithmetic of weighted averages to show mechanically how unequal group sizes can flip an overall comparison.
  3. 3. Lurking Variables and Confounding
    Introduces the causal language of lurking and confounding variables that explain why subgroup splits matter.
  4. 4. Case Study: UC Berkeley Admissions (1973)
    Walks through the famous Berkeley gender-bias case to show Simpson's Paradox at full scale and what the reversal actually meant.
  5. 5. Should You Pool or Split? Making the Right Call
    Gives practical rules for deciding whether the aggregated or disaggregated view answers the question you actually care about.
  6. 6. Where the Paradox Shows Up and Why It Matters
    Quick tour of Simpson's Paradox in medicine, sports stats, public policy, and machine learning, and what to watch for.
Published by Solid State Press · June 2026
Simpson's Paradox cover
TLDR STUDY GUIDES

Simpson's Paradox

Reversed Trends, Lurking Variables, and the Berkeley Admissions Case — A TLDR Primer
Solid State Press

Contents

  1. 1 The Paradox in One Picture
  2. 2 Rates, Weights, and Why the Reversal Happens
  3. 3 Lurking Variables and Confounding
  4. 4 Case Study: UC Berkeley Admissions (1973)
  5. 5 Should You Pool or Split? Making the Right Call
  6. 6 Where the Paradox Shows Up and Why It Matters
Chapter 1

The Paradox in One Picture

Imagine a hospital releases data showing that patients who received a new drug survived at a higher rate than patients who did not. You look closer and find that for mild cases, the old treatment actually does better — and for severe cases, the old treatment does better again. So the drug loses in every subgroup, yet somehow wins overall. That is not a rounding error or a trick question. It is Simpson's Paradox: a real, arithmetic phenomenon where a trend that appears in pooled data disappears or reverses when you break the data into groups.

The paradox shows up across medicine, sports, law, and public policy. Before diving into those cases, it helps to see it happen in the smallest possible example — one you can check with a calculator in two minutes.

About This Book

If you are working through an AP Statistics supplemental study guide, sitting in an introductory college statistics or data science course, or simply trying to make sense of contradictory numbers in the news, this book is for you. It is written for high school juniors and seniors and for college freshmen who want a clear, honest explanation of one of statistics' most counterintuitive results.

This is a concise statistics study guide for high school and early-college readers covering everything from how pooled rates mislead to confounding variables in statistics, lurking variables and statistical bias, and the famous Berkeley admissions gender bias statistics case of 1973. It is a focused primer on data analysis concepts for beginners — tight and short by design.

Read it straight through to follow the logic, then work the examples as they appear. A practice problem set closes the book so you can test whether Simpson's paradox explained for students has fully clicked.

Keep reading

You've read the first half of Chapter 1. The complete book covers 6 chapters in roughly fifteen pages — readable in one sitting.

Continue reading on Amazon