Statistical Power and Sample Size
Type II Error, Effect Size, and How Big Your Study Needs to Be — A TLDR Primer
Statistical power is one of the most tested and most misunderstood ideas in introductory statistics — and most textbooks bury it under pages of theory before getting to the point. This concise primer cuts straight to what you actually need to know.
Whether you're preparing for an AP Statistics exam, working through an intro stats course, or trying to make sense of why a study's results don't replicate, this guide covers the full picture: null and alternative hypotheses, Type I and Type II error, and exactly what it means when a test has "80% power." From there, it walks through the four levers that push power up or down — effect size, sample size, alpha, and variability — with concrete numbers, not hand-waving.
You'll learn how to measure effect size using Cohen's d and its equivalents for proportions, and you'll work through complete sample-size calculations for two-sample mean tests and proportion tests, step by step. The final section connects all of it to real research: what underpowered studies look like, why the replication crisis happened, and how to spot the post hoc power fallacy when you read a paper.
This guide is short by design. Every section leads with the one sentence you need to take away, follows with worked examples, and calls out the misconceptions students get wrong most often. No filler, no detours — just the concepts, the formulas, and the intuition to use them.
If statistical power has felt slippery until now, start here.
- Define statistical power and explain its relationship to Type I and Type II error
- Identify the four levers that determine power: effect size, sample size, significance level, and variability
- Compute and interpret Cohen's d and other standardized effect sizes
- Calculate the required sample size for a two-sample test of means or proportions
- Critique studies as 'underpowered' and recognize the consequences for published research
- 1. Hypothesis Testing in 90 Seconds: The Setup You NeedRefreshes null/alternative hypotheses, p-values, and Type I vs Type II errors so the rest of the book has solid footing.
- 2. What Statistical Power Actually IsDefines power as 1 minus beta, gives the intuition with overlapping distributions, and explains why 0.80 became the convention.
- 3. The Four Levers: What Power Depends OnWalks through how effect size, sample size, alpha, and variability each push power up or down, with worked numerical intuition.
- 4. Measuring Effect Size: Cohen's d and FriendsIntroduces standardized effect sizes for mean differences and proportions, with rules of thumb and worked examples.
- 5. Calculating the Sample Size You NeedDerives and applies the sample-size formula for a two-sample test of means and a test of proportions, with full worked examples.
- 6. Underpowered Studies and Why Power Matters in the Real WorldExplains the replication crisis, the danger of small-n studies, post hoc power fallacy, and what to look for when reading research.