Linear Regression and Correlation
A High School & College Primer on Lines, Scatter, and Prediction
Staring down a unit on regression in AP Statistics or Algebra 2 — and the textbook explanation somehow made it worse? This guide cuts straight to what you actually need: how to read a scatterplot, what the correlation coefficient really tells you (and what it doesn't), and how to build and interpret a least-squares regression line from scratch.
**TLDR: Linear Regression and Correlation** covers the full single-predictor regression sequence taught in Algebra 2, AP Statistics, and introductory college stats courses. You'll learn to describe relationships between two variables, compute and interpret *r*, write and use a regression equation, evaluate fit with residuals and *r²*, make predictions without falling into the extrapolation trap, and explain — clearly — why correlation is not causation. Every section leads with the one idea that matters most, works through concrete numbers, and flags the mistakes students make most often.
At roughly 15 pages, this is not a replacement for your class notes or textbook. It's the focused primer you read the night before a test, the quick reference your tutor pulls up before a session, or the plain-language explainer a parent uses to actually help their kid. If you've searched for a straightforward guide to the *ap statistics linear regression* topic or need the *least squares regression line formula* demystified in plain English, this is the book.
Grab it, read it in one sitting, and walk into your exam knowing exactly what you're doing.
- Read a scatterplot and describe the form, direction, and strength of a relationship
- Compute and interpret the correlation coefficient r
- Find the least-squares regression line and interpret its slope and intercept in context
- Use a regression line to predict, and recognize when prediction is unreliable (extrapolation, outliers)
- Interpret r-squared and residuals to judge how well the line fits
- Distinguish correlation from causation and identify common pitfalls
- 1. Scatterplots and the Idea of a RelationshipIntroduces bivariate data, scatterplots, and the vocabulary (form, direction, strength) used to describe relationships between two quantitative variables.
- 2. The Correlation Coefficient rDefines r, shows how to compute and interpret it, and explains what r does and does not measure.
- 3. The Least-Squares Regression LineDerives the formulas for slope and intercept, shows a worked example, and interprets each piece in context.
- 4. How Good Is the Line? Residuals and r-squaredUses residual plots and the coefficient of determination to assess fit and detect when a linear model is the wrong choice.
- 5. Prediction, Extrapolation, and Influential PointsShows how to use the regression line to predict, and warns about extrapolation, outliers, and high-leverage points.
- 6. Correlation Is Not CausationExplains lurking variables, confounding, and why association alone never proves cause — with classic examples.