SOLID STATE PRESS
← Back to catalog
Linear Regression and Correlation cover
Coming soon
Coming soon to Amazon
This title is in our publishing queue.
Browse available titles
Mathematics

Linear Regression and Correlation

A High School & College Primer on Lines, Scatter, and Prediction

Staring down a unit on regression in AP Statistics or Algebra 2 — and the textbook explanation somehow made it worse? This guide cuts straight to what you actually need: how to read a scatterplot, what the correlation coefficient really tells you (and what it doesn't), and how to build and interpret a least-squares regression line from scratch.

**TLDR: Linear Regression and Correlation** covers the full single-predictor regression sequence taught in Algebra 2, AP Statistics, and introductory college stats courses. You'll learn to describe relationships between two variables, compute and interpret *r*, write and use a regression equation, evaluate fit with residuals and *r²*, make predictions without falling into the extrapolation trap, and explain — clearly — why correlation is not causation. Every section leads with the one idea that matters most, works through concrete numbers, and flags the mistakes students make most often.

At roughly 15 pages, this is not a replacement for your class notes or textbook. It's the focused primer you read the night before a test, the quick reference your tutor pulls up before a session, or the plain-language explainer a parent uses to actually help their kid. If you've searched for a straightforward guide to the *ap statistics linear regression* topic or need the *least squares regression line formula* demystified in plain English, this is the book.

Grab it, read it in one sitting, and walk into your exam knowing exactly what you're doing.

What you'll learn
  • Read a scatterplot and describe the form, direction, and strength of a relationship
  • Compute and interpret the correlation coefficient r
  • Find the least-squares regression line and interpret its slope and intercept in context
  • Use a regression line to predict, and recognize when prediction is unreliable (extrapolation, outliers)
  • Interpret r-squared and residuals to judge how well the line fits
  • Distinguish correlation from causation and identify common pitfalls
What's inside
  1. 1. Scatterplots and the Idea of a Relationship
    Introduces bivariate data, scatterplots, and the vocabulary (form, direction, strength) used to describe relationships between two quantitative variables.
  2. 2. The Correlation Coefficient r
    Defines r, shows how to compute and interpret it, and explains what r does and does not measure.
  3. 3. The Least-Squares Regression Line
    Derives the formulas for slope and intercept, shows a worked example, and interprets each piece in context.
  4. 4. How Good Is the Line? Residuals and r-squared
    Uses residual plots and the coefficient of determination to assess fit and detect when a linear model is the wrong choice.
  5. 5. Prediction, Extrapolation, and Influential Points
    Shows how to use the regression line to predict, and warns about extrapolation, outliers, and high-leverage points.
  6. 6. Correlation Is Not Causation
    Explains lurking variables, confounding, and why association alone never proves cause — with classic examples.
Published by Solid State Press
Linear Regression and Correlation cover
TLDR STUDY GUIDES

Linear Regression and Correlation

A High School & College Primer on Lines, Scatter, and Prediction
Solid State Press

Who This Book Is For

If you're a high school student who needs an AP Statistics linear regression study guide, a sophomore working through Algebra 2 scatter plots and line of best fit, or a college freshman looking for a quick intro college stats regression review before an exam, this book was written for you. It also works for tutors prepping a session and parents who want to follow along.

This guide covers the core ideas in one tight sequence: reading scatter plots, getting the correlation coefficient explained in plain student-friendly language, learning how to find the least-squares regression line by hand and by formula, and understanding residuals and r-squared explained simply enough to actually use them. About 15 pages, zero filler.

Read straight through in one sitting — the concepts build on each other. Work every example as you go, then try the problem set at the end. The final section on correlation vs. causation examples will prepare you for the tricky interpretation questions that show up on almost every high school and college statistics exam.

Contents

  1. 1 Scatterplots and the Idea of a Relationship
  2. 2 The Correlation Coefficient r
  3. 3 The Least-Squares Regression Line
  4. 4 How Good Is the Line? Residuals and r-squared
  5. 5 Prediction, Extrapolation, and Influential Points
  6. 6 Correlation Is Not Causation
Chapter 1

Scatterplots and the Idea of a Relationship

When you have two measurements on each individual — say, a student's hours studied and their exam score — you have bivariate data. The goal is to see whether the two measurements move together in a predictable way. Before you calculate anything, you look.

A scatterplot is a graph where each individual becomes one dot. One variable goes on the horizontal axis, the other on the vertical axis, and you plot every pair $(x, y)$. The pattern of dots — or the lack of one — tells you more in three seconds than a table of numbers ever could.

Which variable goes where?

Not every problem cares about direction, but many do. When one variable might explain or predict the other, put the explanatory variable (also called the predictor or independent variable) on the horizontal axis and the response variable (the outcome you care about) on the vertical axis. For example, if you think hours of sleep explains reaction time, sleep goes on the $x$-axis and reaction time on the $y$-axis.

A common mistake is to think the labels "explanatory" and "response" imply causation — they don't. They only describe how you've chosen to set up the prediction. Section 6 returns to this point in detail.

Reading a scatterplot: form, direction, strength

Once you have a scatterplot, describe it with three words.

Form is the shape of the pattern. The most important distinction is linear versus nonlinear. If the dots cluster around a straight line, the form is linear. If they curve — like the path of a thrown ball — the form is nonlinear. Everything in this book assumes a linear form, so the first thing you should do with any scatterplot is check whether that assumption is reasonable.

Keep reading

You've read the first half of Chapter 1. The complete book covers 6 chapters in roughly fifteen pages — readable in one sitting.

Coming soon to Amazon