SOLID STATE PRESS
← Back to catalog
Residual Analysis cover
Coming soon
Coming soon to Amazon
This title is in our publishing queue.
Browse available titles
Mathematics

Residual Analysis

Reading Residual Plots, Spotting Bad Fits, and Checking Linear Regression Assumptions — A TLDR Primer

Your statistics teacher put up a residual plot and asked whether the linear model fits. You nodded — but you weren't sure what you were actually looking for. That gap is exactly what this guide closes.

**Residual Analysis** is a concise, no-filler primer on one of the most tested and most misunderstood pieces of introductory statistics: how to use residuals to judge whether a linear regression model is doing its job. It covers how to compute residuals by hand, how to build and read a residual plot, and how to recognize the three patterns — curvature, fanning, and clustering — that tell you something has gone wrong. From there it connects those visual patterns to the four formal assumptions behind linear regression (linearity, independence, normality, and equal variance), so the diagnostics actually make sense instead of feeling like a checklist.

The guide also untangles the concepts students most often confuse: the difference between an outlier in *y* and a high-leverage point in *x*, when a single data point actually shifts the regression line, and why a high R-squared doesn't automatically mean a good model. It closes with practical next steps — variable transformations, polynomial terms, and honest guidance on when linear regression simply isn't the right tool.

Written for AP Statistics students, introductory college stats courses, and anyone who needs to check linear regression assumptions without slogging through a door-stopper textbook. Short by design, worked examples included, no filler.

If residual plots have ever left you guessing, grab this guide and stop guessing.

What you'll learn
  • Compute residuals from a regression line and interpret their sign and size
  • Read residual plots to detect nonlinearity, non-constant variance, and outliers
  • Distinguish residuals from errors, and outliers from influential points
  • Use residual standard error and R-squared together to assess fit
  • Recognize when a linear model is inappropriate and what to try next
What's inside
  1. 1. What a Residual Actually Is
    Defines residuals as observed minus predicted values, distinguishes them from true errors, and walks through computing them by hand from a small dataset.
  2. 2. Residual Plots and What They Reveal
    Shows how to build a residual plot, what a 'good' random scatter looks like, and how to spot curvature, fanning, and clustering patterns.
  3. 3. The Four Assumptions Residuals Help You Check
    Connects residual patterns to the linearity, independence, normality, and equal-variance assumptions behind linear regression.
  4. 4. Outliers, Leverage, and Influential Points
    Distinguishes large residuals (outliers in y) from high-leverage points (outliers in x) and explains when a single point actually changes the regression line.
  5. 5. Putting Numbers on Fit: RSE, R-squared, and Residual Sum of Squares
    Shows how residuals roll up into the headline fit statistics, why R-squared alone can mislead, and how to read RSE in the units of the response variable.
  6. 6. When the Residuals Say 'Not Linear' — What to Do Next
    Practical next steps when residual analysis fails: transformations, polynomial terms, and knowing when to abandon linear regression entirely.
Published by Solid State Press
Residual Analysis cover
TLDR STUDY GUIDES

Residual Analysis

Reading Residual Plots, Spotting Bad Fits, and Checking Linear Regression Assumptions — A TLDR Primer
Solid State Press

Contents

  1. 1 What a Residual Actually Is
  2. 2 Residual Plots and What They Reveal
  3. 3 The Four Assumptions Residuals Help You Check
  4. 4 Outliers, Leverage, and Influential Points
  5. 5 Putting Numbers on Fit: RSE, R-squared, and Residual Sum of Squares
  6. 6 When the Residuals Say 'Not Linear' — What to Do Next
Chapter 1

What a Residual Actually Is

Every regression line makes a prediction. A residual is the gap between what the line predicted and what you actually observed — the leftover that the model could not explain.

More precisely, if $y_i$ is the observed value (the real data point) and $\hat{y}_i$ is the predicted value (what the regression line says $y$ should be at that $x$), then the residual for observation $i$ is:

$e_i = y_i - \hat{y}_i$

Order matters here. Observed minus predicted, not the other way around. A positive residual means the real value sits above the line — the model underestimated. A negative residual means the real value sits below the line — the model overestimated. A residual of zero means the point falls exactly on the line.

Where the regression line comes from

Before you can compute a residual, you need a line to subtract from. In linear regression, that line is the least squares line — the unique line that minimizes the sum of all the squared residuals. Squaring prevents positive and negative gaps from canceling each other out, and minimizing that total sum forces the line to be as close as possible to all the points at once. You will often see it written as:

$\hat{y} = b_0 + b_1 x$

where $b_0$ is the intercept (the predicted value of $y$ when $x = 0$) and $b_1$ is the slope (how much $\hat{y}$ changes for each one-unit increase in $x$). The line is computed from your data; the residuals then measure how well it did.

Residuals versus errors — a distinction that matters

A common mistake is to treat "residual" and "error" as interchangeable. They are not.

A true error (written $\varepsilon_i$, the Greek letter epsilon) is the gap between an observed value and the true underlying relationship — the one that would exist if you knew the exact equation governing the population. You never observe that true relationship. It is theoretical.

A residual $e_i$ is the gap between an observed value and your estimated regression line — the one built from your sample. Residuals are things you can calculate. Errors are things you can only model.

About This Book

If you're a high school student who needs residual plots explained for statistics class, a student enrolled in AP Statistics or an intro college stats course, or a tutor prepping a session on linear regression diagnostics, this study guide is for you. It is also for anyone who has run a regression, gotten an output, and had no idea whether to trust it.

This book covers how to compute and interpret residuals, what patterns in residual plots reveal about your model, and how to check linear regression assumptions step by step. You will learn the difference between r-squared vs residual standard error, how to identify outliers, leverage, and influential points in regression, and when to use transformations in regression analysis. Concise and focused, with no filler.

Read it straight through — the sections build on each other. Work every example as you go, then use the problem set at the end to confirm your understanding before an exam or assignment.

Keep reading

You've read the first half of Chapter 1. The complete book covers 6 chapters in roughly fifteen pages — readable in one sitting.

Coming soon to Amazon