SOLID STATE PRESS
Coming soon to Amazon
This title is in our publishing queue.
Artificial Intelligence

Classical ML: Decision Trees, Random Forests, and k-Means

A High School & College Primer on the Algorithms That Still Matter

You just hit a unit on machine learning and the textbook reads like a research paper. The lecture moved fast. The exam is coming. What you need is someone to sit down with you and explain how these algorithms actually work — no fluff, no PhD prerequisites.

This TLDR guide covers the three classical ML algorithms that show up everywhere: **decision trees**, **random forests**, and **k-means clustering**. You will learn how a decision tree picks its splits using Gini impurity, why combining hundreds of weak trees into a random forest beats any single perfect tree, and how k-means finds hidden structure in data without any labels at all. Each algorithm is walked through with concrete numbers and worked examples, not hand-waving.

This machine learning guide for beginners is written for US high school students (grades 9–12) and college freshmen or sophomores taking their first AI, data science, or computer science course. It also covers the evaluation layer — train/test splits, confusion matrices, accuracy's blind spots — so you know not just how each algorithm works but how to judge whether it's working well and when to reach for which tool.

At roughly 15 pages, it is short by design. Every sentence earns its place. Parents helping a student, tutors prepping a session, and self-studiers who want a focused machine learning study guide without wading through a 600-page textbook will all find it useful.

Pick it up, read it once, and walk into class ready.

What you'll learn
  • Distinguish supervised from unsupervised learning and identify which algorithms fit each setting
  • Build a decision tree by hand using Gini impurity or information gain and explain how splits are chosen
  • Explain how bagging and feature randomness turn weak trees into a strong random forest
  • Run the k-means algorithm by hand on a small dataset and recognize when it fails
  • Evaluate models using train/test splits, accuracy, and the elbow method, and recognize overfitting
What's inside
  1. What Classical ML Is (and What It Isn't)
    Orients the reader to supervised vs. unsupervised learning, situates the three algorithms, and contrasts classical ML with deep learning.
  2. Decision Trees: Splitting Your Way to an Answer
    Walks through how a decision tree is built one split at a time, using Gini impurity and information gain on a worked example.
  3. Random Forests: Why Many Weak Trees Beat One Strong One
    Explains bagging, feature subsampling, and majority voting, and why ensembling reduces variance without much added bias.
  4. k-Means: Finding Groups Without Labels
    Steps through the k-means algorithm on a small 2D dataset, covers initialization issues, and shows how to pick k with the elbow method.
  5. Evaluating, Comparing, and Choosing
    Covers how to actually judge these models in practice: train/test splits, accuracy and its limits, confusion matrices, and when to reach for which algorithm.
Published by Solid State Press
TLDR STUDY GUIDES

Classical ML: Decision Trees, Random Forests, and k-Means

A High School & College Primer on the Algorithms That Still Matter
Solid State Press

Who This Book Is For

If you're taking an intro to machine learning course in high school or college, preparing for a data science unit in AP Computer Science Principles, or just trying to make sense of AI concepts as a college freshman encountering them for the first time, this book is for you. It's also useful for self-studiers who keep hearing terms like "random forest" or "clustering" and want a straight answer about what they actually mean.

This guide covers the three classical machine learning algorithms that show up most often in coursework and interviews: decision trees and random forests, explained simply enough to build real intuition, plus k-means clustering for students who have never encountered unsupervised learning before. Along the way, you'll get a clear primer on supervised vs. unsupervised learning and learn how to evaluate and compare models. About 15 pages, no filler.

Read straight through once, then work every numbered example by hand. Finish with the practice problems at the end — that's where the ideas stick.

Contents

  1. What Classical ML Is (and What It Isn't)
  2. Decision Trees: Splitting Your Way to an Answer
  3. Random Forests: Why Many Weak Trees Beat One Strong One
  4. k-Means: Finding Groups Without Labels
  5. Evaluating, Comparing, and Choosing
Chapter 1

What Classical ML Is (and What It Isn't)

Every useful machine learning algorithm starts with the same question: what do you already know, and what are you trying to find out? The answer to that question determines almost everything else — which algorithm to reach for, how to set up your data, and how to know if your model worked.

Machine learning is the practice of training a computer program to make predictions or find patterns by exposing it to data, rather than by writing explicit rules. Instead of coding "if the email contains the word 'prize' and comes from an unknown sender, mark it spam," you show the program thousands of labeled emails and let it figure out the rules itself.
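To make the contrast concrete, here is a minimal sketch in Python (scikit-learn assumed; the tiny dataset and the "prize"/sender features are invented for illustration, not taken from the book):

```python
from sklearn.tree import DecisionTreeClassifier

# Hand-written rule: a human encodes the logic explicitly.
def spam_by_rules(email):
    return "prize" in email["text"].lower() and not email["sender_known"]

print(spam_by_rules({"text": "Claim your PRIZE now", "sender_known": False}))  # True

# Learned rule: show the algorithm labeled examples and let it infer the logic.
# Features per email: [contains_prize, sender_known]; labels: 1 = spam, 0 = not.
X = [[1, 0], [1, 1], [0, 0], [0, 1]]
y = [1, 0, 0, 0]
model = DecisionTreeClassifier().fit(X, y)
print(model.predict([[1, 0]]))  # -> [1]; the tree recovered the rule from data
```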

Supervised vs. Unsupervised Learning

The single most important split in all of machine learning is between supervised learning and unsupervised learning. Everything else follows from it.

In supervised learning, every training example comes with a correct answer attached. You give the algorithm features — the measurable properties of each example — alongside a label — the correct output you want the model to predict. A feature might be a patient's age, blood pressure, and cholesterol level. The label might be whether that patient developed heart disease within five years. The algorithm's job is to learn a mapping from features to labels so it can predict the label for new patients it has never seen.
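Here is what that setup looks like in code, a minimal sketch assuming scikit-learn, with invented numbers standing in for the patient example above:

```python
from sklearn.tree import DecisionTreeClassifier

# Features: [age, blood pressure, cholesterol] for each patient (made-up values).
X = [
    [63, 145, 233],
    [41, 120, 180],
    [57, 150, 260],
    [29, 110, 170],
]
# Labels: 1 = developed heart disease within five years, 0 = did not.
y = [1, 0, 1, 0]

model = DecisionTreeClassifier().fit(X, y)  # learn the features -> label mapping
print(model.predict([[50, 140, 250]]))      # predict for a patient never seen before
```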

Supervised learning splits further into two tasks. Classification means predicting a category — spam or not spam, malignant or benign, which digit (0–9) appears in an image. Regression means predicting a continuous number — tomorrow's temperature, a house's sale price, how many hours a battery will last. Both tasks use labeled training data; they just differ in what kind of output they're learning to produce.
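The same tree machinery handles both tasks; only the output type changes. A minimal sketch, again assuming scikit-learn and invented data:

```python
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

X = [[600], [900], [1500], [2200]]  # one feature: house size in square feet

# Classification: the label is a category.
clf = DecisionTreeClassifier().fit(X, ["small", "small", "large", "large"])
print(clf.predict([[1200]]))  # -> a category, e.g. "small" or "large"

# Regression: the label is a continuous number (sale price in $1000s).
reg = DecisionTreeRegressor().fit(X, [150, 210, 330, 450])
print(reg.predict([[1200]]))  # -> a number
```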


You've read the first half of Chapter 1. The complete book covers five chapters in roughly fifteen pages — readable in one sitting.
