Patterns, Predictions, and Actions (2025)
I’ll be live blogging my graduate course on machine learning this semester (Fall 2025). The course is based on the text Patterns, Predictions, and Actions by Moritz Hardt and myself.
This course will explore how patterns in data support predictions and consequential actions. Starting with the foundations of prediction, we look at the foundational optimization theory used to automate decision-making. We then turn to supervised learning, covering representation, optimization, and generalization as its key constituents. We will discuss datasets as benchmarks, examining their histories and scientific bases. We will then cover the related principles of statistical evaluation, drawing a through line from confidence intervals to AB testing to bandits to reinforcement learning. Throughout the course, we will draw upon connections to historical context, contemporary practice, and societal impact.
Lecture 1: Introduction
Slides (I'm not sure if these are helpful without my narration.)
Lecture 2: Rudiments of Prediction
Lecture 3: Prediction from Samples (without features)
Recht, Benjamin (2025) “The Actuary’s Final Word on Algorithmic Decision-Making.” arXiv:2509.04546. [slides]
Lecture 4: Decision Theory
Reading: Chapter 2 of PPA.
Lecture 5: Errors, Operating Characteristics, and Tradeoffs.
Reading: Chapter 2 of PPA.
Lecture 6: Fairness and trade-offs (by Jessica Dai)
Reading: PPA Chapter 2.
Hardt et al. Equality of Opportunity in Supervised Learning.
Kleinberg et al. Inherent Trade-Offs in the Fair Determination of Risk Scores.
Lecture 7: The Perceptron
Reading: PPA Chapter 3.
Lecture 8: Numerically representing data
Reading: PPA Chapter 4.
Lecture 9: Nonlinearity and approximation
Reading: PPA Chapter 4.
Lecture. 10: Stochastic Gradient Descent
Reading: PPA Chapter 5.
Lecture 11: Analysis of the Stochastic Gradient Method
Reading: PPA Chapter 5.
Lecture 12: Theories of generalization
Reading: PPA Chapter 6. I’m not sure I actually recommend reading this.
Lecture 13: Datasets as benchmarks
Reading: PPA Chapter 8.
Lecture 14: Robustness and fragilities of dataset benchmarks
Desirable Sins (see the bottom of the post for too much extra reading)
Lecture 15: Uncertainty Quantification
Lecture 16: Prediction Intervals
The infamous Vovk paper on conditional conformal prediction.
Lecture 17: Randomized experiments
Lecture 18: The impact of actions
PPA Chapter 9
Relevant old blog: Fractions or Laws of Nature?
Additional Reading: A bureaucratic theory of statistics.
Some slides on the bureaucratic nature of RCTs.
Lecture 19: Adaptive experimentation
(i.e., multiarmed bandits as in PPA Chapter 12)
Auer, P., Ortner, R. UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem. Period Math Hung 61, 55–65 (2010). https://doi.org/10.1007/s10998-010-3055-6
Lecture 20: Algorithms for online learning
Lecture 21: Generative Models
Note to self: I need to write up my class notes on this. I went slightly more in depth than in the blog post

