Discussion about this post

User's avatar
Matt Hoffman's avatar

I'm a little confused by the free-throw example. Usually we care about a forecaster's average error, not the error of their average prediction. We judge weather forecasts based on their average error, not based on the error of the average forecast temperature. We wouldn't consider it good forecasting if Nate Silver's presidential-election predictions for 2004–2024 were Kerry, McCain, Romney, Clinton, Trump, and Harris, even though he would have gotten the right number of Republicans.

On a more technical level, I'm sure I'm being extremely dense, but how do you get the final step in the bound after equation (3)? Obviously

||F_t|| ≤ M

implies

Σ_t ||F_t||^2 ≤ TM^2,

but it seems like the claim is that

Σ_t ||F_t||^2 ≤ M^2?

Expand full comment
brutalist's avatar

Search heuristics for predicting the next value in a sequence which serve as a discursive substitutes for insight - might there also be a connection here with the purported capabilities of another set of high-profile "next token predictors?"

Expand full comment
16 more comments...

No posts