arg min
Subscribe
Sign in
Home
Lecture Blogs
Collections
Archive
About
Latest
Top
Discussions
Machine Learning Evaluation - A Syllabus
What we read this semester, and what worked, and what didn't.
21 hrs ago
•
Ben Recht
16
Share this post
arg min
Machine Learning Evaluation - A Syllabus
Copy link
Facebook
Email
Notes
More
3
Rossi's Metallic Rules
Why do evaluations tend to find that social programs don't work?
Apr 22
•
Ben Recht
11
Share this post
arg min
Rossi's Metallic Rules
Copy link
Facebook
Email
Notes
More
8
Pretending (Not) to Count
Datafication, online societies, and the war on academia.
Apr 18
•
Ben Recht
33
Share this post
arg min
Pretending (Not) to Count
Copy link
Facebook
Email
Notes
More
9
Maybe just believing in AGI makes AGI exist.
Kill the wise one!
Apr 14
•
Ben Recht
61
Share this post
arg min
Maybe just believing in AGI makes AGI exist.
Copy link
Facebook
Email
Notes
More
6
Evaluation or Valuation
The infinite regress of evaluating large language models
Apr 10
•
Ben Recht
23
Share this post
arg min
Evaluation or Valuation
Copy link
Facebook
Email
Notes
More
14
Demo Or Die
Where do demonstrations fit in the complex world of system evaluation?
Apr 8
•
Ben Recht
31
Share this post
arg min
Demo Or Die
Copy link
Facebook
Email
Notes
More
March 2025
baby, it's cold inside
celebrating the reissue of an ambient deep cut
Mar 28
•
Ben Recht
12
Share this post
arg min
baby, it's cold inside
Copy link
Facebook
Email
Notes
More
1
All bets are off
All decisions are made under uncertainty. Almost no decisions are gambling.
Mar 25
•
Ben Recht
31
Share this post
arg min
All bets are off
Copy link
Facebook
Email
Notes
More
14
Stochastic Coherence
Deriving the laws of probability through superforecasting
Mar 20
•
Ben Recht
16
Share this post
arg min
Stochastic Coherence
Copy link
Facebook
Email
Notes
More
6
I think it's gonna rain...
The paradoxes of calibrated forecasting.
Mar 18
•
Ben Recht
12
Share this post
arg min
I think it's gonna rain...
Copy link
Facebook
Email
Notes
More
11
Gambling on the Richter Scale
Risk aversion when evaluating risk scores
Mar 17
•
Ben Recht
11
Share this post
arg min
Gambling on the Richter Scale
Copy link
Facebook
Email
Notes
More
16
Appraising Tea Leaves
How do we evaluate probabilistic assertions?
Mar 14
•
Ben Recht
22
Share this post
arg min
Appraising Tea Leaves
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts