Discussion about this post

User's avatar
Zach's avatar

John Venn wrote 'The Logic of Chance' in 1866 where he mentions statistical regularity increases as you collect more data over time, but that the regularity will break down after enough time has passed. This was in the context of lifespan and heights.

Expand full comment
Kevin M's avatar

Forgive me if this sounds ignorant, but in CS281A, we had about 2 lectures (maybe more?) going over generalization error in machine learning. From what I recall (and also what wikipedia defines generalization error as), it is supposed to be "a measure of how accurately an algorithm is able to predict outcome values for previously unseen data". And of course, I remember you did not speak very highly of the conventions that arise from the machine learning community as a result of some of those theories. In this blog, however, you argue external validity seems to be moreso a philosophical question based on your beliefs attached to a strong conceptual model, which as a result appear more qualitative than quantitative. So my question and asking for your opinions/thoughts on this is: what did we learn from generalization error that is useful for us to takeaway when we utilize machine learning algorithms in real world applications given this blog seems to handwave that it was never all that useful?

Expand full comment
4 more comments...

No posts