This isn't a comment about the ML part but about movies. Netflix was part of the trends that killed movie stores. There are so many good movies out there, and movie stores were how you used to find them. Netflix, when it started, had a lot of good movies on the platform. Now they have a few gems that they keep recommending I rewatch. The overall set of quality films is now spread out over any number of competing platforms. You have to fight and search all over the place to find anything worth watching. It is a good example of enshittification caused by tech.
The only subscription I pay for these days is Criterion, on the recommendation of a cinephile friend. Most of their catalog is old, but I like discovering this stuff. We watched "The Devil's Eye" by Ingmar Bergman last night.
Yeah, it's sad how this recsys-driven business has reduced choice and availability.
Given the dimensions of the dataset, Netflix had nearly 18,000 movies in its catalog in 2006. Today they offer around 4,000. That's crazy.
"Is all we got out of activism the honor of clicking to accept cookies?"
No, we also got increased barriers to entry, leading directly to centralization, censorship, and the destruction of all that was once good about the internet.
I was on the 2nd place team in the Netflix Prize (The Ensemble). I don't think your footnote re: Salakhutdinov and Hinton is correct. Restricted Boltzmann Machines were in the Pragmatic Theory subteam's portion of the winning team's blend. See page 39 and reference 5 here:
https://www.asc.ohio-state.edu/statistics/statgen/joul_aut2009/PragmaticTheory.pdf
Note that Mnih was also a coauthor of that RBM paper.
They were also in the BellKor subteam's solution; see, e.g., the intro here:
https://www2.seas.gwu.edu/~simhaweb/champalg/cf/papers/KorenBellKor2009.pdf
Also in the BigChaos subteam's solution:
https://www.asc.ohio-state.edu/statistics/statgen/joul_aut2009/BigChaos.pdf
Thanks, I'll edit this.
To be fair, there was a ton of stuff in the winning blend (as well as ours), so it's hard to expect anyone who was not deep in the weeds of the competition to know all the models that were in there.
Part of the reason I wrote this post was to try to remember all of the details. Clearly my memory is spotty! If there's anything else you think I should add here, please let me know.
Here's a video of a talk I gave in 2010 on my experience in the competition and some of the research that came out of it, in case it's of interest:
https://www.youtube.com/watch?v=coeak1YsaYc
I have watched your talk with great interest and enjoyed it a lot. Thanks so much for the link.
Do you remember anyone using boosting methods (AdaBoost) for the same goal of blending classifiers? It has the same goal, but it focuses on the difficult-to-predict (user, movie) pairs in the training set and tries to correct the prediction mistakes by adding more classifiers to the ensemble. It seems to me an excellent match for the task (see the sketch after the links below)...
https://en.wikipedia.org/wiki/Boosting_(machine_learning)
https://direct.mit.edu/books/oa-monograph/5342/BoostingFoundations-and-Algorithms
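For concreteness, here's a rough sketch of the kind of boosting-style blend I mean, adapted to the ratings-regression setting rather than classification. This is purely illustrative: the function name, array shapes, and hyperparameters are made up, and it isn't taken from any actual competition entry.

```python
import numpy as np

def boosted_blend(P, y, n_rounds=100, lr=0.1):
    """Stagewise (boosting-style) blend of base predictors.

    P : (n_pairs, n_models) base-model predictions on a probe set
    y : (n_pairs,) true ratings
    Each round fits the current residuals with the single best base model,
    so later rounds concentrate on the (user, movie) pairs the blend still
    gets wrong -- the same spirit as AdaBoost's reweighting of hard examples.
    """
    n, m = P.shape
    pred = np.full(n, y.mean())             # start from the global mean rating
    coefs = np.zeros(m)
    for _ in range(n_rounds):
        r = y - pred                        # residuals = what is still hard
        num = (P * r[:, None]).sum(axis=0)
        beta = num / (P ** 2).sum(axis=0)   # per-model least-squares step
        j = int(np.argmax(beta * num))      # model giving the biggest SSE drop
        pred += lr * beta[j] * P[:, j]
        coefs[j] += lr * beta[j]
    return y.mean(), coefs, float(np.sqrt(np.mean((y - pred) ** 2)))
```

The returned coefficients play the role of blend weights; adding per-pair weights or shrinkage schedules would move this closer to full gradient boosting, but the basic idea of repeatedly attacking the residuals is the part that seems well matched to blending.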
Glad you enjoyed the talk. I agree that boosting methods could be well suited for blending. I don't specifically remember them being used much in the Netflix Prize, but I also don't remember the details of the approaches of the other teams that well. You could check the links to the writeups of the subteams of the winning team in my first comment to see if there was much boosting in the solutions.
I feel like any discussion of the Netflix Prize is incomplete without discussing what happened to the winning solution. Though I have heard conflicting reports, my best evaluation of the situation is that the winning solution was never used by Netflix at all, mainly because the business changed to de-emphasize explicit user ratings in favor of implicit metrics like watch time.
"the Netflix Prize also taught us a lot about the insignificance of overfitting. Constantly climbing the leaderboard did not lead to overfitting. The scores at the top of the board were pretty much the same on the private and public test sets. The evidence that leaderboard score was indicative of private score was clear by 2007. People ignored this for 15 years. Some still deny it today."
I disagree with this! In the following sense: as you point out, 90% of the success of Netflix Prize entrants was linear regression. And nobody ever worried about linear regression overfitting! It would be like worrying, "oh no, I averaged too many observations of the same thing, did I overfit?" No! The fact that sample means are often close to population means does not tell you anything about overfitting, because those are not the kind of protocols where people thought there was a potential overfitting problem.
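To illustrate why I don't find the public/private agreement surprising, here is a toy sketch with entirely synthetic data (no relation to the actual Prize files; sizes and noise levels are invented). Fit linear blend weights on a probe set, then score two disjoint holdouts standing in for the public and private test sets; at this sample size the two RMSEs come out essentially identical, as you'd expect.

```python
import numpy as np

rng = np.random.default_rng(0)
n_pairs, n_models = 100_000, 20              # made-up sizes
true = rng.uniform(1, 5, size=n_pairs)       # stand-in for true ratings
# each "base model" = truth plus its own noise
P = true[:, None] + rng.normal(0, 0.9, size=(n_pairs, n_models))

idx = rng.permutation(n_pairs)
probe, public, private = np.split(idx, [n_pairs // 2, 3 * n_pairs // 4])

# ordinary least squares on the probe set gives the blend weights
A = np.c_[np.ones(len(probe)), P[probe]]
w, *_ = np.linalg.lstsq(A, true[probe], rcond=None)

def rmse(split):
    pred = np.c_[np.ones(len(split)), P[split]] @ w
    return np.sqrt(np.mean((pred - true[split]) ** 2))

print(f"public RMSE: {rmse(public):.4f}  private RMSE: {rmse(private):.4f}")
```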
Movie recommendation could be mostly random, while music recommendation should be more predictable? But I found that the Spotify dataset only has around 10 features.
Superb post. This brings back lots of memories.