Discussion about this post

User's avatar
James Golden's avatar

But don’t all of the best scientists from the biggest American companies tell the thought leaders in congress that open models are dangerous?

Love the OLMo project from the Allen Institute - theyve already spent all the gpu money and given away competitive models up to 32B parameters, with open data, open training and some great insights into how the models work.

https://allenai.org/olmo

https://arxiv.org/abs/2504.07096

Edit: I am reading more of Nathan’s work and appreciating his previous analysis of OLMo and other open weight models like Gemma.

Expand full comment
Yaroslav Bulatov's avatar

I'm also optimistic this can be done, just consider the explosion of new entrants last year. Large companies have unlimited GPUs but they also have an unlimited capacity to waste them, they get better at wasting them as they grow.

Expand full comment
5 more comments...

No posts