logoalt Hacker News

ghshephardtoday at 1:13 AM2 repliesview on HN

Do any of the open weight models from smaller labs exist if they can't distill from the SoTA models that are throwing billions of dollars of compute into pretraining?


Replies

daniel_iversentoday at 2:09 AM

I’ve been wondering the same. And I think pretty much all the impressive small lab models were guilty of it, right? At least there is still larger players like DeepSeek and mistral to provide a bit of diversity in the market

username223today at 2:21 AM

Does it matter? The frontier models stole the whole internet, then the second-level models stole from them… It’s all theft.

show 4 replies