Hacker News

cherryteastain, yesterday at 8:02 PM

This logic works only if distilling Claude is the only way to create another SOTA LLM, which is not the case.


Replies

maxdo, today at 1:55 AM

It's not, but the full path costs billions of dollars, versus something in the $10-100M range to stay near SOTA.

The problem is so large in scale that distillation attempts generally account for a decent share of their token revenue.

sciencejerk, yesterday at 11:39 PM

How do you think the Qwen and MiniMax models perform so similarly to Anthropic's frontier models? What is your take, then?
