This logic works only if distilling Claude is the only way to create another SOTA LLM, which is not the case.
it's not but full path is billions of dollars vs 10-100m range to stay near sota.
the problem is so large scale that distill attempts attribute to a decent share of their token revenue generally.
How do you think the Qwen and MiniMax models perform so similarly to Anthropic frontier models? What is your take then?
it's not but full path is billions of dollars vs 10-100m range to stay near sota.
the problem is so large scale that distill attempts attribute to a decent share of their token revenue generally.