> Especially outside the US customers are going to be very hesitant to keep adopting LLMs from US companies.
Not really. There aren't any other choices, and the PRC also heavily utilizes export controls [0].
This is why sovereign AI has become important, as can be seen with EU NatSec use cases tending to use Mistral [1] and Indian state governments starting to use Sarvam [2].
That said, for most commercial use cases, older generations of Opus as well as enterprise-grade GPT and Gemini are fairly good.
The distilled OSS models are alright for hobbyists, but if you have actually used unrestricted and enterprise-grade versions of Claude, Mythos, GPT, and Gemini (most hobbyists don't get access to these), you see how far behind the open weight models are.
Even in China, traditionally open-minded model teams like Alibaba's Qwen are looking to become more restricted given the org changes [3].
Also, corporate RFCs now demand final say on the model used, and depending on the geo, this can be a dealbreaker (e.g., an American financial institution will absolutely blacklist a vendor that uses a Chinese model, and the same holds in reverse; European defense vendors mandate sovereign EU models depending on the opportunity).
[0] - https://www.allbrightlaw.com/EN/10475/f9d4055e47e81afb.aspx
[1] - https://www.reuters.com/business/media-telecom/mistral-defen...
[2] - https://www.sarvam.ai/blogs/partnerships-with-indian-states
[3] - https://www.ft.com/content/b39da303-3188-447b-8b65-3dd8dad8b...
> There aren't any other choices
This might be the trigger for creating other choices. Not within a month, but things can change quickly.
Not sure if this is true - I’ve been using mimo and it’s great
> The PRC also heavily utilizes export controls
Matters not for open weight models, no?
> if you have actually used unrestricted and enterprise grade versions of Claude, Mythos, GPT, and Gemini you see how far behind the open weight models are.
I really do feel like DeepSeek V4 Pro is often better than the current Sonnet, in the general case.
Opus 4.7 is a solid step above Sonnet, and Fable was a solid step above Opus 4.7. I've only had Fable for a few days, obviously, but I was decently impressed, especially after Opus 4.8 was a downright disappointment for me (it's just too buggy; it went out of control three separate times on things Opus 4.7 never had any trouble with). Even so, I still ran into limitations with Fable. It's not world-endingly great.
So, based on that, I think DeepSeek V4 Pro is, ignoring multi-modal capabilities, a couple of solid steps behind. Assuming model iteration continues to decelerate, especially as Anthropic heads into an IPO, I'm guessing that DeepSeek will probably be able to strike back with something further along. Of course we'll see how able and willing they are to stay open weight, but they've done well so far, so there's no reason to doubt them at the moment.
(There are some models that claim to be ahead of DeepSeek V4 Pro. I've tried some of them and haven't really been that impressed. Maybe it's a me issue.)
Now, I reckon most people simply don't need Mythos/Fable for most of what they do, and using Mythos/Fable tokens in place of Sonnet-tier models would not make any sense. At my job we already mostly just use Sonnet as it is. I'm sure there is some cutting-edge research where you want the absolute best model available, and sure, in that case, you're stuck with Anthropic for the moment.
But is that really everyone? After all, while Mythos was dominating the hype cycles, quite a lot of impressive LLM-assisted CVEs dropped that were not linked to Mythos.