> Your reason can't be cost because there are superior models that are cheaper than Mistral models
Nope. This is not my experience.
Public pricing in token/$ is only part of the equation.
Mistral tooling to consume significantly less tokens-per-given-task than the Anthropic ones.
My bills currently reflects that.
Compare to Xiaomi MiMo-V2.5 you will be shocked
I think other commenter is talking about smaller/cheaper models like Qwen that outperform mistral on just about every metric