Sonnet 4.5 and 4.6*
There is no way it exceeds “all other” open models, but it does exceed all of Mistral’s past models.
You can see it getting blown past by GLM 5.1 and Kimi in this.
Still excited to give it a try
It looks like Qwen 3.6 is winning the April small-model rollout, and it's smaller too.