I've also seen Qwen 3.6 beat GPT 5.5 a couple of times. The ball is definitely in OpenAI's court now. Qwen is not going to fare so well against Fable, from what I've seen so far.
In theory, GPT-5.5-Pro would do better, but it’s so expensive it’s not worth experimenting to find out.
In theory, GPT-5.5-Pro would do better, but it’s so expensive it’s not worth experimenting to find out.