Which ones fail?
I tested DeepSeek V4 Pro, Qwen 3.6 Max, Qwen 3.7, Kimi K2.6, MiniMax M2.7 - they all fail to answer.
Curiously, MiniMax M3 answers correctly.
Deepkseek
I tested DeepSeek V4 Pro, Qwen 3.6 Max, Qwen 3.7, Kimi K2.6, MiniMax M2.7 - they all fail to answer.
Curiously, MiniMax M3 answers correctly.