This is a ridiculous thing to test on it. Other models are trained on that kind of thing, use those instead.
Fable was designed for _really_ hard software engineering problems. Possibly large, but especially hard. For those tasks, you feel the difference immediately.
I tested it on that too. A problem I usually give a model to test is to optimise already well optimised function that performs certain calculations. I give it reference to CPU instruction set, how instructions can be paired to take advantage of superscalar execution pipeline etc. In that test also it fell on its face by producing code that was demonstrably slower and with extra bug.
No it wasn't, Fable is a general purpose model for use in regular chat, analysis, as well as coding.
And yes, the parent poster is accurate, Fable is just as prone to moronic mistakes as Opus was. Stop being so AI-pilled.
Codex is still a better model, and yes, for the hardest engineering problems. I use Claude for UI/GUIs and Codex for all my backend, because I have 20 years of experience of actual hard engineering, and I can see that Codex writes, cleaner code, and is far more steerable.
Bad engineers think Claude is better because it writes more lines of code and is more "proactive", but lines of code doesn't make a better system.