Lots of confusion about what this model is actually focused on.
It is a cheap specialist for closed-world, verifiable reasoning tasks like math, self-contained coding problems, and similar.
"Closed-world" means the needed information is already in the context. It is not a tool-using agent that can discover missing context. "Verifiable" means answers are hard to generate but easy to check.
So no open ended research, repo wide agent work, factual Q&A, or SVG generation. More of a compact reasoning module for bounded problems.
If it can code well then once you put it in a loop with an interpreter it can do anything.
To follow up on this, I had it solve a nasty ODE problem that I saw in the recent Mathematica 15 release post:
And surprisingly it found a valid solution! Extra impressive because it runs 25 tok/s on my measly RTX 2070 super. Apparently Mathematica 14.3 couldn't solve this ODE.