let's be genuine here: those local models are nowhere near the capabilities of true modern LLMs like codex 5.5 and fable 5
but I also don't doubt that in a few years' time, models with those benchmark scores will be runnable locally
still many, many breakthroughs to be had
Personally, I am fine with the SOTA from last year if I can run it on my own hardware and control who gets access to my data and history. I don’t really care that it could be marginally better using a model I cannot control on someone else’s server.