I'm fascinated by these smaller models.
The amount of progress they've been making is incredible.
Is anyone following this space more closely? Is anyone predicting performance at certain parameter sizes will plateau soon?
Unlike the frontier models, these don't seem to be showing much progress of slowing down.
On the harness side there's a huge amount of optimisation room to go as well.
I strongly think smaller models will end up being able to do most coding tasks in the future, once they are reigned in properly