logoalt Hacker News

ariwilsontoday at 12:24 AM2 repliesview on HN

Very cool and works pretty well!


Replies

onlyrealcuzzotoday at 12:44 AM

I'm fascinated by these smaller models.

The amount of progress they've been making is incredible.

Is anyone following this space more closely? Is anyone predicting performance at certain parameter sizes will plateau soon?

Unlike the frontier models, these don't seem to be showing much progress of slowing down.

show 1 reply