logoalt Hacker News

nsingh2today at 6:41 PM2 repliesview on HN

It's going to be expensive to serve (also not generally available), considering they said it's the largest model they've ever trained.

I suspect it's going to be used to train/distill lighter models. The exciting part for me is the improvement in those lighter models.


Replies

azan_today at 8:22 PM

What's interesting is that scaling appears to continue to pay off. Gwern was right - as always.

AstroBentoday at 8:07 PM

It seems inevitable that costs will come down over time. Expensive models today will be cheap models in a few years.