logoalt Hacker News

minimaxiryesterday at 4:27 PM1 replyview on HN

It's funny that 128B is now considered Medium. I remember back in the day when 355M parameters was considered medium with GPT-2.


Replies

speedgooseyesterday at 4:38 PM

And GPT-2 1.5B was considered too dangerous to release.

They were perhaps right.

show 1 reply