logoalt Hacker News

dist-epochyesterday at 6:59 PM1 replyview on HN

Musk said Grok 5 is currently being trained, and it has 7 trillion params (Grok 4 had 3)


Replies

svarayesterday at 7:57 PM

My understanding is that all recent gains are from post training and no one (publicly) knows how much scaling pretraining will still help at this point.

Happy to learn more about this if anyone has more information.

show 1 reply