Hacker News

BoorishBears today at 4:35 AM

If they really managed all of this without NVIDIA, from pre-training a 1.6T-parameter model through to post-training, then Dwarkesh Patel got what he wanted.


Replies

chvid today at 6:23 AM

It is interesting how much people doubt Huawei’s capabilities in this area. Jensen does not (in the Dwarkesh Patel interview), though of course you can dismiss this as him talking his own book.

Jabrov today at 4:45 AM

Who? What did he want?
