logoalt Hacker News

moabtoday at 4:22 PM1 replyview on HN

So to verify their claims and see how strong these models are, the answer is "believe us"?

Note: I'm expressing some skepticism here largely due to how recent rollouts from Meta flopped. Sincerely hoping that they do better this time around!


Replies

nemomarxtoday at 4:27 PM

I assume the answer is try it out in the chat mode? You could run your usual benches through that right