Hacker News

mohas, yesterday at 6:14 PM

I kinda feel this benchmarking thing with Chinese models is like university Olympiads: they study specifically for those, but when it comes to real-world work they seriously lag behind.


Replies

OsrsNeedsf2P, yesterday at 6:16 PM

I kinda feel like the goalposts are shifting. While we're not there yet, in a world where Chinese models surpass Western ones, HN will be nitpicking edge cases long after the ship has sailed.
