Hacker News

tuhgdetzhh yesterday at 8:44 PM

The test is rigged because they used non-thinking models.


Replies

felix089 yesterday at 9:03 PM

These are reasoning/thinking models.
