logoalt Hacker News

snqbtoday at 1:50 PM1 replyview on HN

how well does it do on frontier models like Opus 4.6?


Replies

GodelNumberingtoday at 2:10 PM

I have only done functionality testing, no benchmark testing on Opus (decided to pay my rent instead)