logoalt Hacker News

alexpop80last Monday at 12:08 PM0 repliesview on HN

What do you mean? Opus 4.5 and GPT 5.2 broke the 80% mark and no other models yet seem to be passing this important milestone.