Hacker News

taf2, last Tuesday at 11:49 PM

I’m waiting to see results on deepswe - that benchmark really seemed accurate for opus and gpt 5.5…