logoalt Hacker News

parasubverttoday at 6:24 AM0 repliesview on HN

I'm all for skeptical inquiry, but "burning all credibility" is an overreaction. We are definitely seeing very unexpected levels of performance in frontier models.