Hacker News

NitpickLawyer, yesterday at 7:21 PM

> for benchmaxxing.

Out of all the big 4 labs, Google is the last I'd suspect of benchmaxxing. For me, their models have generally underbenched and overdelivered on real-world tasks ever since 2.5 Pro came out.