NitpickLawyer • yesterday at 7:21 PM

> for benchmaxxing.

Out of all the big 4 labs, Google is the last I'd suspect of benchmaxxing. For me, their models have generally underbenched and overdelivered on real-world tasks ever since 2.5 Pro came out.
