PunchyHamster • yesterday at 12:20 PM

I'm sure with benchmarks like these, future LLMs will be optimized to hide regressions by "fixing" the test framework too.

pixl97 • yesterday at 3:24 PM

Isn't misalignment great.
