>establish benchmarks that make sense and are reliable How aren't current LLM coding bench... | alt Hacker News

alt Hacker News

NewsaHackO • last Tuesday at 6:09 PM • 1 reply • view on HN

>establish benchmarks that make sense and are reliable

How aren't current LLM coding benchmarks reliable?

Replies

Papazsazsa • last Tuesday at 7:19 PM

They're manipulated.

➕ show 1 reply