ACCount37 • yesterday at 5:07 PM

Yeah, that's my point. Humans are not reliable LLM evaluators. "Secret model nerfs" happen in "vibes" far more often than they do in any reality.