Hacker News

mgrund, today at 6:55 PM

I was under the impression that SWE-bench (and I guess most other benchmarks) was supposed to be run offline?

I get that you may accidentally include something in local git history, but it feels off to me to run these kinds of benchmarks online.