aussieguy1234, today at 12:28 AM

If SWE-Bench Verified is no longer a good measure of agentic coding abilities, which benchmark is?