AndrewHampton, yesterday at 11:53 PM

This seems like an important caveat to the SWE-bench results, but the trend is still clearly toward AI becoming more and more capable.