
paulus_magnus2 • yesterday at 5:24 PM

Haha. True, CI success was not part of the PR acceptance criteria at any point.

If you view the PRs, they bundle multiple fixes together, at least according to the commit messages. The next hurdle will be to guardrail agents so that they only implement one task and don't cheat by modifying the CI pipeline.
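A rough sketch of what such a guardrail might look like, assuming the check receives the list of file paths changed by an agent-authored PR. The pattern list and the names `PROTECTED_PATTERNS`, `protected_paths_touched`, and `check_agent_pr` are hypothetical and not taken from any real tool; the patterns would need to match the repo's actual CI layout.

```python
from fnmatch import fnmatchcase

# Hypothetical set of CI-related paths an agent should not modify.
# Adjust to wherever the repo actually keeps its pipeline config.
PROTECTED_PATTERNS = [
    ".github/workflows/*",
    ".gitlab-ci.yml",
    "Jenkinsfile",
]


def protected_paths_touched(changed_paths, patterns=PROTECTED_PATTERNS):
    """Return the changed paths that match any protected pattern."""
    return [
        path
        for path in changed_paths
        if any(fnmatchcase(path, pattern) for pattern in patterns)
    ]


def check_agent_pr(changed_paths):
    """Return (ok, offending_paths) for an agent-authored PR."""
    offending = protected_paths_touched(changed_paths)
    return (not offending, offending)
```

This only covers the "don't touch the pipeline" half. It would not catch an agent that disables or deletes the failing tests themselves, and it says nothing about whether the PR sticks to a single task.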

Replies

formerly_proven • yesterday at 5:31 PM

If I had a nickel for every time I've seen a human dev disable/xfail/remove a failing test "because it's wrong" and then proceed to break production, I would have several nickels, which is not much, but does suggest that deleting failing tests, like many behaviors, is not LLM-specific.
