Hacker News

xyzzy_plugh, last Tuesday at 9:09 PM

No. It might figure out the solution, but even after many days there's no assurance that it won't get stuck making the same mistakes over and over again, never getting closer to a solution. I've seen this many times.


Replies

manmal, last Tuesday at 10:51 PM

Getting stuck in a loop does still happen, yes. If you run codex in tmux and let another agent occasionally check on progress, it can be prevented. That's not even expensive - checking every 30 minutes suffices. The watchdog agent can then press Esc in tmux and send a message, maybe do some research to get it unstuck, etc.
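
For anyone who wants to try the watchdog setup described above, here is a rough Python sketch of the tmux plumbing. Assumptions, none of which come from the comment itself: the pane target `codex:0.0` is a made-up name, the nudge message is a placeholder, and `looks_stuck` is a crude stand-in (unchanged screen across two checks) for the judgment a real watchdog agent would make. It only catches a stalled session, not a loop that keeps printing new output. The only tmux commands used are `capture-pane -p` and `send-keys`.

```python
import subprocess
import time
from collections import deque

CHECK_INTERVAL_S = 30 * 60   # the 30-minute check interval from the comment
TARGET = "codex:0.0"         # hypothetical tmux target (session:window.pane)
NUDGE = "You seem stuck. Summarize what you tried, then pick a different approach."


def capture_pane(target: str) -> str:
    """Return the visible text of a tmux pane."""
    result = subprocess.run(
        ["tmux", "capture-pane", "-p", "-t", target],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def looks_stuck(snapshots, min_repeats: int = 2) -> bool:
    """Placeholder heuristic: the last `min_repeats` snapshots are identical.

    A real watchdog agent would read the pane and decide for itself.
    """
    recent = list(snapshots)[-min_repeats:]
    return len(recent) == min_repeats and len(set(recent)) == 1


def nudge(target: str, message: str) -> None:
    """Press Esc in the pane, then type a message and submit it."""
    subprocess.run(["tmux", "send-keys", "-t", target, "Escape"], check=True)
    # -l sends the message as literal text rather than as key names
    subprocess.run(["tmux", "send-keys", "-t", target, "-l", message], check=True)
    subprocess.run(["tmux", "send-keys", "-t", target, "Enter"], check=True)


def watch(target: str = TARGET, interval: float = CHECK_INTERVAL_S) -> None:
    """Poll the pane forever, nudging the agent whenever it looks stuck."""
    snapshots = deque(maxlen=4)
    while True:
        snapshots.append(capture_pane(target))
        if looks_stuck(snapshots):
            nudge(target, NUDGE)
            snapshots.clear()  # start fresh after intervening
        time.sleep(interval)
```

Calling `watch()` from a separate shell starts the polling loop. To get closer to what the comment describes, `looks_stuck` would be replaced by a call to a second agent that reads the captured pane text.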

minimaxir, last Tuesday at 9:39 PM

Definitely have not seen that with Opus 4.5.
