logoalt Hacker News

furyofantaresyesterday at 11:49 PM1 replyview on HN

I still run into plenty of situations where the LLM-agent wrote the code really inexpensively, but is totally unable to debug it, and you can sink tons of time trying to get it to do so before giving up with nothing to show for it, and trying to figure it out yourself.


Replies

jannyfertoday at 12:01 AM

What kind of code do you work on, and what model & harness do you use? Genuinely curious so I can calibrate my understanding.

I work on enterprise web apps for a few dozen people with Codex CLI and GPT-5.4, and haven't really run in to those issues.

show 1 reply