Hacker News

wiseowise, today at 9:29 AM

> You assert that it writes better code than the average software developer?

Absolutely. It contains a lot, if not the majority, of the code available to us right now and can reason about it, whatever it means for an LLM to reason. It absolutely demolishes the average software developer, and it’s not even close.

> To achieve code coverage it's enough to CALL the code, it doesn't tell you anything about the correctness of the tests: they could all end with a return true, and a code coverage tool would be perfectly happy.

> So, yes, if you don't carefully check the test suite that the agent writes, it might well be worthless (or simply much less useful than you assume it to be, more realistically).

That’s like saying that if you don’t check every line your coworker writes, their code becomes worthless.
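
For anyone who wants to see the quoted coverage point concretely, here is a minimal sketch in Python. Everything in it is made up for illustration: `absolute`, `weak_test`, `strict_test` and `executed_lines` are hypothetical names, and the line tracking uses the standard `sys.settrace` hook as a stand-in for a real coverage tool. It only shows that a test which calls the code without checking anything executes exactly the same lines as a test that does check; it says nothing about how often agents or coworkers write such tests.

```python
import sys


def absolute(x):
    # Deliberately buggy: negative inputs are returned unchanged.
    if x < 0:
        return x
    return x


def weak_test():
    # Calls both branches, checks nothing, then reports success.
    absolute(5)
    absolute(-5)
    return True


def strict_test():
    # Makes the same two calls but actually checks the results.
    return absolute(5) == 5 and absolute(-5) == 5


def executed_lines(test, target):
    """Run test() and return (result, line offsets executed inside target)."""
    code = target.__code__
    hit = set()

    def tracer(frame, event, arg):
        # Record every 'line' event that belongs to the target function.
        if event == "line" and frame.f_code is code:
            hit.add(frame.f_lineno - code.co_firstlineno)
        return tracer

    previous = sys.gettrace()
    sys.settrace(tracer)
    try:
        result = test()
    finally:
        # Always restore whatever trace function was installed before.
        sys.settrace(previous)
    return result, hit


weak_ok, weak_lines = executed_lines(weak_test, absolute)
strict_ok, strict_lines = executed_lines(strict_test, absolute)

print("weak test passed:", weak_ok)
print("strict test passed:", strict_ok)
print("same lines covered:", weak_lines == strict_lines)
```

Both tests exercise the same lines of `absolute`, so a line-coverage number cannot tell them apart; only the second one notices the bug. Whether that makes an unreviewed suite worthless or just less useful than assumed is the part the thread is arguing about.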