logoalt Hacker News

ryandrakelast Wednesday at 6:26 PM1 replyview on HN

Yea, for obvious reasons, it seems to be best at code that transforms data: text/binary input to text/binary output. And where the logic can be tracked and verified at runtime with sufficient (text) logging. In other words, it's much better close loop than open loop. I tried to help it by prompting it to please take a screen capture of its output to verify functionality, but it seems LLMs aren't quite ready for that yet.


Replies

mattarmlast Thursday at 2:42 PM

They work much better off a test that must pass. That they can “see”. Without it they are just making up some other acceptance criteria.