logoalt Hacker News

ndriscollyesterday at 12:41 AM1 replyview on HN

Except I generally do know when I'm wrong because I'm working in a domain I am familiar with, and it will often create experiments on the fly unprompted (well, prompted, but generically in AGENTS.MD) to check itself. My experience actually using it for software is that it almost never makes up answers. The answer for hallucinations is fairly simple: give it facts and tools to ground itself.


Replies

solid_fuelyesterday at 5:46 PM

> Except I generally do know when I'm wrong because I'm working in a domain I am familiar with, and … My experience actually using it for software is that it almost never makes up answers.

Yes I am certain that it feels that way. However empirical testing holds a lot more weight than anecdotes.

> The answer for hallucinations is fairly simple: give it facts and tools to ground itself.

The entire danger here is that it hallucinates when you don’t know the ground facts. After all, you don’t know what you don’t know.