logoalt Hacker News

Seviiyesterday at 6:25 PM1 replyview on HN

I haven't been bothered by hallucinations in premier models since early last year. Still see it in smaller local models though.


Replies

aliljetyesterday at 6:29 PM

I'm really running into this deep at the edges of content creation. Take, for example, a need to general some kind of legal work. The cost of painstakingly checking and rechecking each case cited is reducing the value of these frontier models immensely.

Coding, however, is solved like magic. Easier to add tests, to be fair.