logoalt Hacker News

daveguytoday at 1:32 PM2 repliesview on HN

Better options would have been "True", "False", "Unknown" (which opinions would fall under too). That also includes an interesting assessment of how well LLMs can identify missing information. My guess is they would be a very low number of "unknown" and a much higher level of agreement (assuming equal representation). Unless the RLHF techniques have gotten better at getting an LLM to say "I don't know", which I doubt. Saying "I don't know" is not good for a dopamine release to keep users coming back for more.


Replies

kostajtoday at 1:36 PM

Tried initially with a fifth bucket, Abstain. It was actually heavily used by some of the models. But it felt as if they are using this to "avoid" some of the hard questions, and we dropped this bucket to force them to provide a verdict.

show 8 replies
skybriantoday at 3:00 PM

I wouldn’t expect opinions to go into “unknown.” Maybe have an “it’s complicated” bucket.