Better options would have been "True", "False", "Unknown" (which opini...

daveguy • today at 1:32 PM • 2 replies • view on HN

Better options would have been "True", "False", "Unknown" (which opinions would fall under too). That also includes an interesting assessment of how well LLMs can identify missing information. My guess is they would be a very low number of "unknown" and a much higher level of agreement (assuming equal representation). Unless the RLHF techniques have gotten better at getting an LLM to say "I don't know", which I doubt. Saying "I don't know" is not good for a dopamine release to keep users coming back for more.

Replies

kostaj • today at 1:36 PM

Tried initially with a fifth bucket, Abstain. It was actually heavily used by some of the models. But it felt as if they are using this to "avoid" some of the hard questions, and we dropped this bucket to force them to provide a verdict.

➕ show 8 replies

skybrian • today at 3:00 PM

I wouldn’t expect opinions to go into “unknown.” Maybe have an “it’s complicated” bucket.

alt Hacker News

Replies