Hacker News

Ukv, today at 1:27 PM

> Then the correct answer is “I can’t tell.”

According to the paper, they're using structured JSON schema mode as opposed to freeform answers, so the model can't respond that way. In my experience, models do typically caveat their answers to questions like this.
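
To illustrate the point, here is a rough sketch of why a schema-constrained reply has no room for "I can't tell". The schema, the field names (`answer`, `confidence`), and the `conforms` helper are all made up for illustration; the paper's actual schema isn't quoted in this thread, and the check below is hand-rolled rather than a real JSON Schema validator.

```python
import json

# Hypothetical schema, NOT the one from the paper: the answer must be one of
# two fixed strings, so there is no slot for an abstention.
ANSWER_SCHEMA = {
    "type": "object",
    "properties": {
        "answer": {"type": "string", "enum": ["yes", "no"]},
        "confidence": {"type": "number", "minimum": 0, "maximum": 1},
    },
    "required": ["answer", "confidence"],
}


def conforms(raw: str, schema: dict = ANSWER_SCHEMA) -> bool:
    """Minimal check covering only the schema keywords used above."""
    try:
        obj = json.loads(raw)
    except json.JSONDecodeError:
        return False  # freeform text is not even valid JSON
    if not isinstance(obj, dict):
        return False
    if any(key not in obj for key in schema["required"]):
        return False
    for key, rules in schema["properties"].items():
        if key not in obj:
            continue
        value = obj[key]
        if rules["type"] == "string" and not isinstance(value, str):
            return False
        if rules["type"] == "number" and (
            isinstance(value, bool) or not isinstance(value, (int, float))
        ):
            return False
        if "enum" in rules and value not in rules["enum"]:
            return False
        if "minimum" in rules and value < rules["minimum"]:
            return False
        if "maximum" in rules and value > rules["maximum"]:
            return False
    return True


print(conforms('{"answer": "yes", "confidence": 0.9}'))
print(conforms('{"answer": "I can\'t tell", "confidence": 0.2}'))
print(conforms("I can't tell"))
```

Under these assumptions, only the first reply passes. Letting the model abstain would take a schema change, for example adding a value such as "unknown" to the `enum`.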


Replies

professoretc, today at 1:47 PM

They'll qualify their answers in English, but as the article mentions, if your prompt asks for a confidence score, that "uncertainty" doesn't translate into a low numerical confidence.
