logoalt Hacker News

andyferristoday at 9:34 AM0 repliesview on HN

Actually - do they do this in LLM benchmarks? As a measure of overconfidence/confabulation? Seems immediately applicable.