Hacker News

moffkalast today at 7:49 AM

If the current state is anything to go by, an automated test would not only flag your out-of-distribution results but also try to gaslight everyone reading its output, adding false indicators to map you into an area that's in distribution. Statistical models cannot accept the existence of extremely rare edge cases.


Replies

ACCount37 today at 8:27 AM

Modern LLMs routinely beat human doctors at diagnosing "extremely rare edge cases".

They have unmatched breadth of knowledge by default, and can maintain attention across entire medical histories.
