Hacker News

sumeno yesterday at 9:00 PM

> In fact, most of the time models fail to demonstrate introspection—they’re either unaware of their internal states or unable to report on them coherently.

You got the wrong takeaway from your link.


Replies

XenophileJKO yesterday at 9:14 PM

The parent said: "I argue that the model has no access to its thoughts at the time."

This is falsified by that study, which shows that generalized introspection does exist in frontier models. It isn't consistent, but it is provable.

"no access" vs. "limited access"
