catigula • last Friday at 4:52 PM • 1 reply

Yes, they're purposely not 'trained on' chain-of-thought, to avoid making it useless for interpretability. As a result, some models can find it epistemically shocking if you tell them you can see their chain-of-thought. More recent models are clever enough to work out implicitly, without being trained on it, that you can see their chain-of-thought.

Replies

DenisM • last Friday at 5:35 PM

It is in their training set by now.
