logoalt Hacker News

mrobtoday at 6:14 PM0 repliesview on HN

You could duplicate every token and reserve the duplicates exclusively for the chain-of-thought, which could be robustly filtered from user input. Basically adding a "thought" bit to each token.