Hacker News

Bjartr today at 3:14 AM

It is text describing a plausible or likely thought process, and it conditions future generation by its presence in the context.


Replies

CamperBob2 today at 4:21 AM

Interestingly, it doesn't always condition the final output. When playing with DeepSeek, for example, it's common to see the CoT arrive at a correct answer that the final answer doesn't reflect, and even vice versa, where a chain of faulty reasoning somehow yields the right final answer.

It almost seems that the purpose of the CoT tokens in a transformer network is to act as a computational substrate of sorts. The exact choice of tokens may not be as important as it looks, but it's important that they are present.
