Interestingly, the chain of thought doesn't always determine the final output. When playing with DeepSeek, for example, it's common to see the CoT arrive at a correct answer that the final answer fails to reflect, and even the reverse, where a chain of faulty reasoning somehow yields the right final answer.
It almost seems that the purpose of the CoT tokens in a transformer network is to act as a computational substrate of sorts: every token the model emits buys it another forward pass, extra serial computation and extra working memory it can attend back to. The exact choice of tokens may matter less than it looks; what matters is that they are present.
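One way to probe this hypothesis is to hold the token budget roughly constant while swapping real reasoning for content-free filler. Here is a minimal sketch using the Hugging Face transformers library; the model name, prompts, and filler string are all illustrative assumptions, not anything from the original text. It asks the same question twice, once with a genuine scratchpad and once with filler of similar token length, so you can compare the final answers:

```python
# Sketch: does a scratchpad's *content* matter, or just its presence?
# Assumptions: any small instruction-tuned causal LM will do; the model
# name below is a placeholder, and the prompt format is made up.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-0.5B-Instruct"  # assumption: swap in any small model
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

question = "What is 17 * 24?"

def answer_with_scratchpad(scratchpad: str) -> str:
    # Inject the scratchpad as if the model had generated it itself,
    # then let it produce only the final answer tokens.
    prompt = f"Q: {question}\nReasoning: {scratchpad}\nFinal answer:"
    inputs = tok(prompt, return_tensors="pt")
    out = model.generate(
        **inputs,
        max_new_tokens=8,
        do_sample=False,
        pad_token_id=tok.eos_token_id,
    )
    new_tokens = out[0][inputs["input_ids"].shape[1]:]
    return tok.decode(new_tokens, skip_special_tokens=True).strip()

real_cot = "17 * 24 = 17 * 20 + 17 * 4 = 340 + 68 = 408."
n_tokens = len(tok(real_cot)["input_ids"])
filler = ". " * n_tokens  # content-free tokens of roughly the same length

print("real CoT ->", answer_with_scratchpad(real_cot))
print("filler   ->", answer_with_scratchpad(filler))
```

If the filler run matches the real-CoT run on problems the model otherwise gets wrong, that is at least weak evidence that the extra tokens are doing computational work independent of what they say; treat it as a probe, not a proof.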