Summaries by different smaller models are usually made by closed proprietary models like Claude as a...

kgeist • today at 9:27 AM • 0 replies • view on HN

Summaries by different smaller models are usually made by closed proprietary models like Claude as a way to combat the distillation of real reasoning traces by competitors. Open weight models show the real reasoning traces. Reasoning traces operate in the same space as the non-reasoning output. It's all just one large text for an LLM. Internally, reasoning is just ordinary chat completion between <think></think> tags.

alt Hacker News