
leetrout today at 1:40 AM

Related: check out Chain of Draft if you haven't.

Similar performance to chain of thought with 7% of the tokens.

https://arxiv.org/abs/2502.18600


Replies

astrange today at 5:18 AM

That's a comparison to "CoT via prompting of chat models", not "CoT via training reasoning models with RLVR", so it may not apply.