Thinking vs non-thinking. There'll be a token cost there. But still fairly remarkable!

regularfry • today at 5:13 PM

Is there a reason we can't use thinking completions to train the non-thinking mode? I.e., gradient descent towards what the thinking mode would have answered?
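For what it's worth, the idea reads like knowledge distillation: treat the thinking mode's final answers as targets and fit the non-thinking mode to them. Below is a toy sketch of just the gradient step, not anyone's actual training recipe. The `teacher` distribution, the three candidate answers, and the `distill` helper are all made up for illustration; a real setup would use a language model and sampled completions rather than three bare logits.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def distill(teacher_probs, steps=500, lr=0.5):
    """Fit student logits to the teacher's answer distribution.

    Minimises the cross-entropy H(teacher, student) by plain gradient
    descent. For softmax outputs, the gradient with respect to the
    logits is simply (student - teacher).
    """
    logits = [0.0] * len(teacher_probs)  # student starts out uniform
    for _ in range(steps):
        student_probs = softmax(logits)
        logits = [
            z - lr * (s - t)
            for z, s, t in zip(logits, student_probs, teacher_probs)
        ]
    return softmax(logits)


# Hypothetical: where the "thinking" mode ends up over 3 candidate answers.
teacher = [0.05, 0.9, 0.05]

# The "non-thinking" student is trained to give that answer directly.
student = distill(teacher)
print([round(p, 3) for p in student])
```

The student never sees the reasoning tokens, only where the reasoning landed, which is the "what thinking would have answered" part of the question. Whether that transfers for answers that genuinely need intermediate computation is the open part.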

