logoalt Hacker News

regularfrytoday at 5:13 PM1 replyview on HN

Thinking vs non-thinking. There'll be a token cost there. But still fairly remarkable!


Replies

DoctorOetkertoday at 5:42 PM

Is there a reason we can't use thinking completions to train non-thinking? i.e. gradient descent towards what thinking would have answered?

show 1 reply