Hacker News

zyklu5, yesterday at 7:41 PM

Their explanation for why their idea (SSD) might work, the precision-exploration conflict hypothesis, describes the same problem that adaptive decoding tries to solve.

https://ai.meta.com/research/publications/adaptive-decoding-...


Replies

andy_xor_andrew, yesterday at 10:05 PM

I've been wondering about adaptive decoding! It seems obvious to me that at some points during decoding (reasoning, "creative thinking") you would want a higher temperature, while at other points (emitting syntactically correct code, following a plan that was already established) you would want a lower temperature.
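
To make the intuition concrete, here is a minimal sketch of per-step temperature selection. It is not the method from the linked paper: the phase labels, the temperature values, and the function names are all assumptions made up for illustration, and a real system would have to decide the phase (or the temperature itself) from the model's state rather than from a hand-written label.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Convert logits to probabilities; a lower temperature sharpens the distribution."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical phases and temperatures, chosen only for illustration.
PHASE_TEMPERATURE = {"explore": 1.2, "precise": 0.2}


def sample_token(logits, phase, rng=random):
    """Sample a token index using the temperature assigned to the current phase."""
    probs = softmax_with_temperature(logits, PHASE_TEMPERATURE[phase])
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


logits = [2.0, 1.0, 0.5]
explore = softmax_with_temperature(logits, PHASE_TEMPERATURE["explore"])
precise = softmax_with_temperature(logits, PHASE_TEMPERATURE["precise"])
```

With the same logits, the "precise" setting puts nearly all the probability mass on the top token, while the "explore" setting leaves meaningful mass on the alternatives.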