logoalt Hacker News

tipsytoadtoday at 8:41 AM4 repliesview on HN

1.0 is actually pretty arbitrary and way too high as a general rule. Something like 0.3 is a more sensible default


Replies

317070today at 10:06 AM

If RL was used to train the model, the model will have been trained on its own sequences. Those will have been generated with a temperature of 1.0. They must be, otherwise you would get a premature collapse or explosion of your entropy if the temperature was respectively lower or higher.

After that RL step, you want to stick to the RL distribution, and so keep a temperature of 1.0. Other temperatures will drive the model out-of-distribution.

That is why the sampling step for agents or thinking LLMs are usually kept at a temperature of 1.0.

zipy124today at 9:14 AM

It really depends on the application does it not? I'm not an LLM guy, but for creative tasks like storytelling wouldn't you want a higher temperature usually? Happy to gain insight from anyone with experience here :)

embedding-shapetoday at 8:51 AM

Heavily depends on the model architecture and the implementation though, I don't think you can say what values are better than others without first specifying those, otherwise it's straight up guessing, ironically.

nullctoday at 10:20 AM

If you use a model in a configuration far from where it was RLed you get no warranty. (you also get no warranty the other way, however)