Hacker News

fooker, today at 3:05 AM

LLMs specifically are fine with random bits flipped for the results to be 'creative'.


Replies

jedberg, today at 3:33 AM

That's not exactly how LLM temperature works. :) Also, that's at inference, not training. Presumably these would be used for training; the latency would be too high for inference.
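For context on the temperature point, here is a minimal sketch of how sampling temperature is commonly applied at inference time: the logits are divided by the temperature before the softmax, and a token is drawn from the resulting distribution. No bits in the weights or activations get flipped. The function names and the toy logits are illustrative assumptions, not anything taken from the thread or from a specific library.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn raw logits into probabilities after scaling by 1/temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the max before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature, rng):
    """Draw one token index from the temperature-scaled distribution."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.0]  # hypothetical scores for three tokens
    rng = random.Random(0)
    for t in (0.1, 1.0, 10.0):
        probs = softmax_with_temperature(toy_logits, t)
        print(t, [round(p, 3) for p in probs], sample_token(toy_logits, t, rng))
```

A low temperature concentrates probability on the highest-scoring token, while a high temperature flattens the distribution. The randomness comes from the sampling step, not from noise in the computation itself.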
