logoalt Hacker News

mhitzayesterday at 12:48 PM0 repliesview on HN

If you self-host an LLM you'll learn quickly that even batching, and caching can affect determinism. I've ran mostly self-hosted models with temp 0 and seen these deviations.