logoalt Hacker News

chmod775yesterday at 3:24 PM0 repliesview on HN

Not quite then as well, since a lot is typically executed in parallel and the implementation details of most number representations make them sensitive to the order of operations.

Given how much number crunching is at the heart of LLMs, these small differences add up.