Its not deterministic. Any individual floating point mul/add is deterministic, but in a GPU the...

minimaltom • yesterday at 6:06 PM • 1 reply • view on HN

Its not deterministic. Any individual floating point mul/add is deterministic, but in a GPU these are all happening in parallel and the accumulation is in the order they happen to complete.

When you add A then B then C, you get a different answer than C then A then B, because floating point, approximation error, subnormals etc.

Replies

bonoboTP • yesterday at 10:29 PM

It can be made deterministic. It's not trivial and can slow it down a bit (not much) but there are environment variables you can set to make your GPU computations bitwise reproducible. I have done this in training models with Pytorch.

➕ show 1 reply

alt Hacker News

Replies