Not an expert either, but my understanding is that large models use quantized weights and tensor inp...

10000truths • yesterday at 10:22 PM • 1 reply • view on HN

Not an expert either, but my understanding is that large models use quantized weights and tensor inputs for inference. Multiplication and addition of fixed-point values is associative, so unless there's an intermediate "convert to/from IEEE float" step (activation functions, maybe?), you can still build determinism into a performant model.

Replies

kimixa • yesterday at 10:35 PM

Fixed point arithmetic isn't truly associative unless they have infinite precision. The second you hit a limit or saturate/clamp a value the result very much depends on order of operations.

➕ show 1 reply

alt Hacker News

Replies