
qeternity · today at 8:12 PM

This is not the case for LLMs. FP16/BF16 is the standard training precision, with FP8 inference very common. But labs are moving toward FP8 training and even FP4.
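
For concreteness, here is a minimal sketch (assuming PyTorch >= 2.1, which exposes float8 dtypes; the e4m3 variant below is one of the two common FP8 formats) of the range/precision trade-offs behind these formats: BF16 keeps FP32's exponent range with few mantissa bits, FP16 keeps more mantissa bits but less range, and FP8 gives up most of both, which is why FP8 training recipes rely on careful scaling.

```python
# Sketch, not a definitive benchmark: compare range and precision
# of the dtypes mentioned in the comment. Requires PyTorch >= 2.1.
import torch

for dtype in (torch.bfloat16, torch.float16, torch.float8_e4m3fn):
    info = torch.finfo(dtype)
    # max shows dynamic range, eps shows precision near 1.0
    print(f"{str(dtype):24} max={info.max:<12g} eps={info.eps:g}")

# Round-tripping a value through each dtype shows the quantization
# error that lower-precision training/inference has to tolerate.
x = torch.tensor([3.14159], dtype=torch.float32)
for dtype in (torch.bfloat16, torch.float16, torch.float8_e4m3fn):
    y = x.to(dtype).to(torch.float32)
    print(f"{str(dtype):24} 3.14159 -> {y.item():.5f}")
```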