mathisfun123 • yesterday at 7:45 PM • 1 reply

You don't know what you're talking about: an enormous amount of TOPS now runs through quantized (read: integer) kernels. Many GPUs don't even have FP64, or even FP32, support.

Replies

jmalicki • yesterday at 8:07 PM

EDIT: I was completely wrong. I have mostly worked with GGUF and related quantizations that are LUTs. Thank you for correcting me.
