That's a good ballpark for something quantized to 8 bits per parameter, since 8 bits is one byte per weight. For 16-bit or 32-bit weights you'd double or quadruple that figure.
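A minimal sketch of that back-of-the-envelope math, counting weights only (no KV cache or activation overhead) and using a hypothetical 7B-parameter model as the example:

```python
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Approximate weight memory in GB: parameters * bits, converted to bytes."""
    bytes_total = n_params * bits_per_param / 8
    return bytes_total / 1e9  # decimal gigabytes

# Hypothetical 7B-parameter model at common precisions.
for bits in (8, 16, 32):
    print(f"{bits:>2}-bit: ~{weight_memory_gb(7e9, bits):.0f} GB")
# 8-bit: ~7 GB, 16-bit: ~14 GB, 32-bit: ~28 GB
```

So the rule of thumb is roughly "one GB per billion parameters" at 8 bits, doubled at 16 and quadrupled at 32.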
I've never seen a 32-bit model. There are bound to be a few of them out there, but it's hardly a normal precision.