cbg0 • today at 4:57 PM • 0 replies • view on HN

The Q4 quantization requires about 600 GB of RAM before you account for any context, which is not exactly consumer-hardware friendly.
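
For anyone who wants to sanity-check a figure like this, here is a rough back-of-the-envelope sketch. It assumes "Q4" means a block format at roughly 4.5 bits per weight (4-bit values plus a per-block scale), and it covers weights only, with no KV cache or runtime overhead. The function names and the 4.5 figure are my assumptions, not something stated about this particular model.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in decimal GB."""
    return n_params * bits_per_weight / 8 / 1e9


def params_from_memory(memory_gb: float, bits_per_weight: float) -> float:
    """Invert the estimate: parameter count implied by a weight footprint."""
    return memory_gb * 1e9 * 8 / bits_per_weight


if __name__ == "__main__":
    # Assumed: ~4.5 bits per weight for a Q4-style block quantization.
    bpw = 4.5
    implied = params_from_memory(600, bpw)
    print(f"600 GB at {bpw} bpw implies about {implied / 1e12:.2f}T parameters")
    print(f"1T parameters at {bpw} bpw needs about {weight_memory_gb(1e12, bpw):.1f} GB")
```

Under those assumptions, 600 GB corresponds to roughly a trillion parameters, and context memory comes on top of that.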
