Hacker News

cbg0, today at 4:57 PM

The Q4 quantization requires about 600GB of RAM before accounting for context, which is not exactly consumer-hardware friendly.
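
For a rough sense of where a figure like that comes from, here is a back-of-the-envelope sketch. It assumes weight memory is simply parameter count times bits per weight. The parameter count and the 4.5 bits/weight value used below are illustrative assumptions, not numbers for the model being discussed, and real Q4 formats add per-block scale overhead that varies by format. Context (KV cache) would come on top of this.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10**9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def implied_params(memory_gb: float, bits_per_weight: float) -> float:
    """Invert the estimate: parameter count implied by a weight footprint."""
    return memory_gb * 1e9 * 8 / bits_per_weight


if __name__ == "__main__":
    # Hypothetical model size and effective bit width (assumptions).
    hypothetical_params = 1e12
    assumed_bits = 4.5

    print(f"{weight_memory_gb(hypothetical_params, assumed_bits):.1f} GB of weights")
    # Going the other way from the ~600GB figure quoted above.
    print(f"{implied_params(600, assumed_bits):.3g} parameters implied")
```

Under those assumptions, a footprint around 600GB points to a model on the order of a trillion parameters, which is why a 4-bit quant still does not fit on typical consumer machines.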