logoalt Hacker News

walrus01today at 8:36 AM0 repliesview on HN

Give it a day or two and the 'unsloth' people will probably publish a Q6 and Q8 (maybe Q8XL?) quantization in GGUF format for llama-server and other users.