Can't wait for gemma4-31b-it-claude-opus-4-6-distilled-q4-k-m on huggingface tomorrow

mudkipdev • today at 4:45 PM • 2 replies • view on HN

Replies

entropicdrifter • today at 5:36 PM

I'd rather see a distill on the 26B model that uses only 3.8B parameters at inference time. Seems like it will be wildly productive to use for locally-hosted stuff

indrora • today at 5:54 PM

gemma4-31b-it-claude-opus-4-6-distilled-abliterated-heretic-GGUF-q4-k-m

alt Hacker News

Replies