Hacker News

c-hendricks, yesterday at 6:28 PM

Not sure you really need huggingface-cli to download anything if you're just using llama.cpp. You can pass `-hf ...` and it will download the models for you. Set `LLAMA_CACHE` to change where the downloads go:

  LLAMA_CACHE="models" ./llama-server \
    -hf unsloth/gemma-4-31B-it-GGUF:UD-Q4_K_XL \
    ...

Replies

dofm, yesterday at 6:32 PM

Yes.

And `-hfd` does the same for the draft model.
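
A rough sketch of how the two flags might be combined, assuming `-hfd` takes the same `<user>/<model>[:quant]` form as `-hf`. The draft repo name below is a made-up placeholder, not a real model, and the script only prints the command rather than running it:

```shell
#!/usr/bin/env bash
# Sketch only: prints a llama-server command line instead of executing it.

MAIN_REPO="unsloth/gemma-4-31B-it-GGUF:UD-Q4_K_XL"  # main model, from the parent comment
DRAFT_REPO="your-org/your-draft-model-GGUF"         # hypothetical placeholder, swap in a real draft model
CACHE_DIR="models"                                  # download directory, as in the parent comment

# Assemble the command as a single line of text.
build_cmd() {
  printf 'LLAMA_CACHE=%s ./llama-server -hf %s -hfd %s\n' \
    "$CACHE_DIR" "$MAIN_REPO" "$DRAFT_REPO"
}

build_cmd
```

Printing first makes it easy to check the repo names before anything gets downloaded; paste the printed line into a shell once it looks right.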
