Hacker News

pulse7, yesterday at 10:10 PM

Can you please share your llama.cpp server parameters for turning on the modern LLM sampling stack?

The docs [1] say that top_n_sigma is already in the default sampler list: "(default: penalties;dry;top_n_sigma;top_k;typ_p;top_p;min_p;xtc;temperature)"

[1] https://github.com/ggml-org/llama.cpp/blob/master/tools/serv...