Hacker News

cpburns2009, last Monday at 10:49 PM

Looping is a common problem with the Qwen models. I've had good luck using --repeat-penalty=1.1 with llama.cpp and the 27B model. vLLM should have a similar option.
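For anyone wondering what that flag actually does, here is a rough sketch of the usual repeat-penalty scheme, assuming the common approach where logits of recently generated tokens are divided by the penalty if positive and multiplied by it if negative. The function name `apply_repeat_penalty` and the toy logits are made up for illustration; this is not llama.cpp's actual code.

```python
def apply_repeat_penalty(logits, recent_tokens, penalty=1.1):
    """Return a copy of `logits` with recently seen token ids made less likely.

    Hypothetical helper for illustration only (not a llama.cpp or vLLM API).
    `logits` is a list of floats indexed by token id; `recent_tokens` is the
    window of token ids generated so far.
    """
    out = list(logits)  # copy so the caller's list is not modified
    for tok in set(recent_tokens):  # penalize each distinct token once
        if out[tok] > 0:
            out[tok] /= penalty  # shrink positive logits toward zero
        else:
            out[tok] *= penalty  # push non-positive logits further down
    return out


# Toy example: token ids 0 and 1 were generated recently, token 2 was not.
logits = [2.2, -1.0, 0.5]
penalized = apply_repeat_penalty(logits, recent_tokens=[0, 1, 0], penalty=1.1)
print(penalized)
```

With a penalty of 1.0 nothing changes, so values slightly above 1.0 such as 1.1 are a mild nudge away from repeating tokens. In vLLM the analogous knob is, as far as I know, `repetition_penalty` on `SamplingParams`.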


Replies

etdznots, yesterday at 3:31 PM

This is the default value!
