Looping is a common problem with the Qwen models. I've had good luck using --repeat-penalty=1.1...

cpburns2009 • last Monday at 10:49 PM • 1 reply • view on HN

Looping is a common problem with the Qwen models. I've had good luck using --repeat-penalty=1.1 with llama.cpp and 27B. vLLM should have a similar option.

Replies

etdznots • yesterday at 3:31 PM

This is the default value!

➕ show 1 reply

alt Hacker News

Replies