I found that, with the heavily quantized Qwen3 models I can cram onto my 3060 Ti, telling the model ...

ryukoposting • today at 5:58 PM • 1 reply • view on HN

I found that, with the heavily quantized Qwen3 models I can cram onto my 3060 Ti, telling the model to use its tools in the system prompt made it a lot more likely to actually do it. YMMV of course, but give it a shot.

Replies

saghm • today at 9:11 PM

I did try this, and it was pretty hit-or-miss still. I even went as far as configuring context for Zed to inject into all conversations saying stuff like "If you need to read a file, call read_file NOW. Do not say you will read it", and it still didn't really make a huge difference.

alt Hacker News

Replies