eek2121 • today at 4:47 PM

Not really. Qwen 27b offloads to a decent gaming GPU (an RTX 4090 in my case) without needing tons of RAM.

mathisfun123 • today at 4:54 PM

Can you give more info? llama.cpp or vLLM? What config? I want to try this specific model.
