With yarn and rope scaling arguments for llama.cpp you could run qwen3.6-27B with 1M context… if you...

0xc133 • today at 4:37 PM • 0 replies • view on HN

With yarn and rope scaling arguments for llama.cpp you could run qwen3.6-27B with 1M context… if you have enough memory to store it.

alt Hacker News