hadlock • yesterday at 11:15 PM • 1 reply

Qwen's ~30B-class models are genuinely good enough for use if you can find a machine with enough memory bandwidth to run them at 30-90 tokens/second. It's been extremely telling that Qwen stopped releasing 120B-class models. At some point in the next 10 years (maybe 3?) someone is going to release an Opus 4.5-class 256B model you can run locally. Right now our engineers use about $800/mo worth of Opus tokens; at that rate a local LLM setup pays for itself in ~10 months.
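
For anyone who wants to plug in their own numbers, here is a rough sketch of that break-even math. The $800/mo figure is from above; the $8,000 hardware price is only an assumption, back-solved from the ~10 month estimate, and the function name and the optional running-cost parameter are made up for illustration.

```python
def payback_months(hardware_cost_usd: float,
                   monthly_api_spend_usd: float,
                   monthly_running_cost_usd: float = 0.0) -> float:
    """Months until avoided API spend covers the up-front hardware cost."""
    monthly_savings = monthly_api_spend_usd - monthly_running_cost_usd
    if monthly_savings <= 0:
        # Running the box costs as much as the API did, so it never pays off.
        return float("inf")
    return hardware_cost_usd / monthly_savings


# $800/mo comes from the comment; $8,000 is an assumed hardware price.
print(payback_months(8_000, 800))
```

Power and admin time go in `monthly_running_cost_usd`; anything nonzero there pushes the break-even point past 10 months.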

Replies

strictnein • today at 1:03 AM

Didn't Qwen stop releasing their more powerful models because they're commercializing them?
