Hacker News

ksubedi yesterday at 6:19 PM

Let's not forget Qwen 35B A3B MoE. It performs better than this on every metric for a fraction of the memory and compute footprint.

Sad to see all the non-Chinese open-source models at least one generation behind.


Replies

simjnd yesterday at 6:32 PM

Qwen3.6 27B is even more impressive IMO. It's dense, so it doesn't run as fast, but it's so good.
