Tuned Qwen 3.5 27B beats Step 3.5 on almost all benchmarks, so the point about the size class is moo...

lostmsu • yesterday at 8:59 PM • 1 reply • view on HN

Tuned Qwen 3.5 27B beats Step 3.5 on almost all benchmarks, so the point about the size class is moot.

Replies

tempaccount420 • yesterday at 9:21 PM

Benchmarks are not interesting in deciding the "size class". Bigger size means more knowledge. Also, the Qwen 3.5 27B is a dense 27B active parameter model. StepFun 3.5 Flash has 11B active parameters.

➕ show 1 reply

alt Hacker News

Replies