Wow, 30B parameters as capable as a 1T parameter model?

babelfish • today at 4:25 PM • 1 reply • view on HN

Replies

On the above compared benchmarks is closer to other larger open weights models, and on par with GPT-OSS 120B, for which I also have a frame of reference.

alt Hacker News

Replies