> high as the second best general purpose model According to benchmarks which are gamed to the ...

wasfgwp • yesterday at 5:16 PM • 1 reply • view on HN

> high as the second best general purpose model

According to benchmarks which are gamed to the extreme these days. Trusting them blindly isn’t exactly rational either. They don’t necessarily translate that well to real world tasks

It’s obviously not “distilling” as such but there are reasons why Chinnese models are consistently several months behind OpenAI/Antropic

Replies

2ndorderthought • yesterday at 5:38 PM

[dead]

alt Hacker News

Replies