logoalt Hacker News

hmmmmmmmmmmmmmmtoday at 2:02 PM3 repliesview on HN

Yeah I wouldn't get too excited. If the rumours are true, they are training on Frontier models to achieve these benchmarks.


Replies

jimmydoetoday at 3:25 PM

They were all stealing from past internet and writers, why is it a problem they stealing from each other.

YetAnotherNicktoday at 2:16 PM

Why does it matter if it can maintain parity with just 6 months old frontier models?

show 1 reply
loudmaxtoday at 2:27 PM

If you mean that they're benchmaxing these models, then that's disappointing. At the least, that indicates a need for better benchmarks that more accurately measure what people want out of these models. Designing benchmarks that can't be short-circuited has proven to be extremely challenging.

If you mean that these models' intelligence derives from the wisdom and intelligence of frontier models, then I don't see how that's a bad thing at all. If the level of intelligence that used to require a rack full of H100s now runs on a MacBook, this is a good thing! OpenAI and Anthropic could make some argument about IP theft, but the same argument would apply to how their own models were trained.

Running the equivalent of Sonnet 4.5 on your desktop is something to be very excited about.

show 1 reply