logoalt Hacker News

talldayo01/21/20251 replyview on HN

R1 is a rehash of things we've already seen, and a particularly neutered one at that. Are there any better examples you can think of?


Replies

bugglebeetle01/21/2025

Uh, they invented multilatent attention and since the method for creating o1 was never published, they’re the only documented example of producing a model of comparable quality. They also demonstrated massive gains to the performance of smaller models through distillation of this model/these methods, so no, not really. I know this is the internet, but we should try to not just say things.