> No one is bitter lesson pilled anymore.
Will the 10T parameter Mythos model be released this month or next month?
They better soon because it is generally accepted that one of the reasons GPT 5.5 is better at hard tasks than Opus is because of its parameter size - and that Opus 4.8 remains competitive only be scaling test-time compute (see how many more tokens it uses than GPT 5.5)
https://www.reddit.com/r/LLM/comments/1sz8bjz/parameter_esti...
Why ask me? Anyway, Mythos is not 10T. Anthropic confirmed the training run was under 10^26 flops. You can't train 10T to chincilla and stay under 10^26.
Anthropic also confirmed they will not release Mythos, only a "Mythos-class" model, whatever that means.