> we believe [Mythos 5] is likely unable to fully and reliably automate R&D for frontier projects spanning multiple weeks
this is good news, right? right...?
There will probably always be some frontier surface that the frontier model of a given generation can't automate.
So in other words... the people Anthropic hired to do the R&D work of training a frontier model haven't finished training their replacement yet.
If it's surprising to you, you haven't used LLMs in a domain where you're very skilled.
It is certainly good news for those who are selling all these tokens.
lmao, i love how the goalpost is now at the "multiple weeks" mark
Depends whether "unable to fully automate" means "needs occasional human checkpoints" or "slowly stops caring about your actual goal." Pretty different.