Yes, I believe the reasoning is that they think safety research can best be done from the frontier.
If you believe it will be developed regardless and that that there's a 30% chance of doom, they want a company prioritising safety research to be the one threading that needle.
Building frontier models to do safety research on them is what Anthropic was all about in the early years. That included building the best model, but only releasing it once it became the second best. Precisely to avoid an AI arms race where everyone is forced to release better and better models, risks be damned
Something changed their mind, and since Opus 3 they are in the business of releasing the best models
Exactly. And within the AI safety discourse, your behavior hinges on what you think the default chance of doom is, and how optimistic you are about alignment work being able to limit it before we reach superintelligence.
People running the labs are in a middle camp where they are scared enough by AI to take the threat seriously, but much more optimistic about alignment than the people who seem to have thought about it the most.
> If you believe it will be developed regardless and that that there's a 30% chance of doom, they want a company prioritising safety research to be the one threading that needle.
They also want to be trillionaires. If they don't built it, no trillions. So they have to build it, now (and get their IPO done before the bubble pops).
It’s all ego. I, and only I, am the bringer of doom, slayer of worlds.
I am so smart that what I do will destroy humanity, or save it.
Fable 5 was great, but not that great.
Sorry to be crude, but both the government and anthropic are acting like a bunch of pussies.
Meow.
Yeah all they care about is safety, but lets see how many of them quits once US government command them to work on autonomous killbots.