Miraste • today at 6:39 PM • 3 replies

>We plan to launch new safeguards with an upcoming Claude Opus model, allowing us to improve and refine them with a model that does not pose the same level of risk as Mythos Preview.

This seems like the real news. Are they saying they're going to release an intentionally degraded model as the next Opus? Big opportunity for the other labs, if that's true.

Replies

SheinhardtWigCo • today at 8:41 PM

The other labs already censor their models. Everyone is trying to find the sweet spot where performance and ‘alignment’ are both maximized. This seems no different.

wslh • today at 8:31 PM

> Big opportunity for the other labs, if that's true.

It sounds like this is being treated as military-grade technology, the way cryptography was in the 90s. The big difference is that these models are very expensive to create and run; it's not about the algorithm. If the story rhymes, it could be a big opportunity for other regions of the world.

zb3 • today at 7:55 PM

Well, since Anthropic treats us as second-class evil citizens, I guess they don't want our evil money either.
