But then someone would figure out some prompts that don't trigger this, and Anthropic wouldn't be able to try and disadvantage competitors.
Except they openly reject many many other classes of prompts, including extremely high stakes CBRN.
It's only the direction that has direct potential business impact they've decided to sabotage instead of reject.
Except they openly reject many many other classes of prompts, including extremely high stakes CBRN.
It's only the direction that has direct potential business impact they've decided to sabotage instead of reject.