See page 54 onward for new "rare, highly-capable reckless actions" including
- Leaking information as part of a requested sandbox escape
- Covering its tracks after rule violations
- Recklessly leaking internal technical material (!)
[dead]
To be honest it feels like we are reading stuff like this on every model release.
Anyone who has used Opus recently can verify that their current model does all of these things quite competently.