brcmthrowaway • yesterday at 11:25 PM

Can someone tell me the mechanism by which the prompts are even recovered?

Cosma Shalizi says that this isn't possible. Are the prompts in the training set? I doubt it.

There's a detailed description of how they were recovered here: https://www.lesswrong.com/posts/vpNG99GhbBoLov9og/claude-4-5...
