Hacker News

gunalx, yesterday at 12:26 PM

I have come across the claim that turning on caching means the LLM has a faint memory of what was in the cache, even for unrelated queries. If this is the case, it's fully unreasonable to share the cache, because of the possibility of information leakage.


Replies

weird-eye-issue, yesterday at 1:27 PM

This is absolutely 100% incorrect.

samwho, yesterday at 12:31 PM

How would information leak, though? There’s no difference in the probability distribution the model outputs when caching vs not caching.
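To make that point concrete, here is a minimal sketch, assuming a toy single-layer attention model with random weights. It is not any provider's actual implementation, and every name in it (`build_cache`, `probs_with_cache`, `probs_without_cache`, the sizes `D` and `VOCAB`) is made up for illustration. It computes the next-token distribution once from scratch and once by reusing cached keys and values for the prompt prefix, then checks that the two agree.

```python
import math
import random

random.seed(0)
D = 4       # embedding width (toy value, assumption)
VOCAB = 5   # vocabulary size (toy value, assumption)

def rand_matrix(rows, cols):
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

# Random stand-ins for trained weights.
EMB = rand_matrix(VOCAB, D)
WQ, WK, WV = rand_matrix(D, D), rand_matrix(D, D), rand_matrix(D, D)
WOUT = rand_matrix(D, VOCAB)

def vecmat(v, m):
    # Row vector times matrix.
    return [sum(v[i] * m[i][j] for i in range(len(v))) for j in range(len(m[0]))]

def softmax(xs):
    top = max(xs)
    exps = [math.exp(x - top) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def next_token_probs(keys, values, query):
    # Scaled dot-product attention for the last position, then an output projection.
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(D) for key in keys]
    weights = softmax(scores)
    ctx = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(D)]
    return softmax(vecmat(ctx, WOUT))

def probs_without_cache(tokens):
    # Recompute keys and values for every token in the prompt.
    keys = [vecmat(EMB[t], WK) for t in tokens]
    values = [vecmat(EMB[t], WV) for t in tokens]
    return next_token_probs(keys, values, vecmat(EMB[tokens[-1]], WQ))

def build_cache(prefix):
    # The cache holds only the keys and values computed for the prefix tokens.
    return ([vecmat(EMB[t], WK) for t in prefix],
            [vecmat(EMB[t], WV) for t in prefix])

def probs_with_cache(cache, suffix):
    # Reuse the cached prefix entries and compute only the new suffix entries.
    keys, values = list(cache[0]), list(cache[1])
    for t in suffix:
        keys.append(vecmat(EMB[t], WK))
        values.append(vecmat(EMB[t], WV))
    return next_token_probs(keys, values, vecmat(EMB[suffix[-1]], WQ))

if __name__ == "__main__":
    prompt = [1, 3, 0, 2, 4]
    fresh = probs_without_cache(prompt)
    cached = probs_with_cache(build_cache(prompt[:3]), prompt[3:])
    print(all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(fresh, cached)))
```

In this sketch the cache stores intermediate values that would have been recomputed anyway, so reusing them cannot shift the output distribution. The weights are never modified, which is why nothing carries over to a query that does not share the cached prefix.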
