Good encryption schemes are designed so that ciphertexts are effectively indistinguishable from random data -- you should not be able to see any pattern in the ciphertext without the key (modern schemes don't rely on the algorithm itself being secret).
If your encryption scheme satisfies this, there is nothing for an LLM to learn: given only the ciphertext and not the key, every candidate plaintext -- and therefore every possible next token -- is effectively equally likely, so trying to learn the encryption scheme from example ciphertexts is about as productive as trying to predict next week's lottery numbers.
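To make that concrete, here's a minimal sketch (the choice of AES-GCM via the pyca `cryptography` package and the entropy measurement are my own illustration, not something from the thread): a maximally repetitive plaintext still encrypts to bytes whose empirical distribution is essentially uniform.

    # Sketch: even a maximally repetitive plaintext encrypts to bytes that
    # look statistically uniform, i.e. there is no surface pattern for a
    # next-token predictor to latch onto.
    # Assumes the third-party `cryptography` package (pip install cryptography).
    import os, math
    from collections import Counter
    from cryptography.hazmat.primitives.ciphers.aead import AESGCM

    def bits_per_byte(data: bytes) -> float:
        """Empirical Shannon entropy; 8.0 would be perfectly uniform bytes."""
        counts = Counter(data)
        n = len(data)
        return -sum((c / n) * math.log2(c / n) for c in counts.values())

    plaintext = b"the same phrase over and over " * 1000   # extremely patterned
    key = AESGCM.generate_key(bit_length=256)
    ciphertext = AESGCM(key).encrypt(os.urandom(12), plaintext, None)

    print(f"plaintext:  {bits_per_byte(plaintext):.2f} bits/byte")   # roughly 3.4
    print(f"ciphertext: {bits_per_byte(ciphertext):.2f} bits/byte")  # roughly 8.0

Per-byte entropy is a crude measure, of course, but a model that could reliably beat chance on real ciphertext without the key would amount to a break of the cipher.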
This is why FHE-for-ML schemes [1] don't try to have models learn patterns from ciphertext; instead they compile an already-trained model's arithmetic so it can be evaluated homomorphically, inside the encryption, on encrypted inputs.
[1] It's not for language models, but I like Microsoft's CryptoNets - https://www.microsoft.com/en-us/research/wp-content/uploads/... - as a more straightforward example of what FHE for ML looks like in practice.
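As a rough sketch of that "compute inside the encryption" pattern (this is textbook Paillier with toy parameters -- only additively homomorphic and completely insecure at this size, a simpler stand-in rather than what CryptoNets actually uses): a server can evaluate a linear model's score on encrypted features without ever seeing them, and only the key holder can read the result.

    # Toy (textbook, insecure) Paillier example: the server computes a linear
    # model's score on *encrypted* inputs without ever seeing the plaintext.
    import random
    from math import gcd

    p, q = 1789, 1907                    # toy primes; real schemes use ~1024-bit+
    n = p * q
    n2 = n * n
    lam = (p - 1) * (q - 1) // gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    g = n + 1                                     # standard simple choice
    mu = pow((pow(g, lam, n2) - 1) // n, -1, n)   # inverse of L(g^lam mod n^2)

    def encrypt(m: int) -> int:
        r = random.randrange(1, n)
        while gcd(r, n) != 1:
            r = random.randrange(1, n)
        return (pow(g, m, n2) * pow(r, n, n2)) % n2

    def decrypt(c: int) -> int:
        return ((pow(c, lam, n2) - 1) // n * mu) % n

    # Client encrypts its private feature vector.
    features = [3, 7, 2]
    enc_features = [encrypt(x) for x in features]

    # Server only sees ciphertexts and its own plaintext weights:
    # Enc(a) * Enc(b) = Enc(a + b), and Enc(a)^w = Enc(w * a).
    weights = [5, 2, 9]
    enc_score = 1
    for c, w in zip(enc_features, weights):
        enc_score = (enc_score * pow(c, w, n2)) % n2

    # Only the key holder can read the result: 5*3 + 2*7 + 9*2 = 47.
    assert decrypt(enc_score) == sum(w * x for w, x in zip(weights, features))
    print("decrypted score:", decrypt(enc_score))

The weights act on the underlying plaintext values, the server just never gets to observe them; that's the crucial difference from asking a model to find patterns in the ciphertext bytes themselves.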
I am confused: you can implement LLM training with FHE -- the computation runs under encryption, but the model still learns from the underlying plaintext values. That's a different problem from learning patterns in the encrypted data itself.