logoalt Hacker News

lufenialif2today at 12:18 AM1 replyview on HN

Curious how you make something that has data exfiltration as a feature secure.


Replies

CuriouslyCtoday at 1:01 AM

Mitigate prompt injection to the best of your ability, implement a policy layer over all capabilities, and isolate capabilities within the system so if one part gets compromised you can quarantine the result safely. It's not much different than securing human systems really. If you want more details there are a lot of AI security articles, I like https://sibylline.dev/articles/2026-02-15-agentic-security/ as a simple primer.

show 1 reply