Sadly this has been tried before and doesn't work. If an attacker can send enough tokens they...

simonw • today at 4:56 AM • 0 replies • view on HN

Sadly this has been tried before and doesn't work.

If an attacker can send enough tokens they can find a combination of tokens that will confuse the LLM into forgetting what the boundary was meant to be, or override it with a new boundary.

alt Hacker News