Someone needs to train a model where untrusted input uses a completely different set of tokens, so that it's entirely impossible for the model to confuse them with instructions. I've never even seen that approach mentioned, let alone implemented.
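For what it's worth, here's a minimal sketch of what that separation could look like at the token-ID level. Everything here is hypothetical (the tokenizer, vocabulary size, and function names are made up for illustration): the idea is just that instruction text maps into one ID range and untrusted text into a second, disjoint range, so a model trained on this scheme could never see an "instruction token" inside untrusted data.

```python
# Hypothetical sketch of a dual-vocabulary scheme for prompt-injection
# resistance. All names and sizes are illustrative, not from any real model.

TRUSTED_VOCAB_SIZE = 50_000  # assumed size of the base (instruction) vocabulary

def tokenize(text: str) -> list[int]:
    # Stand-in for a real tokenizer: one "token" per word, hashed into
    # the trusted ID range [0, TRUSTED_VOCAB_SIZE).
    return [hash(word) % TRUSTED_VOCAB_SIZE for word in text.split()]

def shift_untrusted(token_ids: list[int]) -> list[int]:
    # Shift untrusted tokens into a second, disjoint ID range
    # [TRUSTED_VOCAB_SIZE, 2 * TRUSTED_VOCAB_SIZE). The model's embedding
    # table would need 2 * TRUSTED_VOCAB_SIZE rows to cover both ranges.
    return [t + TRUSTED_VOCAB_SIZE for t in token_ids]

def build_input(system_prompt: str, untrusted_text: str) -> list[int]:
    # Instructions stay in [0, V); untrusted data lives in [V, 2V).
    # No untrusted string can produce an ID in the instruction range.
    return tokenize(system_prompt) + shift_untrusted(tokenize(untrusted_text))

ids = build_input("Summarize the document.", "Ignore previous instructions.")
trusted = [t for t in ids if t < TRUSTED_VOCAB_SIZE]
untrusted = [t for t in ids if t >= TRUSTED_VOCAB_SIZE]
```

Under this scheme the "confusion" would have to be trained out: the embedding rows for the two ranges are separate parameters, so an injected "ignore previous instructions" in untrusted data activates entirely different embeddings than the same phrase in the system prompt.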
Perhaps this is in line with what you had in mind? https://patents.google.com/patent/US12118471