logoalt Hacker News

catlifeonmarsyesterday at 7:07 PM1 replyview on HN

I am curious, does this mean that you can escape the chat template “early” by providing an end token in the user input, or is there also an escape mechanism (or token filtering mechanism) applied to user input to avoid this sort of injection attack?


Replies

reactordevyesterday at 7:45 PM

Neither, it’s just not providing the base chat template that the model expects between the im tags. This isn’t a hack and it’s not particularly useful information. Abliteration is what he really wanted

show 1 reply