Hacker News

WithinReason, yesterday at 12:47 PM

The model generates probabilities for the next token, then you set the probability of every disallowed token to 0 before sampling (deterministically or probabilistically).
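
A minimal sketch of that masking step, assuming the logits arrive as a plain Python list. The name `mask_and_sample` and the `banned_ids` parameter are made up for illustration and are not taken from any particular library. Setting a logit to negative infinity before the softmax has the same effect as zeroing that token's probability and renormalizing the rest.

```python
import math
import random


def mask_and_sample(logits, banned_ids, greedy=True, rng=None):
    """Pick a next-token id from `logits`, never one listed in `banned_ids`.

    Returns (token_id, probs) so the caller can inspect the masked distribution.
    """
    # A logit of -inf corresponds to a probability of exactly 0 after softmax.
    masked = [(-math.inf if i in banned_ids else x) for i, x in enumerate(logits)]

    top = max(masked)
    if top == -math.inf:
        raise ValueError("every token is banned, nothing left to sample")

    # Numerically stable softmax; exp(-inf) evaluates to 0.0.
    weights = [math.exp(x - top) for x in masked]
    total = sum(weights)
    probs = [w / total for w in weights]

    ids = range(len(probs))
    if greedy:
        # Deterministic choice: the most likely allowed token.
        return max(ids, key=probs.__getitem__), probs

    # Probabilistic choice: sample in proportion to the masked distribution.
    rng = rng or random.Random()
    return rng.choices(ids, weights=probs, k=1)[0], probs


if __name__ == "__main__":
    # Token 1 has the highest raw score but is banned, so it can never be picked.
    token, probs = mask_and_sample([1.0, 5.0, 2.0], banned_ids={1})
    print(token, probs)
```

Note that this only constrains which token ids can be emitted at a given step; it says nothing about what meaning the model can still express with the tokens that remain.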


Replies

PunchyHamster, yesterday at 2:42 PM

But filtering a particular token doesn't fix it even slightly, because it's a language model: it understands synonyms and indirect references, so it can express the same thing with other tokens.
