logoalt Hacker News

paulryanrogersyesterday at 12:41 PM1 replyview on HN

What would this look like?


Replies

WithinReasonyesterday at 12:47 PM

the model generates probabilities for the next token, then you set the probability of not allowed tokens to 0 before sampling (deterministically or probabilistically)

show 1 reply