but filtering a particular token doesn't fix it even slightly, because it's a language model: it knows synonyms and indirect references, so it can express the same thing without ever emitting that token.
I'm obviously talking about network output, not input.
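To make the point concrete, here is a minimal sketch of a naive output filter. It works on whole words rather than real tokenizer tokens, and the blocklist, the `filter_output` name, and the example sentences are all made up for illustration, not taken from any actual system.

```python
import re

# Hypothetical blocklist for illustration only.
BLOCKED = {"cat"}

def filter_output(text: str) -> str:
    """Mask blocklisted words in generated text.

    A word-level stand-in for token filtering on the model's output.
    """
    def mask(match: re.Match) -> str:
        word = match.group(0)
        # Replace the word with asterisks only if it is on the blocklist.
        return "*" * len(word) if word.lower() in BLOCKED else word

    return re.sub(r"[A-Za-z]+", mask, text)

# The exact word is caught...
direct = filter_output("the cat sat on the mat")
# ...but a synonym carrying the same meaning passes untouched.
paraphrase = filter_output("the feline sat on the mat")

print(direct)
print(paraphrase)
```

The filter only sees surface strings, so anything the model can rephrase gets through, and growing the blocklist just moves the problem to the next synonym.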