so how does llm moderation work now on all the major chatbots? they refuse prompts that are against their guidelines right?
Sometimes. That's the whole problem, in short.
Sometimes. That's the whole problem, in short.