I've heard that it was possible to trigger really obvious output poisoning on Fable with someth...

bakugo • today at 4:28 PM • 0 replies • view on HN

I've heard that it was possible to trigger really obvious output poisoning on Fable with something as basic as asking the model to think outside of its built-in hidden thinking delimiters.

This watermark may trigger a similar mechanism.

alt Hacker News