logoalt Hacker News

It blocked us at 'hello ' Anthropic Fable 5 refusing innocuous prompts

17 pointsby abliterationaitoday at 4:54 AM2 commentsview on HN

Comments

bob1029today at 7:18 AM

> users may experience more false positives as we refine these classifiers to respond to new threats. We are working to reduce these as fast as possible.

Getting a really strong capacity issue vibe here. Reframing it as a safety issue could burn a lot of trust if this turns out to be another lie. I hope they've done their math on this one.

afterfiveguytoday at 5:20 AM

How dare you say "hell"o ?