Hacker News

nomel today at 12:25 AM

I'm of the opinion that removing guardrails is how you force regulation. What's your opinion on the balance?


Replies

dannyw today at 1:13 AM

They have all transcripts for at least 30 days. The problem is that (as anyone who used Fable can attest) their classifiers are extremely sensitive and catch tons of innocent queries.

Imagine being a data scientist or MLE training a small classifier model. How do you know you won’t get steering vectors or a PEFT applied?

mips_avatar today at 2:41 AM

They’re not safety guardrails; they’re “Anthropic doesn’t like anyone who isn’t Anthropic working on AI” rails.