logoalt Hacker News

dannywtoday at 1:13 AM1 replyview on HN

They have all transcripts for at least 30 days. The problem is that (as anyone who used Fable can attest) their classifiers are extremely sensitive and catch tons of innocent queries.

Imagine being a data scientist or MLE training a small classifier model. How do you know you won’t get steering vectors or a PEFT applied?


Replies

nomeltoday at 2:32 AM

Since your answer isn't direct, I'm having a little trouble interpreting it.

Are you saying they should relax guardrails since they have 30 days to know if you produced something bad? If that is what you're saying, then I suspect they chose their current path to prevent, since you can't un-produce. Producing is what would cause regulations/PR problems.

show 1 reply