logoalt Hacker News

algoth1yesterday at 10:51 PM8 repliesview on HN

It’s unusable for me due to the refusals. I’m using claude to find patterns in health data


Replies

yakzyesterday at 11:56 PM

I do some work in laboratory automation and it was quick to refuse the first thing I asked it to do. There wasn't anything spicy in the request, just basic liquid-handling protocol implementation. Their position seems to be that they're too stupid to classify requests safely, and that seems reasonable to me. I'd guess the classifier will improve rapidly.

show 1 reply
dmdyesterday at 11:28 PM

Same. I'm working on a set of python and matlab scripts that deals with segmenting MRI images into brain vs skull, and it thinks that's bioterrorism.

show 1 reply
rvnxtoday at 1:07 AM

Quite counterproductive to refuse to help on health issues too. If they detect health data, they can add a disclaimer, but not hide the information.

show 1 reply
girafffe_itoday at 4:33 AM

There’s no way around it? Can’t you obfuscate as generic data and use keys to map to the real data?

show 1 reply
fragmedetoday at 2:52 AM

What custom prompt do you have set up? If you tell it you're occupation, does it turn helpful? There was a study that if you tell models they tested that you're a patient, it would refuse, but tell it you're a doctor and suddenly it turns helpful.

show 1 reply
5d41402abc4btoday at 4:32 AM

what prompts do you use for this?

UltraSaneyesterday at 11:49 PM

Anthropic knows it refuses too much, they want to be very cautious to avoid any scandals. I think this is why they want to store all Fable and Mythos chats for 30 days so they can use the data to improve.

show 2 replies
garciasnyesterday at 10:54 PM

I wonder if it sees Healthcare companies being targeted and that's why it's freaking out; clearly they have some pretty stupid regexes in the harness to detect this sort of shit.

e: I quit the session and went back in. Set it to Fable and told it to continue the last session. It's moving along as if none of that had happened.

How weird.

show 1 reply