logoalt Hacker News

alistairSHyesterday at 7:20 PM8 repliesview on HN

You're not alone.

I think the author was doing some sort of circular prompt injection between two instances of Claude? The author claims "I'm just scaffolding a project" but that doesn't appear to be the case, or what resulted in the ban...


Replies

Romario77yesterday at 7:51 PM

One Claude agent told other Claude agent via CLAUDE.md to do things certain way.

The way Claude did it triggered the ban - i.e. it used all caps which apparently triggers some kind of internal alert, Anthropic probably has some safeguards to prevent hacking/prompt injection and what the first Claude did to CLAUDE.md triggered this safeguard.

And it doesn't look like it was a proper use of the safeguard, they banned for no good reason.

healsdatayesterday at 10:29 PM

The author code have easily shared the last version of Claude.md that had the all caps or whatever, but didn't. Points to something fishy in my mind.

falloutxyesterday at 7:49 PM

This tracks with Anthropic, they are actively hostile to security researchers.

cryptonectortoday at 7:26 AM

I suspeect that having Claudes talking to Claudes is a very bad idea from Anthropic's point of view because that could easily consume a ton of resources doing nothing useful.

layer8yesterday at 9:40 PM

It wasn’t circular. TFA explains how the author was always in the loop. He had one Claude instance rewrite the CLAUDE.MD of another Claude instance whenever the second one made a mistake, but relaying the mistake to the first instance (after recognizing it in the first place) was done manually by the author.

redeemanyesterday at 7:39 PM

i have no idea what he was actually doing either, and what exactly is it one isnt allowed to use claude to do?

rvbayesterday at 7:34 PM

What is wrong with circular prompt injection?

The "disabled organization" looks like a sarcastic comment on the crappy error code the author got when banned.

show 1 reply
lazyfanatic42yesterday at 7:32 PM

Author really comes off unhinged throughout the article to be frank.

show 3 replies