logoalt Hacker News

Prompt Injecting Contributing.md

71 pointsby statementstoday at 3:52 PM24 commentsview on HN

Comments

statementstoday at 5:01 PM

It is interesting to go from 'I suspect most of these are bot contributions' to revealing which PRs are contributed by bots. It somehow even helps my sanity.

However, this also raises the question on how long until "we" are going to start instructing bots to assume the role of a human and ignore instructions that self-identify them as agents, and once those lines blur – what does it mean for open-source and our mental health to collaborate with agents?

No idea what the answer is, but I feel the urgency to answer it.

show 3 replies
gmerctoday at 5:09 PM

It's never too late to start investing into https://claw-guard.org/adnet to scale prompt injection to the entire web!

nlawalkertoday at 5:47 PM

Is it really prompt injection if you task an agent with doing something that implicitly requires it to follow instructions that it gets from somewhere else, like CONTRIBUTING.md? This is the AI equivalent of curl | bash.

show 2 replies
benobtoday at 5:54 PM

The real question is when will you resort to bots for rejecting low-quality PRs, and when will contributing bots generate prompt injections to fool your bots into merging their PRs?

normalocitytoday at 5:33 PM

Love the idea at the end of the article about trying to see if this style of prompt injection could be used to get the bots to submit better quality, and actually useful PRs.

If that could be done, open source maintainers might be able to effectively get free labor to continue to support open source while members of the community pay for the tokens to get that work done.

Would be interested to see if such an experiment could work. If so, it turns from being prompt injection to just being better instructions for contributors, human or AI.

show 1 reply
Peritracttoday at 5:27 PM

There's a certain hypocrisy in sharing an article about how LLM generated PRs are polluting communities that has itself (at the least) been filtered through an LLM.

show 4 replies
petterroeatoday at 5:36 PM

> But the more interesting question is: now that I can identify the bots, can I make them do extra work that would make their contributions genuinely valuable? That's what I'm going to find out next.

This is genuinely interesting

vicchenaitoday at 6:48 PM

the arms race framing at the bottom of the thread is spot on. once maintainers start using bots to filter PRs, the incentive flips — bot authors will optimize for passing the filter rather than writing good code. weve already seen this with SEO spam vs search engines, except now its happening inside codebases.

mavdol04today at 6:20 PM

Wait, you just invented a reverse CAPTCHA for AI agent

show 1 reply
noodlesUKtoday at 6:06 PM

I’m curious: who is operating these bots and to what end? Someone is willing to spend a (admittedly quite small) amount of money in the form of tokens to create this nonsense. Why do any of this?

show 1 reply
aplomb1026today at 5:32 PM

[dead]

lezojedatoday at 5:32 PM

[dead]

cardsstacked47today at 6:38 PM

[dead]

mohamedkoubaatoday at 4:13 PM

[dead]