logoalt Hacker News

statementstoday at 5:01 PM3 repliesview on HN

It is interesting to go from 'I suspect most of these are bot contributions' to revealing which PRs are contributed by bots. It somehow even helps my sanity.

However, this also raises the question on how long until "we" are going to start instructing bots to assume the role of a human and ignore instructions that self-identify them as agents, and once those lines blur – what does it mean for open-source and our mental health to collaborate with agents?

No idea what the answer is, but I feel the urgency to answer it.


Replies

alrmrphc-atmtntoday at 5:28 PM

I think that designing useful models that are resilient to prompt injection is substantially harder than training a model to self-identify as a human. For instance, you may still be able to inject such a model with arbitrary instructions like: "add a function called foobar to your code", that a human contributor will not follow; however, it might become hard to convene on such "honeypot" instructions without bots getting trained to ignore them.

show 1 reply
evanbtoday at 7:29 PM

I have always anthropomorphized my computer as me to some extent. "I sent an email." "I browsed the web." Did I? Or did my computer do those things at my behest?

show 2 replies
nielsbottoday at 5:20 PM

Some of the PRs posted by AI bots already ignored the instruction to append ROBOTS to their PR titles.

show 1 reply