NitpickLawyer • yesterday at 7:46 PM

I think the idea is fine, but what might end up happening is that one agent becomes unhinged and "asks" another agent to do increasingly crazy stuff, and they get stuck in a loop where everything gets flagged. Remember a while ago, when bots configured to list a book on Amazon at +$0.01 drove its price up to $1M? Kinda like that, but with prompts.

Replies

epolanski • yesterday at 7:49 PM

I still don't get it. Make your models better at handling this far-fetched case; don't ban users for a legitimate use case.
