Hacker News

kuerbel today at 12:55 PM

Might be, but I just can't imagine a customer being fine with a loose-cannon agent in their environment. Coding agents, for example, are already ignoring instructions. Who is to say that Claude's solution to, say, a slow backup isn't deleting the backup?


Replies

foobar10000 today at 1:10 PM

Imagine an agent shadowing all your terminals, providing ideas and asking to run commands that will let it verify the hypotheses it comes up with, while at the same time doing research on vendor docs, etc...

Quite safe, and already a force multiplier - this would be a harness. Maybe let it write to a shadow system with similar (ideally the same) hardware to verify its hypotheses about how the system works, etc...
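
To make the harness idea a bit more concrete, here is a rough Python sketch of the "agent asks, human approves" loop. Everything in it is made up for illustration (`Proposal`, `run_with_approval`, `run_locally`, the `approve` and `execute` callbacks); it is not how any real agent product works, just one way such a gate could look. The `execute` hook is where you could point commands at a shadow system instead of the real one.

```python
import shlex
import subprocess
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Proposal:
    """A command the agent wants to run, plus the hypothesis it would test.
    (Hypothetical structure, for illustration only.)"""
    command: str
    hypothesis: str


def run_locally(argv: List[str]) -> str:
    """Default executor: run the command on this machine and return stdout."""
    result = subprocess.run(
        argv, capture_output=True, text=True, timeout=30, check=False
    )
    return result.stdout


def run_with_approval(
    proposal: Proposal,
    approve: Callable[[Proposal], bool],
    execute: Callable[[List[str]], str] = run_locally,
) -> Optional[str]:
    """Run the proposed command only if the approver says yes.

    Returns the command output, or None if the proposal was rejected.
    Nothing is executed before approval.
    """
    if not approve(proposal):
        return None
    # Split into argv so the command is never passed through a shell.
    argv = shlex.split(proposal.command)
    return execute(argv)
```

In use, `approve` would be a prompt in the operator's terminal, and `execute` could be swapped for something that runs the command on the shadow box, so the agent never touches production without a human saying yes.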