Genuine question: why is everyone rolling out their own sandbox wrappers around VMs/Docker for ...

kjok • today at 4:05 PM • 5 replies • view on HN

Genuine question: why is everyone rolling out their own sandbox wrappers around VMs/Docker for agents?

Replies

I know, right? The day I initially thought about posting this, there was another one called `yolo-box`. (That attempt--my very first post--got me instantly shadow-banned due to being on a VPN, which led to an unexpected conversation with @dang, which led to some improvements, which led to it being a week later.)

I think it's the convergence of two things. First, the agents themselves make it easier to get exactly what you want; and second, the OEM solutions to these things really, really aren't good enough. CC Cloud and Codex are sort of like this, except they're opaque and locked down, and they work for you or they don't.

It reminds me a fair bit of 3D printer modding, but with higher stakes.

catlifeonmars • today at 4:19 PM

Because of findings like this

https://www.anthropic.com/research/small-samples-poison

(A small number of samples can poison LLMs of any size) to save clicks to read the headline

The way I think of it is, coding agents are power tools. They can be incredibly useful, but can also wreak a lot of havoc. Anthropic (et al) is marketing them to beginners and inevitably someone is going to lose their fingers.

➕ show 1 reply

derpsteb • today at 4:12 PM

My experience is that neither has a good UX for what I usually try to do with coding agents. The main problem I see is setup/teardown of the boxes and managing tools inside them.

m-hodges • today at 4:12 PM

It all feels like temporary workflow fixes until The Agent Companies just ship their opinionated good enough way to do it.

➕ show 2 replies

dist-epoch • today at 5:55 PM

Because people want to run agents in yolo mode without worrying that it's going to delete the whole computer.

And once you put the agent in a VM/container it's much easier to run 10 of them in parallel without mutual interference.

➕ show 1 reply

alt Hacker News

Replies