
throwmeaway820 last Thursday at 7:02 PM

> it appears to me to be really hard to guard against

I don't want to sound glib, but one could simply not let an LLM execute arbitrary code without reviewing it first, or only let it execute code inside an isolated environment designed to run untrusted code

the idea of letting an LLM execute code it's dreamt up, with no oversight, in an environment you care about, is absolutely bananas to me
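
to make that concrete: a rough sketch of the "isolated environment" option (hypothetical helper, assumes Docker is installed and that a no-network, read-only, resource-limited container is an acceptable isolation boundary for your threat model) could look like

    import subprocess

    def run_untrusted(code: str, timeout: int = 30) -> subprocess.CompletedProcess:
        """Run LLM-generated Python in a throwaway container: no network,
        read-only root filesystem, resource limits, unprivileged user."""
        cmd = [
            "docker", "run", "--rm", "-i",
            "--network", "none",                # no outbound access
            "--read-only", "--tmpfs", "/tmp",   # immutable root FS, scratch in tmpfs
            "--memory", "256m", "--cpus", "1", "--pids-limit", "64",
            "--cap-drop", "ALL",
            "--security-opt", "no-new-privileges",
            "--user", "65534:65534",            # run as nobody
            "python:3.12-slim", "python3", "-",
        ]
        # the generated code goes in via stdin; stdout/stderr come back for review
        return subprocess.run(cmd, input=code, text=True,
                              capture_output=True, timeout=timeout)

a real sandbox (gVisor, Firecracker, a disposable VM) is stronger still, but even something like this beats piping model output straight into a shell on a machine you care about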


Replies

blibble last Thursday at 7:44 PM

> the idea of letting an LLM execute code it's dreamt up, with no oversight, in an environment you care about, is absolutely bananas to me

but if a skilled human has to check everything it does, then "AI" becomes worthless

hence... YOLO

alexjplant last Thursday at 7:55 PM

I think a nice compromise would be to restrict agentic coding workflows to cloud containers and a web interface. Bootstrap a project and its new functional foundations locally using traditional autocomplete/chat methods (which you want to do anyway to avoid a foundation of StackOverflow-derived slop), then implement additional features using the cloud agents. Don't commit any secrets to SCM, and curate the tools that these agents can use (something like the allowlist sketched below). This way your dev laptops are firmly in human control (with IDEs freed up for actual coding) while LLMs are safely leveraged. Win-win.
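
By "curate the tools" I mean roughly this (hypothetical names, just to show the shape: the agent runner only dispatches calls that hit an explicit allowlist, and the allowed tools are read-only or test-only):

    import subprocess

    # hypothetical allowlist for the cloud agent container: read/test tools only,
    # nothing that opens a shell, touches the network, or pushes to SCM
    ALLOWED_TOOLS = {
        "read_file": lambda path: open(path, encoding="utf-8").read(),
        "run_tests": lambda: subprocess.run(
            ["pytest", "-q"], capture_output=True, text=True
        ).stdout,
    }

    def dispatch(name: str, args: dict) -> str:
        """Gate every tool call the agent proposes through the allowlist."""
        if name not in ALLOWED_TOOLS:
            raise PermissionError(f"agent requested disallowed tool: {name!r}")
        return ALLOWED_TOOLS[name](**args)

Anything not in the dict simply can't run, no matter what the model asks for.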

sigmonsays last Thursday at 7:04 PM

just wait until the exploit is so heavily obfuscated that you just skim the review and allow it so the project gets done.
