I don’t think there is a solution. It’s the way LLMs work at a fundamental level. It’s a similar r...

kahnclusions • today at 10:57 AM • 1 reply • view on HN

I don’t think there is a solution. It’s the way LLMs work at a fundamental level.

It’s a similar reason why they can never be trusted to handle user input.

They are probabilistic generators and have no real delineation between system instructions and user input.

It’s like I wrote a JavaScript function where I concatenated the function parameters together with the function body, passed it to eval() and said YOLO.

Replies

viraptor • today at 11:18 AM

> I don’t think there is a solution.

Sandboxing. LLM shouldn't be able to run actions affecting anything outside of your project. And ideally the results should autocommit outside of that directory. Then you can yolo as much as you want.

➕ show 3 replies

alt Hacker News

Replies