I'm surprised to see people getting value from "web sandbox"-type setups, where you d...

scubbo • last Sunday at 9:57 PM • 5 replies

I'm surprised to see people getting value from "web sandbox"-type setups, where you don't actually have access to the source code. Are folks really _that_ confident in LLMs as to entirely give up the ability to inspect the source code, or to interact with a running local instance of the service? Certainly that would be the ideal, but I'm surprised that confidence is currently running that high.

Replies

simonw • last Sunday at 10:16 PM

I still get the full source code back at the end; I tell it to include the code it wrote in the PR.

I also wrote my own tool to extract and format the complete transcript. It gives me back things like this, where I can see everything it did, including files and scripts it didn't commit. Here's an example: https://gistpreview.github.io/?3a76a868095c989d159c226b7622b...

theptip • last Monday at 8:21 PM

Right - I’m missing how you get the source code in the OP. It says you tmux in with ssh agent forwarding for GH. But you can’t do that on your iOS device? So you have to set up all your repos in the morning before leaving the house, then collect and push all your branches when you return home?

I could imagine this working for a small number of branches/changes.

smarx007 • last Sunday at 10:15 PM

The output from Jules is a PR. And then it's a toss-up between "spot on, let's merge" and "nah, needs more work, I will check out the branch and fix it properly when I am at the keyboard". And you see the current diff on the webpage while the agent is working.

nl • last Sunday at 11:46 PM

Claude Code on the web, ChatGPT Codex and Google Jules are not the same as Claude, ChatGPT and Gemini. They are entire apps where you authorize GitHub access and they work via PRs.

They'll include screenshots on your PRs etc.

I like using them a lot when I can.

suninsight • last Monday at 4:43 PM

[dead]
