logoalt Hacker News

nijaveyesterday at 11:49 PM1 replyview on HN

I gave Sonnet 4.6 root access to my Android via adb and it wrote frida scripts to help me recover the encryption keys from SwiftBackup

Also gave Opus 4.6 access to a Kubernetes container and it was able to use pyrasite (a Python replacement that attached to a running process with gdb) to debug a "memory leak" in Python

I don't think I'd let them run unattended on anything I care about especially if there weren't backups, but they've never tried to break anything while supervised.

Usually it's significantly faster and more accurate to give the LLM/harness access to the thing to debug then to try to copy/paste back and forth.


Replies

andaitoday at 12:11 AM

It's been a while but last year I'd see posts like "Claude nuked my homedir / entire drive" on a regular basis. I don't know if they fixed that (or just made it very rare).

show 1 reply