I got Claude to self reference and update its own instructions to solve making a typed proxy API of ...

dataviz1000 • yesterday at 7:34 PM • 8 replies • view on HN

I got Claude to self reference and update its own instructions to solve making a typed proxy API of any website. After a week, scores of iterations, it can reverse engineer any website. The first few days I had to be deeply involved with each iteration loop. Domain knowledge is helpful. Each time I saw a problem I would ask Claude to update its instructions so it doesn't happen again. Then less and less. Eventually it got to the point it was updating and improving the metrics every iteration unsupervised.

Edit: This is going to have huge ramifications for the tech security industry as these systems will be able to break security systems as easily it solved the proof. The sooner the good guys, if there are any left, understand this the better it will be for everybody.

> Super interesting but what does this mean for us mere mortals?

I would go for a 2 or 3 hour walk with my phone using the remote control feature looking every 5 - 10 minutes to make sure it doesn't need human help. I went to the coffeeshop and drank very good coffee listening to music. Then at night I sat and had a beer thinking about T.S. Eliot's 'The Wasteland', the effect of industrialization in England at that time and his views of how ennui affected the aristocracy.

Replies

DrewADesign • yesterday at 8:04 PM

> I went to the coffeeshop and drank very good coffee listening to music. Then at night I sat and had a beer thinking about T.S. Eliot's 'The Wasteland', the effect of industrialization in England at that time and his views of how ennui affected the aristocracy.

Well, for those among us that are not aristocracy already, except for the vanishingly small number of people required to oversee such processes, we’re probably the closest we’re going to get to it. If they don’t need people to do the tech labor, we’ve got way more people than we need, so that’s a huge oversupply of tech skills, which means tech skills are rapidly becoming worthless. Glad to see how fast we’re moving in our very own race to the bottom!

➕ show 3 replies

dunder_cat • yesterday at 9:13 PM

> Edit: This is going to have huge ramifications for the tech security industry as these systems will be able to break security systems as easily it solved the proof. The sooner the good guys, if there are any left, understand this the better it will be for everybody.

What can the good guys do? Fire up Claude to improve their systems? Unless you have it working fully autonomously to counter-act abuse, I don't see how you can beat the "bad guys". There may be some industries where this is a solved problem (e.g. you can do all the validation server-sided, religiously follow best practices to prevent and mitigate abuse), but a lot of stuff like multiplayer video games will be doomed unless they move to a "you must use a locked down system we control" model. I honestly don't consider it liberating as someone that has various hobby projects, that now in addition to plain old DDoS I'll also have people spin up layer 7 attacks with just their credit card. It almost makes me want to give up instead of pushing forward in a world where the worst of the worst has access to the best of the best.

➕ show 1 reply

frizlab • yesterday at 8:10 PM

> I would go for a 2 or 3 hour walk with my phone using the remote control feature looking every 5 - 10 minutes to make sure it doesn't need human help.

That is a nightmarish scenario tbh

➕ show 3 replies

ale • yesterday at 9:16 PM

This type of slop comment is somehow worse than spam.

>After a week, scores of iterations, it can reverse engineer any website

Cool, let’s see the proof.

➕ show 2 replies

troupo • yesterday at 8:38 PM

> I would go for a 2 or 3 hour walk with my phone using the remote control feature looking every 5 - 10 minutes

2-3 hours "walking" while having to check in every 5-10 minutes?

If I have to check in every 5-10 minutes, I won't taste coffee or hear that there's good music playing.

➕ show 1 reply

colechristensen • yesterday at 9:02 PM

I have similar amounts of success (pretty good!) standing in line at a coffee shop talking to people who work for me through some action that needs to be taken and doing the same with AI.

However I do not trust AI anywhere near as much as I trust the humans. The AI is super capable but also occasionally a psychopath toddler. I sat in amused astonishment when faced with job 2 not running because job 1 was failing Claude went in to the database, changed the failure record to success, triggered job 2 which produced harmful garbage, and then claimed victory. Only the most troubled person would even think of doing that, but Claude thought it was the best solution.

➕ show 1 reply

virtue3 • yesterday at 9:00 PM

That's fucking insane. Thank you for sharing.

I had a bad feeling we were basically already there.

alt Hacker News

Replies