I feel like we are just inching closer and closer to a world where rapid iteration of software will ...

nickandbro • today at 5:13 AM • 20 replies

I feel like we are just inching closer and closer to a world where rapid iteration of software will be the default. For example, a trusted user gives feedback -> the feedback gets curated into a ticket by an AI agent, then turned into a PR by an agent, then reviewed by an agent, before being deployed by an agent. We are maybe one or two steps from the flywheel being completed. Or maybe we are already there.

Replies

chatmasta • today at 5:36 AM

I love everything about this direction except for the insane inference costs. I don’t mind the training costs, since models are commoditized as soon as they’re released. Although I do worry that if inference costs drop, the companies training the models will have no incentive to publish their weights, because inference revenue is where they recoup the training cost.

Either way… we badly need more innovation in inference price per performance, on both the software and hardware side. It would be great if software innovation unlocked inference on commodity hardware. That’s unlikely to happen, but today’s bleeding edge hardware is tomorrow’s commodity hardware so maybe it will happen in some sense.

If Taalas can pull off burning models into hardware with a two month lead time, that will be huge progress, but still wasteful because then we’ve just shifted the problem to a hardware bottleneck. I expect we’ll see something akin to gameboy cartridges that are cheap to produce and can plug into base models to augment specialization.

But I also wonder if anyone is pursuing some more insanely radical ideas, like reverting back to analog computing and leveraging voltage differentials in clever ways. It’s too big brain for me, but intuitively it feels like wasting entropy to reduce a voltage spike to 0 or 1.

Leptonmaniac • today at 6:04 AM

I think that as a user I'm so far removed from the actual (human) creation of software that if I think about it, I don't really care either way. Take for example this article on Hacker News: I am reading it in a custom app someone programmed, which pulls articles hosted on Hacker News which themselves are on some server somewhere and everything gets transported across wires according to a specification. For me, this isn't some impressionist painting or heartbreaking poem - the entity that created those things is so far removed from me that it might be artificial already. And that's coming from a kid of the 90s with some knowledge in cyber security, so potentially I could look up the documentation and maybe even the source code for the things I mentioned; if I were interested.

theredbeard • today at 7:01 AM

We haven’t been inching closer to users writing a half-decent ticket in decades though.

andy_ppp • today at 9:51 AM

Users are often incorrect about what the software should actually be doing and don’t see the bigger picture.

mindwok • today at 8:13 AM

I think Anthropic will launch backend hosting off the back of their Bun acquisition very soon. It makes sense to basically run your entire business out of Claude, and share bespoke apps built by Claude Code for whatever your software needs are.

heavyset_go • today at 7:14 AM

Feedback loops like that would be an exercise in raising garbage-in->garbage-out to exponential terms.

It's the "robots will just build/repair themselves" trope, but the robots are agents.

jvuygbbkuurx • today at 5:30 AM

Trusted user like Jia Tan.

tuo-lei • today at 6:50 AM

The missing piece for me is post-hoc review.

A PR tells me what changed, but not how an AI coding session got there: which prompts changed direction, which files churned repeatedly, where context started bloating, what tools were used, and where the human intervened.

I ended up building a local replay/inspection tool for Claude Code / Cursor sessions mostly because I wanted something more reviewable than screenshots or raw logs.

slopinthebag • today at 5:50 AM

What kind of software are people building where AI can just one shot tickets? Opus 4.6 and GPT 5.4 regularly fail when dealing with complicated issues for me.

shafyy • today at 9:46 AM

Haha sure, let's just let every user add their feedback to the software.

edf13 • today at 7:11 AM

Or perhaps we end up where all software is self-evolving via agents… adjusting dynamically to meet the user's needs.

eru • today at 6:52 AM

Instead of having a trusted user, you can also do statistics on many users.

(That's basically what A/B testing is about.)
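As a rough sketch of what "statistics on many users" can look like, here is a two-proportion z-test comparing conversion rates between a control group and a variant. The function name and the example counts are made up for illustration, and a real experiment would also need a sample size and significance threshold chosen up front.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Compare conversion rates of group A (control) and group B (variant).

    Returns (z, p_value), where p_value is the two-sided p-value under the
    null hypothesis that both groups share the same conversion rate.
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate: the best estimate of the shared rate if there is no effect.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        # Every user converted or none did, so there is nothing to distinguish.
        return 0.0, 1.0
    z = (p_b - p_a) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 1000 users per group, 100 vs 150 conversions.
z, p_value = two_proportion_z_test(100, 1000, 150, 1000)
print(f"z = {z:.2f}, p = {p_value:.4f}")
```

The point is that no single user has to be trusted: the decision comes from the aggregate difference between the groups.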

hyperionultra • today at 6:55 AM

The "trusted user" can also be an agent.

bredren • today at 5:49 AM

What you're describing is absolutely where we're headed.

But the entire SWE apparatus can be handled.

Automated A/B testing of the feature. Progressive exposure deployment of changes, you name it.

tossandthrow • today at 5:33 AM

I think the AI agent will directly make a PR - tickets are for humans with limited mental capacity.

At least in my company we are close to that flywheel.

overfeed • today at 8:15 AM

> I feel like we are just inching closer and closer to a world where rapid iteration of software will be by default.

There's a lot of experimentation right now, but one thing that's guaranteed is that the data gatekeepers will slam the door shut[1] - or install a toll-booth when there's less money sloshing about, and the winners and losers are clear. At some point in the future, Atlassian and GitHub may not grant Anthropic access to your tickets unless you're on the relevant tier with the appropriate "NIH AI" surcharge.

1. AI does not suspend or supplant good old capitalism and the cult of profit maximization.

MattGaiser • today at 5:35 AM

I am already there with a project/startup with a friend. He writes up an issue in GitHub and there is a job that automatically triggers Claude to take a crack at it and throw up a PR. He can see the change in an ephemeral environment. He hasn't merged one yet, but it will get there one day for smaller items.

I am already at the point where because it is just the two of us, the limiting factor is his own needs, not my ability to ship features.

yieldcrv • today at 5:34 AM

We do feedback to ticket automatically.

We don't have product managers or technical ticket writers of any sort.

But us devs are still choosing how to tackle the ticket. We def don't have to, as I’m solving the tickets with AI. I could automate my job away if I wanted, but I wouldn't trust the result, as I give a degree of input and steering, and there’s bigger picture considerations it's not good at juggling, for now.

charcircuit • today at 5:19 AM

Then it sets up telemetry and experiments with the change. Then, if the data looks good, an agent ramps it up to more users; otherwise it removes it.
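A minimal sketch of that ramp-or-remove decision, assuming the telemetry boils down to an error rate for users with the change and one for users without it. The step percentages, the tolerance, and the function name are all hypothetical.

```python
# Hypothetical exposure schedule, as a percentage of users who see the change.
RAMP_STEPS = [1, 5, 25, 50, 100]


def next_exposure(current_pct, treatment_error_rate, control_error_rate,
                  tolerance=0.01):
    """Decide the next exposure percentage for a change.

    Returns 0 (remove the change) if users with the change see an error rate
    worse than the control group by more than `tolerance`. Otherwise returns
    the next step in RAMP_STEPS, staying at 100 once fully rolled out.
    """
    if treatment_error_rate > control_error_rate + tolerance:
        return 0
    for step in RAMP_STEPS:
        if step > current_pct:
            return step
    return 100


# Healthy metrics at 5% exposure: move on to the next step.
print(next_exposure(5, treatment_error_rate=0.02, control_error_rate=0.02))
# Clearly worse metrics at 25% exposure: pull the change.
print(next_exposure(25, treatment_error_rate=0.10, control_error_rate=0.02))
```

An agent running this loop still depends on someone picking the metric and the tolerance, which is where the bigger-picture judgment comes in.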

eranation • today at 5:58 AM

Um, we are already there...
