logoalt Hacker News

onion2ktoday at 3:51 PM9 repliesview on HN

Models aren't deterministic - every time you would try to re-apply you'd likely get different output (without feeding the current code into the re-apply and let it just recommend changes)

If the result is always provably correct it doesn't matter whether or not it's different at the code level. People interested in systems like this believe that the outcome of what the code does is infinity more important than the code itself.


Replies

sensanatytoday at 5:32 PM

That if at the beginning of your sentence is doing a whole lot of work. Indeed, if we could formally and provably (another extremely loaded word) generate good code that'd be one thing, but proving correctness is one of those basically impossible tasks.

show 1 reply
dsr_today at 4:16 PM

Let's rephrase:

Since nobody involved actually cares whether the code works or not, it doesn't matter whether it's a different wrong thing each time.

show 1 reply
tomtomtom777today at 4:58 PM

> If the result is always provably correct it doesn't matter whether or not it's different at the code level. People interested in systems like this believe that the outcome of what the code does is infinity more important than the code itself.

If the spec is so complete that it covers everything, you might as well write the code.

The benefit of writing a spec and having the LLM code it, is that the LLM will fill in a lot of blanks. And it is this filling in of blanks that is non-deterministic.

show 1 reply
SpaceNoodledtoday at 3:52 PM

That's a huge "if."

show 1 reply
FrankRay78today at 4:55 PM

Sure, but where are the formal acceptance tests to validate against?

0-_-0today at 5:40 PM

Besides, you can deterministically generate bad code, and not deterministically generate good code.

__loamtoday at 4:14 PM

The code is what the code does.

show 1 reply
Copyrightesttoday at 4:00 PM

[dead]

jrm4today at 4:00 PM

I would be very comfortable with - re-run 100 times with different seeds. If the outcome is the same every time, you're reliably good to go.

show 1 reply