Models aren't deterministic - every time you tried to re-apply the change you'd likely get different output (unless you fed the current code back into the re-apply step and let the model just recommend changes)
If the result is always provably correct, it doesn't matter whether or not it's different at the code level. People interested in systems like this believe that the outcome of what the code does is infinitely more important than the code itself.
Let's rephrase:
Since nobody involved actually cares whether the code works or not, it doesn't matter whether it's a different wrong thing each time.
> If the result is always provably correct, it doesn't matter whether or not it's different at the code level. People interested in systems like this believe that the outcome of what the code does is infinitely more important than the code itself.
If the spec is so complete that it covers everything, you might as well write the code.
The benefit of writing a spec and having the LLM code it is that the LLM will fill in a lot of blanks. And it is this filling in of blanks that is non-deterministic.
Sure, but where are the formal acceptance tests to validate against?
Besides, you can deterministically generate bad code, and non-deterministically generate good code.
I would be very comfortable with this approach: re-run 100 times with different seeds. If the outcome is the same every time, you're reliably good to go.
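A minimal sketch of what that check could look like. Note that `generate_code` here is a hypothetical stand-in for an LLM call, simulated as a seeded function so the example runs; "the same outcome" is judged by behavioral acceptance tests rather than textual identity:

```python
import random

def generate_code(seed: int) -> str:
    """Hypothetical stand-in for an LLM code-generation call.
    The source text varies with the seed (different variable names),
    but the behavior should not."""
    rng = random.Random(seed)
    name = rng.choice(["total", "result", "acc"])
    return f"def add(a, b):\n    {name} = a + b\n    return {name}"

def acceptance_tests(source: str) -> bool:
    """Behavioral check: execute the generated code and test its output."""
    namespace = {}
    exec(source, namespace)
    add = namespace["add"]
    return add(2, 3) == 5 and add(-1, 1) == 0

# Re-run with many seeds: the generated text differs run to run,
# but every run must pass the same acceptance tests before we
# call the outcome "the same every time".
results = [acceptance_tests(generate_code(seed)) for seed in range(100)]
print(all(results))  # True only if every seeded run behaves identically
```

This only demonstrates the comparison mechanism; the hard part the thread is debating is whether the acceptance tests can ever be complete enough to stand in for "provably correct".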
That "if" at the beginning of your sentence is doing a whole lot of work. Indeed, if we could formally and provably (another extremely loaded word) generate good code, that'd be one thing, but proving correctness is one of those basically impossible tasks.