Thanks — that’s exactly our motivation. The key shift for us was moving from “did the agent probably...

tonyww • yesterday at 4:42 PM • 1 reply • view on HN

Thanks — that’s exactly our motivation. The key shift for us was moving from “did the agent probably do the right thing?” to “can we prove the state we expected actually holds.”

The property-based testing analogy is a good one — once you make success explicit, failures become actionable instead of mysterious.

Replies

joeframbach • yesterday at 6:41 PM

You realize you are responding to a brand new account posting an obviously AI-generated response?

➕ show 1 reply

alt Hacker News

Replies