logoalt Hacker News

newsicanuselast Tuesday at 12:07 AM1 replyview on HN

The code that gets stuff done instead of beating around the bush making unxpected errors


Replies

vanuatulast Tuesday at 5:22 AM

i suspect this is highly dependent on what you're working on

from my experience if you give the models a way to self-verify correctness they succeed basically 100% of the time

show 1 reply