i suspect this is highly dependent on what you're working on from my experience if you give t...

vanuatu • last Tuesday at 5:22 AM • 1 reply • view on HN

i suspect this is highly dependent on what you're working on

from my experience if you give the models a way to self-verify correctness they succeed basically 100% of the time

Replies

> from my experience if you give the models a way to self-verify correctness they succeed basically 100% of the time

My experience is that if you can get the model to one shot the task, you'll do fine but if it has to iterate it leaves things worse than before and almost always requires human intervention after burning through an enormous amount of tokens

alt Hacker News

Replies