> The ‘give up after ten attempts’ threshold aims to prevent Claude from wasting tokens when further progress is unlikely. It was only partially successful, as Claude would still sometimes make dozens of attempts.
Not what I would have expected from a 'one-shot'. Maybe self-supervised would be a more suitable term?
One-shot just means one prompt. What Claude decides to do during that prompt is up to it.
"One-shot" usually just means that one example and its correct answer were provided in the prompt.
See also "zero-shot", "few-shot", etc.
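To make that usage concrete, here is a rough sketch of how the k in "k-shot" counts worked examples in the prompt rather than attempts by the model. The `build_prompt` helper and the Q/A layout are made up for illustration and are not taken from any particular library.

```python
def build_prompt(task, examples, query):
    """Build a k-shot prompt: k worked examples, then the real query.

    Hypothetical helper for illustration only; the Q/A format is an assumption.
    """
    parts = [task]
    for question, answer in examples:
        # Each (question, answer) pair is one "shot".
        parts.append(f"Q: {question}\nA: {answer}")
    # The actual query is left unanswered for the model to complete.
    parts.append(f"Q: {query}\nA:")
    return "\n\n".join(parts)


examples = [("2 + 2", "4"), ("3 + 5", "8")]

zero_shot = build_prompt("Add the numbers.", [], "7 + 6")            # no examples
one_shot = build_prompt("Add the numbers.", examples[:1], "7 + 6")   # one example
few_shot = build_prompt("Add the numbers.", examples, "7 + 6")       # several examples
```

Under this definition, how many times the model iterates after receiving the prompt has no bearing on whether the prompt is zero-, one-, or few-shot.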
Meh, the main idea of one-shot is that you prompted it once and got a good impl when it decided it was done, as opposed to having to workshop it yourself with additional prompts to fix things.
It doesn't do it in one shot on the GPU either. It feeds outputs back into inputs over and over. By the time you see tokens as an end user, the clanker has already made a bunch of iterations.
I definitely didn't expect one-shot to mean "let it run itself in an indefinite loop".