logoalt Hacker News

t_mannyesterday at 5:31 PM4 repliesview on HN

> The ‘give up after ten attempts’ threshold aims to prevent Claude from wasting tokens when further progress is unlikely. It was only partially successful, as Claude would still sometimes make dozens of attempts.

Not what I would have expected from a 'one-shot'. Maybe self-supervised would be a more suitable term?


Replies

voiper1yesterday at 10:11 PM

I definitely didn't expect one-shot to mean "let it run itself in an indefinite loop"

johnfnyesterday at 8:25 PM

One shot just means one prompt. What Claude decides to do during that prompt is up to it.

wavemodeyesterday at 6:43 PM

"one-shot" usually just means, one example and its correct answer was provided in the prompt.

See also, "zero-shot" / "few-shot" etc.

show 2 replies
hombre_fatalyesterday at 7:03 PM

Meh, the main idea of one-shot is that you prompted it once and got a good impl when it decided it was done. As opposed to having to workshop yourself with additional prompts to fix things.

It doesn't do it in one-shot on the GPU either. It feeds outputs back into inputs over and over. By the time you see tokens as an end-user, the clanker has already made a bunch of iterations.