So they wanna use AI to fix AI. Sam himself said it doesn't work that well.

behnamoh • yesterday at 7:11 PM • 4 replies • view on HN

Replies

It's much more interesting than that. They're using this document as part of the training process, presumably backed up by a huge set of benchmarks and evals and manual testing that helps them tweak the document to get the results they want.

jdiff • yesterday at 7:18 PM

"Use AI to fix AI" is not my interpretation of the technique. I may be overlooking it, but I don't see any hint that this soul doc is AI generated, AI tuned, or AI influenced.

Separately, I'm not sure Sam's word should be held as prophetic and unbreakable. It didn't work for his company, at some previous time, with their approaches. Sam's also been known to tell quite a few tall tales, usually about GPT's capabilities, but tall tales regardless.

jph00 • yesterday at 7:19 PM

If Sam said that, he is wrong. (Remember, he is not an AI researcher.) Anthropic have been using this kind of approach from the start, and it's fundamental to how they train their models. They have published a paper on it here: https://arxiv.org/abs/2212.08073

drcongo • yesterday at 7:13 PM

He says a lot of things, most of it lies.

alt Hacker News

Replies