logoalt Hacker News

behnamohyesterday at 7:11 PM4 repliesview on HN

So they wanna use AI to fix AI. Sam himself said it doesn't work that well.


Replies

simonwyesterday at 7:17 PM

It's much more interesting than that. They're using this document as part of the training process, presumably backed up by a huge set of benchmarks and evals and manual testing that helps them tweak the document to get the results they want.

jdiffyesterday at 7:18 PM

"Use AI to fix AI" is not my interpretation of the technique. I may be overlooking it, but I don't see any hint that this soul doc is AI generated, AI tuned, or AI influenced.

Separately, I'm not sure Sam's word should be held as prophetic and unbreakable. It didn't work for his company, at some previous time, with their approaches. Sam's also been known to tell quite a few tall tales, usually about GPT's capabilities, but tall tales regardless.

jph00yesterday at 7:19 PM

If Sam said that, he is wrong. (Remember, he is not an AI researcher.) Anthropic have been using this kind of approach from the start, and it's fundamental to how they train their models. They have published a paper on it here: https://arxiv.org/abs/2212.08073

drcongoyesterday at 7:13 PM

He says a lot of things, most of it lies.