That is GPT-3. Modern models are rewarded based off the accuracy of their responses.
By... another AI model. Which uses statistical generation to decide whether the answer is likely to be accurate or not.
Wait, have we solved hallucination already?
No they are not
By... another AI model. Which uses statistical generation to decide whether the answer is likely to be accurate or not.