Hacker News

colechristensen · today at 12:54 AM

No, they just need to be trained to have an adversarial self-review "thinking" process.

You ask an LLM "What's wrong with your answer?" and you get pretty good results.
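For what it's worth, a minimal sketch of that self-review loop in Python, assuming a hypothetical complete() wrapper around whatever LLM API you're actually calling:

    def complete(prompt: str) -> str:
        """Hypothetical stand-in for your LLM API of choice."""
        raise NotImplementedError

    def self_review(question: str) -> str:
        # First pass: answer normally.
        answer = complete(question)

        # Adversarial pass: ask the model to critique its own answer.
        critique = complete(
            f"Question: {question}\nYour answer: {answer}\n"
            "What's wrong with your answer?"
        )

        # Revision pass: fold the critique back into a final answer.
        return complete(
            f"Question: {question}\nDraft answer: {answer}\n"
            f"Critique: {critique}\nWrite a corrected final answer."
        )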


Replies

binary0010 · today at 12:59 AM

Or the original output was correct, and the adversarial "rethinking" switches it to an incorrect result.
