logoalt Hacker News

andaitoday at 5:35 PM0 repliesview on HN

Yeah, GPT also constantly misattributes things.

OpenAI have some kinda 5 tier content hierarchy for OpenAI (system prompt, user prompt, untrusted web content etc). But if it doesn't even know who said what, I have to question how well that works.

Maybe it's trained on the security aspects, but not the attribution because there's no reward function for misattribution? (When it doesn't impact security or benchmark scores.)