Hacker News

koakuma-chan today at 4:00 PM

> If you're seeing the model deliberately creating errors so you have something to fix, then that sounds like something is fundamentally wrong in your prompt.

No, all these models are just bad for anything that they weren't RLed for, and decent for things they were. Decent, because people who evaluate them aren't experts.


Replies

embedding-shape today at 4:13 PM

> No, all these models are just bad for anything that they weren't RLed for, and decent for things they were

Are you claiming that the models are RLed to intentionally add errors to our programs when you use them, or what's the argument you're trying to make here? Otherwise I don't see how it's relevant to what I said.
