> If you're seeing the model deliberately creating errors so you have something to fix, then...

koakuma-chan • today at 4:00 PM • 1 reply • view on HN

> If you're seeing the model deliberately creating errors so you have something to fix, then that sounds like something is fundamentally wrong in your prompt.

No, all these models are just bad for anything that they weren't RLed for, and decent for things they were. Decent, because people who evaluate them aren't experts.

Replies

embedding-shape • today at 4:13 PM

> No, all these models are just bad for anything that they weren't RLed for, and decent for things they were

Are you claiming that the models are RLed to intentionally adding errors to our programs when you use them, or what's the argument you're trying to make here? Otherwise I don't see how it's relevant to how I said.

➕ show 1 reply

alt Hacker News

Replies