> as in they deliberately create errors in code that then you have to spend time debugging and fixing
No, all the models are designed to be "helpful", but different companies interpret "helpful" differently.
If you're seeing the model deliberately creating errors so you have something to fix, then that sounds like something is fundamentally wrong in your prompt.
Besides that, I'm guessing "repeat solving it until it is correct" is a condensed version of your actual prompt, or is that verbatim what you send the model? If it's verbatim, you need to give it far more detail before it can actually execute something like that.
> If you're seeing the model deliberately creating errors so you have something to fix, then that sounds like something is fundamentally wrong in your prompt.
No, all these models are just bad at anything they weren't RLed on, and merely decent at the things they were. Decent, because the people who evaluate them aren't experts.
> then that sounds like something is fundamentally wrong in your prompt.
I am holding it wrong?