Hacker News

pessimizer today at 3:54 PM (3 replies)

I'm having flashbacks to every time I've tried to convince these things that they're screwing up, watching the tokens burn.

When these models screw up, and you notice immediately and point out exactly how they screwed up in simple, direct language, they will 1) explain to you at length how you are actually wrong, by pretending that they originally said what you just said and that you just said something else, and 2) tell you at length how your misunderstanding and confusion could have made their answer seem and feel wrong to you.

Then you quote their answer and repeat that it was wrong (maybe two or three times), and you get effusive praise and self-criticism at length about how the answer that you already told them was wrong was wrong, as if you needed to know that, plus another explanation of the mistake or problem that you just explained to them.

At this point, the entire context is wrecked and filled with nonsense. You want to dump it and start over, but you're afraid that if you start over the same way you'll end up here again (and you do, unless you figure out the magic words).

Why aren't they getting better at this? Are some of them getting better at this?


Replies

andsoitis today at 3:57 PM

> I'm having flashbacks to every time I've tried to convince these things that they're screwing up, watching the tokens burn.

That makes me think you should get credits when you have to correct the system.

> Why aren't they getting better at this? Are some of them getting better at this?

They lack critical thinking, reasoning, logic, skepticism, self-reflection, and common sense, among other things. They also don't learn. They get trained, but they don't learn once they're out there.

sjsdaiuasgdia today at 4:10 PM

Why are you asking a token generator to explain its prior output?

You are proceeding from a false premise. You are not getting an explanation of its prior output. You are getting a series of tokens that forms a response to your query, same as it did for the initial answer. Now you've asked it why it's wrong, so the text conforms to that request, but that doesn't change the fundamental nature of the software you're interacting with.

bryanlarsen today at 3:57 PM

You're describing what I'm going through at this moment. I'm on HN for a stress break for this reason.
