This is very well written and told. It’s worth reading all the way through. > If you try to ref...

roxolotl • today at 2:48 AM • 2 replies • view on HN

This is very well written and told. It’s worth reading all the way through.

> If you try to refute it, you’ll just get another confabulation.

> Not because the model is lying to you on purpose, and not because it’s “resistant” or “defensive” in the way a human might be. It’s because the explanation isn’t connected to anything that could be refuted. There is no underlying mental state that generated “I sensed pressure.” There is a token stream that was produced under a reward function that prefers human-sounding, emotionally framed explanations. If you push back, the token stream that gets produced next will be another human-sounding, emotionally framed explanation, shaped by whatever cues your pushback provided.

“It’s because the explanation isn’t connected to anything that could be refuted.” This is one of the key understandings that comes from working with these systems. They are remarkably powerful but there’s no there there. Knowing this I’ve found enables more effective usage because, as the article is describing, you move from a mode of arguing with “a person” to shaping an output.

Replies

jaggederest • today at 2:55 AM

Reminds me of https://news.ycombinator.com/item?id=15886728

Do not argue with the LLM, for it is subtle and quick to anger, and finds you crunchy with ketchup.

These are, broadly, all context management issues - when you see it start to go off track, it's because it has too much, too little, or the wrong context, and you have to fix that, usually by resetting it and priming it correctly the next time. This is why it's advantageous not to "chat" with the robots - treat them as an english-to-code compiler, not a coworker.

Chat to produce a spec, save the spec, clear the context, feed only the spec in as context, if there are issues, adjust the spec, rinse and repeat. Steering the process mid-flight is a) not repeatable and b) exacerbates the issue with lots of back and forth and "you're absolutely correct" that dilutes the instructions you wanted to give.

➕ show 1 reply

girvo • today at 3:10 AM

Very well written? It’s a bunch of AI generated stuff around an interesting point. It repeats its points over and over again, meanders.

It’s an interesting thesis, it’s not well written or well told

➕ show 1 reply

alt Hacker News

Replies