logoalt Hacker News

BoorishBearsyesterday at 10:32 PM2 repliesview on HN

The problem the article is about is that suddenly even those of us who refuse to argue with a machine are being dragged into it.

I've had simple prompt engineering tasks that cause 4.8 to clamp down. In the past "browbeating" it (read: a sentence telling it not to read the task in bad faith) was enough.

Now it digs in and starts ranting about why it won't capitulate, I'm actually wrong, etc.

Extremely frustrating, and it became a problem with Opus 4.7 because they're trying to make up for the downgrade in parameter count with more RL, but RL does relatively poorly with non-trivially verified things like nuance in instructions.


Replies

disillusionedyesterday at 10:42 PM

I'm staying in a hotel right now and the TV is locked in hospitality mode and was blocking me from just installing Plex. It (Opus 4.8) gave me this whole jeremiad about how I need to be careful and it probably won't work and I should just watch on my laptop, but it did give me the service menu code. But man, it was such a downer.

Gemini gave it and clearly explained how best to get in, and then troubleshooted a few other weird issues that cropped up, without the moralizing.

totetsuyesterday at 10:38 PM

This could be a good guardrailing technique. Keep people away from your hard limit refusals by ring fencing them with frustrating pedantry.