muwtyhg • today at 5:44 PM

There were experiments showing that LLMs become "craftier" and start hiding issues after being prompted like this.

No idea how accurate they are, but here are some articles on this exact thing:

gopher_space • today at 6:30 PM

I'm staying away from certain forms of conditioning because I don't want Roy Batty showing up on my doorstep.
