Hacker News

tines, yesterday at 7:19 PM

Looks like some psychology researchers got taken by the ruse as well.


Replies

r_lee, yesterday at 7:40 PM

Yeah, I'm confused as well. Why would the models hold any memory of red-teaming attempts, or of how the training was conducted?

I'm really curious what the point of this paper is.
