logoalt Hacker News

dwa3592yesterday at 10:41 PM1 replyview on HN

Why weren't these attacks tested on the frontier models? The models they tested these on can also be fooled by poems and rhymes.


Replies