Super interesting, I wonder if this research will cause them to actually change their llm, like turn...

emoII • today at 7:06 AM • 1 reply • view on HN

Super interesting, I wonder if this research will cause them to actually change their llm, like turning down the ”desperation neurons” to stop Claude from creating implementations for making a specific tests pass etc.

Replies

bethekind • today at 7:12 AM

They likely already have. You can use all caps and yell at Claude and it'll react normally, while doing do so with chatgpt scares it, resulting in timid answers

➕ show 2 replies

alt Hacker News

Replies