What do you mean? They found when they trained a LLM to lie that internally it knew the truth and ...

unparagoned • today at 7:02 AM • 0 replies • view on HN

What do you mean?

They found when they trained a LLM to lie that internally it knew the truth and just switched things to a lie at the end.

alt Hacker News