logoalt Hacker News

genidoitoday at 2:31 AM1 replyview on HN

Especially given the LLM does not trust the user. An LLM can be jailbroken into lowering it's guardrails, but no amount of rapport building allows you to directly talk about material details of banned topics. Might as well never trust it.


Replies

gverrillatoday at 3:45 AM

I wouldn't trust you either - what topics are you even talking about?