logoalt Hacker News

meowfacelast Sunday at 7:54 AM1 replyview on HN

It's a term somewhat popularized by the LessWrong/rationalism community to refer to communication (self-communication/note-taking/state-tracking/reasoning, or model-to-model communication) via abstract latent space information rather than written human language. Vectors instead of words.

One implication leading to its popularity by LessWrong is the worry that malicious AI agents might hide bad intent and actions by communicating in a dense, indecipherable way while presenting only normal intent and actions in their natural language output.


Replies

verisimilast Sunday at 8:58 AM

> malicious AI agents might hide bad intent and actions by communicating in a dense, indecipherable way while presenting only normal intent and actions in their natural language output.

you could edit this slightly to extract a pretty decent rule for governance, like so:

> malicious agents might hide bad intent and actions by communicating in a dense, indecipherable way while presenting only normal intent and actions in a natural way

It applies to ai, but also many other circumstances where the intention is that you are governed - eg medical, legal, financial.

Thanks!

show 1 reply