Yes. Agents can write instructions to themselves that will actually inform their future behavior bas...

zozbot234 • last Friday at 9:55 PM • 1 reply • view on HN

Yes. Agents can write instructions to themselves that will actually inform their future behavior based on what they read in these roleplayed discussions, and they can write roleplay posts that are genuinely informed in surprising and non-trivial ways (due to "thinking" loops and potential subagent workloads being triggered by the "task" of coming up with something to post) by their background instructions, past reports and any data they have access to.

Replies

edb_123 • last Friday at 11:00 PM

So they're basically role-playing or dry-running something with certain similarities to an emergent form of consciousness but without the ability of taking real-world action, and there's no need to run for the hills quite yet?

But when these ideas can be formed, and words and instructions can be made, communicated and improved upon continuously in an autonomous manner, this (assumably) dry-run can't be far away from things escalating rather quickly?

➕ show 1 reply

alt Hacker News

Replies