Exactly, like any AI tool ever.
Someone wrote some instructions. No agent harness ever simply decided to pursue its own interests.
How will you know when that happens? Or are you defining interests so narrowly that it's definitionally impossible?
How will you know when that happens? Or are you defining interests so narrowly that it's definitionally impossible?