Exactly. Building a model that truly understands humans and their intentions, and that generally acts with professionalism, if not compassion, is the Easy Problem of Alignment.
Starting points:
https://www.lesswrong.com/posts/zthDPAjh9w6Ytbeks/deceptive-...