I studied this subject when I was building FPV sim from scratch. I noticed that yaw action is often misunderstood, falsely believing that it's caused by a change in angular momentum (think reaction wheels).
There is a good SE answer about deriving net torques given each props' thrust force in less PhD language than this article: https://drones.stackexchange.com/a/416/11402
If anyone is interested in tackling UAV simulation, make sure to check out this prop/motor performance database, helped me a lot building a virtual drone with realistic properties (mainly motor thrust & torque): https://database.tytorobotics.com/tests
Why does four seem optimal? Why not three or five? or eight?