I wonder how accurately joint positions and muscle activations can be estimated from just a POV camera. Maybe it’s not crazy to think someone could get tens of millions of hours of well-labeled training data.