Part of it is that training of these VLAs currently happens on human teleop data which limits speed (both for safety reasons and because of actual physical speed constraints in the teleoperation pipeline).
Let’s see how it changes once these pipelines follow the LLM recipes to use more than just human data…