"On mobile devices such as iPhone 15 Pro Max, our EfficientTAMs can run at ~10 FPS for performing video object segmentation with reasonable quality"
This is pretty impressive! Lowering the compute requirements will make more applications feasible.