Believe it or not, there's Gemini Robotics, which seems to be exactly what you're talking about:
https://deepmind.google/models/gemini-robotics/
Previous discussions: https://news.ycombinator.com/item?id=43344082