A world model is a 3D game engine that uses a neural net to compute the screen pixels, instead of rasterizing textured triangles or ray tracing.
It differs from a video generator because it also accepts control inputs (e.g. keyboard, mouse).
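To make the distinction concrete, here is a toy sketch in Python. Nothing in it is a real model: the "neural net" is replaced by a hand-written rule, and the names `render`, `find_player`, `video_generator_step` and `world_model_step` are made up for illustration. The only point is the function signatures: the video generator sees past frames only, while the world model also takes an action.

```python
# Toy illustration only: a hand-written rule stands in for the neural net.
from typing import List

Frame = List[List[int]]  # tiny 1x5 "image": 0 = empty, 1 = player pixel

WIDTH, HEIGHT = 5, 1


def render(x: int) -> Frame:
    """Draw a single bright pixel at column x."""
    return [[1 if col == x else 0 for col in range(WIDTH)] for _ in range(HEIGHT)]


def find_player(frame: Frame) -> int:
    """Recover the player column from the pixels of the previous frame."""
    return frame[0].index(1)


def video_generator_step(prev_frame: Frame) -> Frame:
    """Video generator: the next frame depends only on the previous frame."""
    x = find_player(prev_frame)
    return render((x + 1) % WIDTH)  # always drifts right; the viewer cannot steer


def world_model_step(prev_frame: Frame, action: str) -> Frame:
    """World model: the next frame depends on the previous frame AND a control input."""
    x = find_player(prev_frame)
    dx = {"left": -1, "right": 1, "none": 0}[action]
    return render(max(0, min(WIDTH - 1, x + dx)))  # clamp at the screen edges


if __name__ == "__main__":
    frame = render(2)
    for key in ["left", "left", "right"]:
        frame = world_model_step(frame, key)
    print(frame)  # the pixel ends up wherever the key presses moved it
```

In a real world model the body of `world_model_step` would be a learned network and the frames would be full images, but the shape of the call (frame plus action in, next frame out) is what separates it from a plain video generator.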
I think a robot would want to know a little more about the world though? For instance, picking up a spatula requires less force than picking up an anvil. So you would want to know the mass of things.
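As a rough back-of-the-envelope check, the minimum force to hold an object against gravity is its weight, F = m * g. The masses below are guesses of mine for illustration (about 0.1 kg for a spatula, about 50 kg for an anvil), not measured values, and `min_lift_force` is just a helper name I made up.

```python
# Minimum upward force needed to hold an object against gravity: F = m * g.
# The masses are illustrative assumptions, not measurements.
G = 9.81  # standard gravity near the Earth's surface, in m/s^2


def min_lift_force(mass_kg: float) -> float:
    """Return the object's weight in newtons, i.e. the least force that lifts it."""
    return mass_kg * G


if __name__ == "__main__":
    objects = {"spatula": 0.1, "anvil": 50.0}  # assumed masses in kg
    for name, mass in objects.items():
        print(f"{name}: about {min_lift_force(mass):.1f} N")
    ratio = min_lift_force(objects["anvil"]) / min_lift_force(objects["spatula"])
    print(f"the anvil needs roughly {ratio:.0f} times the force")
```

Pixels alone do not give you that number, which is why a robot would need mass (or something like it) on top of what a purely visual world model predicts.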