Regarding all the comments about physics, I wonder if a hybrid approach would work better: an LLM generates 3D objects that interact in a physics simulation, with guiding forces supplied by the LLM, and then a separate model produces the photorealistic rendering.
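To make the idea a bit more concrete, here is a rough toy sketch of that three-stage split. Everything in it is an assumption for illustration: `propose_scene`, `guiding_force`, and `render` are hypothetical stand-ins for the LLM and the rendering model (no real model is called), the scene is one-dimensional, and the force values are made up. The only real part is the middle stage, a small semi-implicit Euler integrator that enforces gravity and a floor no matter what the "LLM" asks for.

```python
from dataclasses import dataclass

GRAVITY = -9.81  # m/s^2, acting along the vertical axis


@dataclass
class Body:
    name: str
    mass: float      # kilograms
    y: float         # height above the ground, metres
    vy: float = 0.0  # vertical velocity, m/s


def propose_scene():
    """Hypothetical stand-in for the LLM that emits the 3D objects."""
    return [Body("ball", mass=1.0, y=2.0), Body("crate", mass=5.0, y=0.0)]


def guiding_force(body, t):
    """Hypothetical stand-in for LLM guidance: a brief upward push on the crate."""
    if body.name == "crate" and t < 0.5:
        return 100.0  # newtons, made-up value
    return 0.0


def step(body, force, dt):
    """Semi-implicit Euler step with gravity and a hard floor at y = 0."""
    ay = GRAVITY + force / body.mass
    body.vy += ay * dt
    body.y += body.vy * dt
    if body.y < 0.0:
        # The simulator, not the LLM, decides that objects cannot sink below the floor.
        body.y, body.vy = 0.0, 0.0


def simulate(bodies, duration=1.0, dt=0.01):
    """Advance all bodies and record one frame of heights per time step."""
    frames = []
    for i in range(int(round(duration / dt))):
        t = i * dt
        for b in bodies:
            step(b, guiding_force(b, t), dt)
        frames.append({b.name: b.y for b in bodies})
    return frames


def render(frame):
    """Hypothetical stand-in for the photorealistic rendering model."""
    return ", ".join(f"{name} at y={y:.2f} m" for name, y in frame.items())


frames = simulate(propose_scene())
print(render(frames[-1]))
```

The point of the split is that the guidance can only nudge objects through forces, so the physically valid trajectory comes from the simulator, and the renderer only ever sees states the simulator produced.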