This mirrors my experience as well when using things like ThreeJS.
Any SOTA model can one-shot something that looks pretty similar to one of Three's examples, but things go south quickly once you try to increase the complexity, even with pretty unambiguous instructions.