You really cannot expect audio processing to yield color information.
Beyond that, you are correct that the 3D shapes themselves cannot be derived perfectly accurately (see my other post)
I share the prevailing skepticism about this specific project, but it does seem plausible that desiccated fall foliage might interact with sound differently than supple new growth.
I share the prevailing skepticism about this specific project, but it does seem plausible that desiccated fall foliage might interact with sound differently than supple new growth.