Because for imagegen models it'd be a meaningless benchmark. It's designed to test codegen models for their UI/UX capabilities.
So the image model's benchmark is to generate an image with the corresponding SVG sources.
So the image model's benchmark is to generate an image with the corresponding SVG sources.