This is actually interesting. Feels like we’re moving from “generate UI” to “validate UI,” which is a completely different problem. Curious how you handle edge cases where something looks correct but breaks in interaction?
...you test the interaction too? That's what Playwright does, and LLMs are pretty capable of writing Playwright tests for interactions.