A more viable path might actually be agentic testing via agents that simply use a browser or screen reader that can work off high level test scenarios.
I've done some UI testing via the agent mode in chat gpt and I got some pretty decent feedback out of that. I've been trying to do more of that.
Accessibility testing might require a bit more additional tooling than comes with chat gpt by default. But otherwise, this could work.