logoalt Hacker News

docheinestagestoday at 1:29 PM0 repliesview on HN

This is a helpful method for visually grounding LLMs to take actions on the screen such as clicking. For humans though, hell no.