sudb • yesterday at 5:08 PM • 1 reply

I'm pretty unsurprised that the vision agent did worse. I'd be interested in a comparison between the different tools that now exist to let LLMs drive browsers (e.g. Vercel's agent-browser, the relatively new dev-browser[1], etc.).

There are use cases where the vision agent is the more obvious, or only, choice though, e.g. proprietary/locked-down desktop apps that lack an automation layer.

1. https://github.com/SawyerHood/dev-browser

Replies

palashawas • yesterday at 5:16 PM

Interesting! I'll play around with agent-browser and update this article if anything comes up.
