Browser Agent Benchmark: Comparing LLM models for web automation

10 points • by MagMueller • yesterday at 3:48 PM • 2 comments • view on HN

wiradikusuma • today at 5:45 AM

Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing?

pixel_popping • today at 5:24 AM

It's lacking the best model (Opus 4.5) on the benchmark tho.

MagMueller • yesterday at 3:54 PM

[dead]

alt Hacker News