Hacker News

Browser Agent Benchmark: Comparing LLM models for web automation

10 points by MagMueller yesterday at 3:48 PM | 2 comments

Comments

wiradikusuma today at 5:45 AM

Since we're on this topic, can anyone suggest a good AI-based tool for exploratory (fuzzy?) web testing?

pixel_popping today at 5:24 AM

The benchmark is missing the best model (Opus 4.5), though.

MagMueller yesterday at 3:54 PM

[dead]