logoalt Hacker News

SoMomentaryyesterday at 9:39 PM1 replyview on HN

The speed was impressive when I tested it but unfortunately the accuracy left a lot to be desired. Be interesting to do the math on some of my normal workflows to see where the break even is between them, assuming the tasks you have can tolerate a couple of failures.


Replies

zuzululutoday at 12:16 AM

we are talking about computer use here

gemini 3.5 flash isn't meant to compete head to head with frontier models on tough problems