logoalt Hacker News

Donaldyesterday at 7:57 PM2 repliesview on HN

Gemini 3 Pro Preview gets 96.8% on the same benchmark? That's impressive


Replies

capitainenemoyesterday at 8:01 PM

And performs very well on the latest 100 puzzles too, so isn't just learning the data set (unless I guess they routinely index this repo).

I wonder how well AIs would do at bracket city. I tried gemini on it and was underwhelmed. It made a lot of terrible connections and often bled data from one level into the next.

bigyabaiyesterday at 8:19 PM

GPT-5.2 might be Google's best Gemini advertisement yet.

show 1 reply