Hacker News

user_7832 today at 4:50 PM (3 replies)

Two quick questions to Gemini/AI Studio users:

1. Has anyone actually found 3 Pro better than 2.5 (on non-code tasks)? I struggle to find a difference beyond the quicker reasoning time and fewer tokens.

2. Has anyone found any non-thinking models better than 2.5 or 3 Pro? So far I find the thinking ones significantly ahead of non-thinking models (from any company, for that matter).


Replies

Workaccount2 today at 4:54 PM

Gemini 3 is a step change over 2.5 for electrical engineering R&D.

Davidzheng today at 5:08 PM

I think it's probably actually better at math, though still not enough to be substantially useful in my research. I suspect this will change suddenly at some point as the models move past a certain threshold. For now it is heavily limited by the fact that the models are very bad at not giving wrong proofs/counterexamples, so even if they succeed at a useful rate, the labor of sorting through a bunch of trash makes them hard to justify.

tmaly today at 5:06 PM

Not for coding but for the design aspect, 3 outshines 2.5