logoalt Hacker News

XCSmeyesterday at 8:29 PM3 repliesview on HN

Funny how they didn't include Gemini 3.0 Pro in the bar chart comparison, considering that it seems to do the best in the table view.


Replies

jychangyesterday at 8:44 PM

Also, funny how they included GPT-5.0 and 5.1 but not 5.2... I'm pretty sure they ran the benchmarks for 5.0, then 5.1 came out, so they ran the benchmarks for 5.1... and then 5.2 came out and they threw their hands up in the air and said "fuck it".

show 3 replies
guluarteyesterday at 9:49 PM

Gemini is garbage and does it's own thing most of the time ignoring the instructions