Hacker News

marcus_holmes today at 3:57 AM

I think they've been gaming benchmarks.

I use Claude every day. I cannot get Gemini to do anything useful, at all. Every time I've tried to use it, it has just failed to do what was required.


Replies

asdff today at 7:22 AM

Three subthreads up you have someone saying Gemini did what Claude couldn't for them on some 14-year-old legacy code issue. Seems you can't really use other people's prior success with their problem as an estimate of what your success will be like with your problem and a given tool.