logoalt Hacker News

pplonski86today at 7:22 AM2 repliesview on HN

There are so many models, is there any website with list of all of them and comparison of performance on different tasks?


Replies

Reubendtoday at 7:31 AM

The post actually has great benchmark tables inside of it. They might be outdated in a few months, but for now, it gives you a great summary. Seems like Gemini wins on image and video perf, Claude is the best at coding, ChatGPT is the best for general knowledge.

But ultimately, you need to try them yourself on the tasks you care about and just see. My personal experience is that right now, Gemini Pro performs the best at everything I throw at it. I think it's superior to Claude and all of the OSS models by a small margin, even for things like coding.

show 1 reply
coffeeritoday at 7:27 AM

There is https://artificialanalysis.ai

show 2 replies