logoalt Hacker News

llmslaveyesterday at 8:14 PM1 replyview on HN

The benchmarks on all these models are meaningless


Replies

alchemist1e9yesterday at 8:36 PM

Why and what would a good benchmark look like?

show 1 reply