logoalt Hacker News

alchemist1e9yesterday at 8:36 PM1 replyview on HN

Why and what would a good benchmark look like?


Replies

moffkalastyesterday at 8:54 PM

30 people trying out all models on the list for their use case for a week and then checking what they're still using a month after.