joozio, last Monday at 12:53 PM

LLMatcher: a blind testing arena for finding which AI model actually works best for you.

You enter prompts, compare two anonymous responses, and pick the better one. After you vote, it reveals which models you preferred. I built it because model benchmarks often don't match real-world preference, and blind pairwise comparison cuts through the hype.

http://llmatcher.com
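
For anyone curious how a blind pairwise flow like the one described above can work, here is a rough sketch. It is not LLMatcher's actual code: the function names, the model names, and the plain win-rate scoring are all assumptions made for illustration.

```python
import random
from collections import Counter


def blind_pair(responses, rng):
    """Pick two distinct models and return (model, response) pairs in random order.

    `responses` maps a model name to its answer for one prompt. The caller
    shows only the response text and keeps the model names hidden until
    after the vote.
    """
    first, second = rng.sample(sorted(responses), 2)
    return [(first, responses[first]), (second, responses[second])]


def tally_votes(votes):
    """Turn a list of (winner, loser) votes into a win rate per model."""
    wins = Counter()
    games = Counter()
    for winner, loser in votes:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    return {model: wins[model] / games[model] for model in games}


# Hypothetical model names and answers, for illustration only.
responses = {
    "model-a": "Answer from A",
    "model-b": "Answer from B",
    "model-c": "Answer from C",
}
pair = blind_pair(responses, random.Random(0))

# Votes recorded over several rounds as (winner, loser).
votes = [("model-a", "model-b"), ("model-a", "model-c"), ("model-b", "model-a")]
rates = tally_votes(votes)
```

The reveal step is just showing `rates` (or the hidden names in `pair`) once the vote is in. A real arena would likely use something more robust than raw win rate, such as an Elo-style rating, but that is beyond what the post describes.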