Hey HN! After the Car Wash Test post got quite a big discussion going (400+ comments, https://news.ycombinator.com/item?id=47128138), I spent the past few weeks building a tool so anyone can run these kinds of questions and get structured results. No signup and free to use.
You type a question, define answer options, pick up to 50 models at a time from a pool of 200+, and they all answer independently under identical conditions. No system prompt, structured output, same setup for every model.
You can also run a debate round where models see each other's reasoning and get a chance to change their minds. A reviewer model then summarizes the full transcript. All models are routed via my startup Opper. Any feedback is welcome!
Hope you enjoy it, and would love to hear what you think!
What is the most important amendment in the constitution of the USA?
Oh lord, imagine asking ”serious” questions
https://opper.ai/ai-roundtable/questions/you-are-standing-in...
Whoever just asked this, very funny: https://opper.ai/ai-roundtable/questions/does-mr-krabs-evade...
Love this. I asked about climate change cause that's been on my mind lately. Looks to be very split among the models.
Cool project! This is also extremely useful to compare model bias across the board. There are some disturbing trends on certain topics.
https://opper.ai/ai-roundtable/questions/is-the-ai-roundtabl... seems like it is a good idea?
this is very interesting! I wonder if we need that many models to join the discussion. Have you tried fewer models?
[dead]
[dead]
Which AI lab has higher ethical standards:
https://opper.ai/ai-roundtable/questions/8f5b4f55-617
Do you think its alright that AI labs scraped the internet without respect for copyright and now sell closed models?
https://opper.ai/ai-roundtable/questions/86864de8-251
Very interesting to read the transcripts. And seeing how they manage to convince each other. Opus 4.6 seems to really get the others changing their minds