Where they agree it shows the data supports that answer - not necessarily that it is true, where they disagree it shows you need to hedge. That's useful.
This is so wrong!
e.g., if you had a heart condition, you can't just poll three LLMs and be "reasonably sure" you've properly diagnosed the ailment.
This is so wrong!
e.g., if you had a heart condition, you can't just poll three LLMs and be "reasonably sure" you've properly diagnosed the ailment.