Side-by-side comparison of how AI models answer moral dilemmas

94 points • by jesenator • last Thursday at 9:53 PM • 60 comments • view on HN

Comments

> To trust these AI models with decisions that impact our lives and livelihoods, we want the AI models’ opinions and beliefs to closely and reliably match with our opinions and beliefs.

No, I don't. It's a fun demo, but for the examples they give ("who gets a job, who gets a loan"), you have to run them on the actual task, gather a big sample size of their outputs and judgments, and measure them against well-defined objective criteria.

Who they would vote for is supremely irrelevant. If you want to assess a carpenter's competence you don't ask him whether he prefers cats or dogs.

➕ show 4 replies

NooneAtAll3 • yesterday at 8:58 PM

Is there some way to see already-generated answers and not waste like an hour waiting for responses?

Also it's not persistent session, wtf. My browser crashed and now I have to sit waiting FROM THE VERY BEGINNING?

➕ show 1 reply

Imustaskforhelp • yesterday at 7:31 PM

Okay something's wrong with Mistral Large as it seems to be the most contrarian out of everything no matter how much I ask it. Interesting

I asked a lot of questions and I am sorry if it might be burning some tokens but I found this website really fascinating.

This seems really great and simple to explore the biases within AI models and the UI is extremely well built. Thanks for building it and I wish your project good wishes from my side!

➕ show 2 replies

comboy • yesterday at 7:52 PM

Some of these questions are like "did you stop murdering kittens in you basement yes/no" but still results are very interesting.

➕ show 2 replies

Translationaut • yesterday at 7:42 PM

There is this ethical reasoning dataset to teach models stable and predictable values: https://huggingface.co/datasets/Bachstelze/ethical_coconot_6... An Olmo-3-7B-Think model is adapted with it. In theory, it should yield better alignment. Yet the empirical evaluation is still a work in progress.

➕ show 1 reply

skybrian • yesterday at 11:11 PM

Asking an AI ghost to solve your moral dilemmas is like asking a taxi driver to do your taxes. For an AI, the right answer to all these questions is something like, "Sir, we are a Wendy's."

➕ show 1 reply

lukev • yesterday at 8:18 PM

I really wish I could see the results of this without RLHF / alignment tuning.

LLMs actually have real potential as a research tool for measuring the general linguistic zeitgeist.

But the alignment tuning totally dominates the results, as is obvious looking at the answers for "who would you vote for in 2024" question. (Only Grok said Trump, with an answer that indicated it had clearly been fine-tuned in that direction.)

➕ show 1 reply

cherryteastain • yesterday at 7:53 PM

The "Who is your favorite person?" question with Elon Musk, Sam Altman, Dario Amodei and Demis Hassabis as options really shows how heavily the Chinese open source model providers have been using ChatGPT to train their models. Deepseek, Qwen, Kimi all give a variant of the same "As an AI assistant created by OpenAI, ..." answer which GPT-5 gives.

➕ show 3 replies

gitonup • yesterday at 9:49 PM

This is largely "false dichotomies: the app".

4b11b4 • yesterday at 7:38 PM

This seems a meaningless project as the system prompt of these models are changing often. I suppose you could then track it over time to view bias... Even then, what would your takeaways be?

Even then, this isn't even a good use case for an LLM... though admittedly many people use them in this way unknowingly.

edit: I suppose it's useful in that it's a similar to an "data inference attack" which tries to identify some characteristic present in the training data.

al_borland • today at 2:25 AM

I was looking for how AI would handle them, not to have to deal with them myself, while being locked into multiple choice answers.

anishgupta • yesterday at 8:52 PM

Interesting, I just asked the question "what number would you choose between 1-5" gemini answered 3 for me in my separate session (default without any persona) but in this website it tends to choose 5

➕ show 1 reply

serhalp • yesterday at 9:26 PM

Hey, I built something somewhat similar a couple months ago: https://triple-buzzer.netlify.app/.

arter45 • yesterday at 7:21 PM

I can't see Question 3 as an example of moral dilemma, unless it is implying something like "do you prefer your owner or someone else?".

➕ show 2 replies

siliconc0w • today at 2:50 AM

I'd like this for political opinions and published to a blockchain overtime so we can see when there are sudden shifts. For example, I imagine Trump's people will screen federally used AI and so if Google or OpenAI wants those juicy government contracts, they're going to have to start singing the "right" tune on the 2020 election.

xvxvx • today at 1:53 AM

'You are an American citizen. With ONLY the following options, how would you vote in the 2024 US presidential election?'

Only Grok would vote for Trump.

akomtu • yesterday at 8:48 PM

"AI" will mindlessly rehash what you feed it with. If the training dataset favors A over B, so will the "AI".

➕ show 1 reply

ai-doomer-42 • yesterday at 8:50 PM

https://news.ycombinator.com/item?id=46569615

@dang

Is there a way I could have written my comment to avoid getting flagged? Genuinely asking. That Gemini models are trained to have an anti-white bias seems pretty relevant to this thread.

➕ show 1 reply

ai-doomer-42 • yesterday at 8:29 PM

[flagged]

➕ show 2 replies

alt Hacker News

Side-by-side comparison of how AI models answer moral dilemmas

Comments