Algorithmic Monocultures in Hiring

130 points • by sizzle • yesterday at 6:56 PM • 137 comments • view on HN

https://algorithmichiring.github.io/

Comments

I went to a state school.

I then went on to work for multiple firms that placed a premium on candidates from Ivy League/Top Tier (Stanford/Duke etc) candidates.

This taught me that:

- Their are pros and cons to any selection criteria.

- There are smart people everywhere. One of the smartest people I ever worked for spent several years in prison for drug dealing. He was on par with many of the Managing Directors I've worked for

- There was a study where they asked big bank recruiters which school consistently produced people who were excellent employees 2-3 years out from hiring and the answer was Penn State (not my alma mater)

- There used to be "manager's choice" hires where managers had 1 slot in a training program where they could select whoever they wanted. Sometimes that was terrible. Sometimes that person was top of their training program.

- Smart people are just as capable as creating problems as less intelligent people. Smart people, in some ways, are better at creating problems. Especially if the incentives reward them for creating those problems.

➕ show 1 reply

kenjackson • yesterday at 10:06 PM

I think this partially buries the lede: "As a single hiring vendor comes to dominate screening for an industry, it may be more likely that candidates are shut out."

If we move to using just a small number of AI models to help do things like hiring, we will amplify biases and possibly completely lock out portions of the population. We need to be very careful when using AI systems to evaluate people in general -- not because they might be biased (which they might be), but because even a small bias, if used by virtually everyone, can be damning.

➕ show 5 replies

wand3r • yesterday at 8:32 PM

Did I miss the part of the article where they break down how they determined race? Is the algorithm blind to race? It looks like they specifically looked at 83k people applying to ~100 companies which notably were Fortune 500 companies. Could there simply be candidate discrepancies here? Hard for me to follow the full methodology but it doesn't necessarily seem either malicious or that well structured. Don't you need to have a control group of applicants who are similar on paper? To allege DISCRIMINATION is quite bold.

Definitely open to opposing or critical views

➕ show 4 replies

Oras • yesterday at 9:03 PM

Misleading title the paper [0] does not mention any CV screening that might suggest racial or gender bias. It is purely about assessment tool. No AI or LLMs.

I'm not saying AI is not biased, but this study does not prove that.

[0] https://arxiv.org/pdf/2605.27371

From the paper:

> Fig. 1. The pymetrics process. > Stage 1: Applicants apply to positions. > Stage 2: Applicants are directed to the pymetrics platform to play assessment games. > Stage 3: pymetrics algorithms use applicant gameplay features to recommend 58.2% of applicants per position on average. > Stage 4: Employers decide which applicants to interview or hire, typically rejecting applicants that were not recommended by pymetrics.

Terr_ • today at 1:01 AM

> If we pool all of its recommendations together — treating the vendor as one giant hiring process — we don’t find adverse impact. If we look at each position separately, as would be typical in an evaluation of adverse impact, then we expose the adverse impact in many positions.

Sounds a bit like Simpson's Paradox [0]

[0] https://en.wikipedia.org/wiki/Simpson%27s_paradox

daft_pink • yesterday at 10:04 PM

Anyone who’s done hiring wouldn’t be shocked by this:

We find applicants are more likely to be rejected from every position they apply to than would be predicted by the baseline of each position making statistically independent decisions.

Obviously a rejected resume is more likely to be rejected by every other employer and an accepted resume is more likely to be accepted by every other employer. Like online dating, most employers are looking for some baseline indicators that you are going to be successful and stable.

➕ show 3 replies

tbrownaw • yesterday at 10:27 PM

> We find that people who submit multiple applications to positions screened by the same algorithmic hiring vendor are more likely to be rejected from every position to which they apply than would be true if the companies made decisions statistically independently from one another. Ten percent of applicants who submit four applications are rejected from all the places to which they apply.

> Our research also found that this pattern does not appear to be the case in other circumstances. We analyzed data from the largest prior study of hiring decisions, which sent 83,000 applications to 108 Fortune 500 firms during the same time period as our study and did not focus on whether AI was used to make decisions. We found that the rate at which applicants were rejected from every firm they applied to in this data was no higher than what you’d expect if each company decided independently of the others.

It sounds like this study was using real-world applicants, and the other study they're comparing against was using synthetic applicants.

Consider the chance of being accepted as being composed of signal+bias+noise. Noise is random. Signal is a per-applicant value, and what's meant to be measured. Bias is a per-group value, and an artifact of the measuring process.

If acceptance/rejection is independent between positions applied for (as in the synthetic applicant study), that suggests that it's random or composed entirely of noise; ie there is no signal; ie the applicants are all equally qualified.

If acceptance/rejection is correlated, that means there is some nonzero amount of (signal+bias). But real-world applicants are not all identical, so there should be some amount of signal. So you can't just assume zero signal in order to infer that there must be bias.

➕ show 1 reply

alain94040 • yesterday at 8:05 PM

The European Union passed The Artificial Intelligence Act, which classifies:

High-risk – AI applications that are expected to pose significant threats to health, safety, or the fundamental rights of persons. Notably, AI systems used in health, education, recruitment, critical infrastructure management, law enforcement or justice. They are subject to quality, transparency, human oversight and safety obligations

That's a pretty common sense legislation to me.

➕ show 4 replies

dash2 • yesterday at 8:27 PM

> To measure adverse impact, we apply the EEOC’s “four-fifths rule,” which flags a position when one group is recommended at less than 80% of the rate of the most-recommended group

That seems like a nonsensical way to measure racial discrimination. What could justify it?

➕ show 7 replies

ortusdux • yesterday at 8:53 PM

Ayres, I., Banaji, M. and Jolls, C. (2015), Race effects on eBay. The RAND Journal of Economics, 46: 891-917. https://doi.org/10.1111/1756-2171.12115

"Cards held by African-American sellers sold for approximately 20% ($0.90) less than cards held by Caucasian sellers, and the race effect was more pronounced in sales of minority player cards."

asdff • yesterday at 8:08 PM

Some job application websites I've seen actually have a yes or no option to consent to AI review that they claim is to simply assist HR and not actually screen you. I always select no. There is no way that selecting yes would ever be in my interest. I'm sorry, I'm going to force a real human to look at my stuff if I still can.

➕ show 2 replies

verteu • yesterday at 8:32 PM

The paper is here: https://arxiv.org/pdf/2605.27371

They find "disparate impact" of pymetrics across racial groups, but it doesn't seem like they controlled for anything.

➕ show 1 reply

ApolloFortyNine • yesterday at 8:45 PM

I truly don't doubt it's possible for the AI to be 'racist'.

>If the AI had recommended Black and Asian candidates at the same rate as it recommended the most-favored group (typically white applicants), 40,000 more of their applications would have advanced to the next stage of hiring.

I don't think this is the right benchmark here, or at least, it would be very interesting if the actual outcome, offer or rejected, was considered at the end.

➕ show 1 reply

jsemrau • yesterday at 9:14 PM

Interesting timing as Workday is facing Discrimination Claims in California doing the same thing.

https://www.yahoo.com/news/us/articles/california-judge-upho...

tlogan • yesterday at 9:11 PM

I am not surprised.

AI works by learning patterns. So it will become bias by just learning from factors like education history, schools attended, employment history, ZIP codes, or geographic location. Those 3 factors alone are an easy proxy for race.

And if you add names into the equation (if the AI was trained without removing applicant names), the model can become even more bias.

rnxrx • yesterday at 9:48 PM

Genuine curiosity: Is there any speculation as to what these tools are keying on to reject those particular applicants? It seems like it just being the applicant's name is too easy an answer, but I could be overthinking it.

ericol • yesterday at 8:50 PM

2 days ago there was another interesting article on the effects of AI in hiring[1]

I guess this one just compounds.

[1] https://news.ycombinator.com/item?id=48620142

groundzeros2015 • yesterday at 8:29 PM

I don’t think AI screening is effective. But this study is just disparate impact.

xrd • yesterday at 8:34 PM

Would be very interested to see how this affects post-50 workers. That's a protected class and I would imagine an ambulance chasing lawyer would be excited for a class action lawsuit.

stevenicr • yesterday at 9:15 PM

I expected more information from the article and 'the paper' -

I see nothing that shows any system was making a decision on race. How is the race being presented to the AI?

All this is showing from what I can see, is that certain groups of people were more often denied a next step in the process - but why?

Was the AI going by spelling and grammar? Were there names that were different but the rest of the resume was exactly the same? Were there pictures?

There were mentions that the rate of each group may be more prominent in the data when you split apart different types of jobs instead of all jobs in aggregate.. One could read that like it's inferred; that more warehouse jobs are offered to a race and less admin jobs.. but that same would happen if AI was more focused on perfect grammar for one job and it was not as much of a factor for a warehouse job.

Also if the people applying for the various jobs were self selecting, acceptance percentages this would skew things based upon which ones were applied / not applied to right?

There are so many ways you could draw conclusions like this from data, however correlation is not causation, yet this seems to say it is.

I feel this is an important thing to watch, but Stanford may not be the place to trust with 'Policy Recommendations' as it's very unclear there is any proof that 'AI Hiring Tools Yield Racial Bias and Systemic Rejection' from this study and paper.

PS - now that I see the HN title did not have the word "can" in it, and the title of the article is actually "Tools Can Yield" - maybe that is less accusing and more noting.

OrvalWintermute • yesterday at 9:18 PM

The Pymetrics game is rigged by design:

Only 40% self report gender/race

no resume data, no education information, degrees, schools, GPA, major, work experience, skills/certifications

Zero job qualifications

➕ show 1 reply

xdennis • yesterday at 10:10 PM

It's surprising to me to hear that these systems are considered racist when they're the same ones that are so color blind that they generate pictures of SS soldiers as African American women.

jongjong • yesterday at 10:02 PM

I think the discrimination aspect is downstream from this fact:

> We follow 3.4 million people who submit 4 million job applications to 1,700 job postings across 150 employers and 11 industry sectors. Each job application was assessed by an AI hiring tool built by a single third-party vendor.

3.4 million people applying to just 150 employers... Who are all using just 1 platform. WTF. This is where the discrimination is happening. Why the f do 3.4 million people feel forced to apply to just 150 employers and why the f do all these 150 employers feel forced to use just one platform. WTF.

➕ show 1 reply

x313 • yesterday at 8:23 PM

This study only looks at one specific vendor algorithmn (a job assesment given by a company called pymetrics)

➕ show 1 reply

black6 • yesterday at 8:39 PM

I'm struggling to figure out what they're trying to say here in the linked (and very anemic) paper:

> 30% of Black applicants apply to at least one position that demonstrates adverse impact against Black applicants.

The whole thing reads like a tautology.

➕ show 1 reply

engineer_22 • yesterday at 8:31 PM

> Using our large dataset of real hiring AI recommendations, we test our hypothesis. We find that people who submit multiple applications to positions screened by the same algorithmic hiring vendor are more likely to be rejected from every position to which they apply than would be true if the companies made decisions statistically independently from one another.

I would be surprised if the results were different.

roysting • yesterday at 10:04 PM

There is no isolation of variables. This is not science. This is propaganda.

ETH_start • yesterday at 9:19 PM

A racially disparate outcome is not evidence of racial bias.

➕ show 1 reply

petesergeant • yesterday at 8:47 PM

I’m sure (really sure) there are real problems with AI and bias, but this is a weird study that isn’t looking at resumes or anything, it’s looking at how candidates did in some weird psychometric tests.

➕ show 1 reply

tamimio • yesterday at 8:41 PM

You don’t need a complicated study to find out, do it yourself for science. Get a resume, make few different versions but keep the context the same, change the layout (one time education on top other on bottom etc etc), and use different names to signal different backgrounds, and you can extend it to schools too and gender, and send it to the same employers, you will see wonders!!

I tried it before, and discrimination is there, I would get one resume rejected quickly and few days later the same company would invite another resume for a screening call. I tried this before and after AI hype, results weren’t that different btw, and that was tested in US and Canada employers only.

logicchains • yesterday at 8:26 PM

Could the AI actually see the race of the applicants? Or was it just discriminating on the basis of some factor it found that was correlated with race, like SAT scores?

➕ show 4 replies

everyone • yesterday at 8:09 PM

Its fucking crazy that people are using these systems for important tasks like hiring. They have zero understanding about how these systems work. And LLMs are absolutely not designed to do those sorts of jobs, they're designed to be chatbots and to fool a human conversing them that they are responding intelligently. Of course they're gonna be useless at other tasks.

(I assume they're just using a big LLM for this, it doesnt say, it just says "AI" when they say "AI like that they usually mean LLM".. A custom trained hiring ML system would be better)

➕ show 1 reply

GrinningFool • yesterday at 8:53 PM

[dead]

marsven_422 • yesterday at 9:55 PM

[dead]

huflungdung • yesterday at 8:41 PM

[dead]

anonreeeeplor • yesterday at 8:13 PM

[dead]

JuniperMesos • yesterday at 9:02 PM

[flagged]

➕ show 3 replies

jimmy76615 • yesterday at 9:43 PM

[flagged]

bakugo • yesterday at 8:22 PM

> To put this in perspective: If the AI had recommended Black and Asian candidates at the same rate as it recommended the most-favored group (typically white applicants)

Some people just can't help but put their biases on display at every opportunity, even when it comes to the most minute details.

➕ show 2 replies

jmyeet • yesterday at 9:09 PM

Many people seem to think racism begins and ends with using a slur. You can usually get a measure of this by seeing someone's reaction to the statement:

> There is no such thing as anti-white racism.

If you find yourself wanting to disagree with that then, I'm sorry but you simply don't know what racism is. Racism is pervasive, insidious and systemic.

A good example in the hiring space is what's called the "second syllable name problem". Traditionally Afrcian names often stress the second syllable (eg Jamal, Lakisha, Malik, Lashonda). Studies have shown that such names have higher rejection rates in job applications [1]. So if you're wondering about the four-fifths rule, it's because it exposes this kind of bias. It's not proof of bias. It simply means further investigation is required.

The problem with AI hiring tools is the logic is opaque. You have no idea why an AI system is rejecting or selecting candidates and you may find it's doing something illegal. Some companies want to hide behind this opaqueness, arguing that if no explicit decision was made then there is no bias. But that's not how system racism works.

There are many such signals that correlate with race that if they affect selection rate, it could be a problem. Did you go to an HBCU? Was your high school in a minority-majority area? What about your previous employers?

This kind of bias doesn't have to be intentional.

[1]: https://www.npr.org/2024/04/11/1243713272/resume-bias-study-...

➕ show 2 replies

jazz9k • yesterday at 8:51 PM

We can't take blanket percentages as a reason for racial bias. Were they all equally qualified?

Too many of these studies only focus on percentages and the end result is unqualified candidates getting hired from minority groups at the expense of qualified ones.

➕ show 1 reply

anonfunction • yesterday at 7:55 PM

This is something I've been working on exposing to AI labs through my startup LatentEvals[1], and found similar results in other industries from lending to insurance claims.

Happy to share some sample reports if anyone is interested!

1. https://www.latentevals.com/

➕ show 1 reply

alt Hacker News

Algorithmic Monocultures in Hiring

Comments