Module 4 · Lesson 1

Wrongful Arrests: When Face Recognition Gets It Wrong

Three Black men in Detroit. Three wrongful arrests. One broken algorithm.

What happens when a city outsources suspicion to a machine trained on the wrong faces?

Robert Williams was in his front yard in January 2020 when two Detroit police officers pulled up and arrested him in front of his wife and daughters. He was taken to a detention center and held overnight. The charge: shoplifting watches from a Shinola store. The evidence: a grainy surveillance image run through facial recognition software that matched him to a suspect. Williams had never been near that store.

He became the first documented case of a wrongful arrest caused by facial recognition in the United States. He was not the last in Detroit. Michael Oliver was arrested in 2019 for a felony assault he did not commit. Porcha Woodson — eight months pregnant — was arrested in 2023 on carjacking and robbery charges. All three were Black. All three were misidentified by the same city system.

The Algorithm Behind the Arrests

Detroit contracted with DataWorks Plus and used a system that ran probe images through a Michigan State Police database. Independent audits and academic research had already established that commercially available facial recognition systems misidentified Black faces at rates up to 100 times higher than white faces in some tests. A 2019 National Institute of Standards and Technology (NIST) study evaluated 189 algorithms and found the highest false-positive rates for African American and Asian faces — in some systems, up to 10–100 times higher than for white male faces.

The Detroit Police Department's own policy at the time allowed a facial recognition match alone — without corroborating evidence — to be used as a basis for arrest. An algorithm output was treated as probable cause.

Documented Record

Robert Williams' case was documented by the American Civil Liberties Union. Clare Garvie of Georgetown Law's Center on Privacy and Technology had warned as early as 2016 that half of all American adults were in a law enforcement face recognition network, and that the technology was least accurate on Black women. Detroit arrested three people using this technology before instituting reforms — none of the three committed the crime they were accused of.

The Harm Cascade

Liberty Lost

Williams was detained overnight. Oliver spent 10 days in jail before charges were dropped. Woodson was held for 11 hours while eight months pregnant.

Employment Risk

An arrest record — even without conviction — appears in background checks. Williams was a vice president at a technology company. Oliver lost his job.

Family Trauma

Williams was handcuffed in front of his children. His daughters asked their mother, "Are the police going to take Daddy away again?"

Systemic Pattern

All three Detroit facial-recognition wrongful arrests involved Black individuals. This is not coincidence — it reflects the documented accuracy gap in the underlying technology.

Why the Technology Fails Unevenly

Facial recognition systems are trained on datasets. The largest early datasets — including Labeled Faces in the Wild (LFW) — were scraped from the internet and were overwhelmingly white and male. When Joy Buolamwini of MIT's Media Lab tested commercial systems in 2018, she found error rates on darker-skinned women as high as 34.7% versus 0.8% for lighter-skinned men. Her paper, co-authored with Timnit Gebru, became known as the Gender Shades study.

When a system trained on imbalanced data is deployed in a high-stakes criminal justice context — where a wrong answer means someone is handcuffed in their driveway — the bias is not a technical footnote. It is an instrument of harm.

Key Terms

False Positive:The system says two images match when they do not — a misidentification that, in law enforcement, becomes a wrongful arrest.

Demographic Disparity:Error rates that differ significantly by race, gender, or age group, resulting in unequal exposure to harm.

Probable Cause:The legal standard for arrest — an algorithm output is not sufficient alone, yet it was used that way in Detroit.

Quiz — Lesson 1

Wrongful Arrests & Facial Recognition

1. Robert Williams was the first documented person wrongfully arrested due to facial recognition in the US. What city did this occur in?

Correct. Williams was arrested in Detroit in January 2020 after the city's facial recognition system misidentified him as a shoplifting suspect.

Not quite. It was Detroit, Michigan, in January 2020.

2. The 2019 NIST study found that some facial recognition algorithms had false-positive rates for African American faces how many times higher than for white faces?

Correct. The NIST FRVT 2019 report documented up to 10–100x higher false positive rates for African American and Asian faces in some commercial systems.

The NIST study found far larger disparities — up to 10–100 times higher for some algorithms.

3. The Gender Shades study by Joy Buolamwini and Timnit Gebru found error rates as high as 34.7% for which group?

Correct. Darker-skinned women faced up to 34.7% error rates versus just 0.8% for lighter-skinned men — a 43x gap.

The worst performance was on darker-skinned women, at up to 34.7% error rate.

4. Which of the following best describes why facial recognition systems perform worse on Black faces?

Correct. Datasets like Labeled Faces in the Wild skewed heavily toward white and male subjects, producing models that generalize poorly to other groups.

The core cause is dataset imbalance — training data that underrepresents Black and female faces leads to higher error rates for those groups.

Lab 1 — The Evidence Problem

Investigate the gap between algorithmic output and legal evidence

Your investigation

You are a civil liberties researcher examining how Detroit's facial recognition policy turned algorithm outputs into arrests. Your task: interrogate the AI about what constitutes legitimate evidence, why demographic accuracy gaps matter legally, and what reforms were needed.

Starter prompts: "Why shouldn't a facial recognition match alone be treated as probable cause?" · "What makes Detroit's case a turning point in AI bias documentation?" · "How do training data gaps translate into real-world harm for specific communities?"

AI Research Assistant

Lab 1

Welcome to Lab 1. I'm here to help you think through the evidentiary and legal dimensions of facial recognition wrongful arrests — focusing on the documented Detroit cases and what they reveal about deploying biased AI in high-stakes settings. What would you like to examine?

Module 4 · Lesson 2

Credit, Hiring & Housing: Bias in Automated Decisions

Amazon scrapped its AI hiring tool. Apple's credit card offered women lower limits. The pattern is not accidental.

When algorithms decide who gets a job, a loan, or a home — whose past becomes everyone's future?

Amazon engineers built a machine learning tool to screen resumes. They trained it on a decade's worth of résumés submitted to the company — a dataset that reflected Amazon's historically male workforce. The model learned a lesson its builders did not intend: penalize words associated with women. It downgraded résumés that mentioned "women's chess club." It penalized graduates of all-women's colleges. The project was abandoned in 2018. Reuters broke the story in October of that year.

Amazon's Résumé Tool: Training Data Is History

The engineers had attempted to build a neutral tool. But neutrality is not the same as fairness. When training data encodes historical discrimination — decades of tech hiring that favored men — a model trained to replicate "successful" hires learns to perpetuate that discrimination automatically. The tool assigned scores of 1–5 to candidates. Amazon's own engineers noticed gender-correlated penalties only after the model was in use. They scrapped it entirely rather than trying to fix it.

The case illustrates a foundational principle: past patterns are not a neutral guide to future merit. If hiring was biased before, training on it produces a model that automates the bias.

Documented Case

Source: Reuters, "Amazon scraps secret AI recruiting tool that showed bias against women," October 10, 2018. The system was built by Amazon's machine learning specialists in Edinburgh and was used experimentally — Amazon said it was never used by recruiters to evaluate candidates, though it had been in development for years.

The Apple Card: Algorithmic Redlining in Credit

In November 2019, software developer David Heinemeier Hansson tweeted that Apple Card's algorithm had given him 20 times the credit limit of his wife — despite their filing joint taxes and her having a higher credit score in some measures. Apple co-founder Steve Wozniak reported the same experience with his wife. New York's Department of Financial Services launched an investigation.

Goldman Sachs, which issued the card, stated that gender was not used as a variable. But that is irrelevant if correlated proxies — ZIP code, income type, length of credit history in individual rather than joint accounts — produce gender-skewed outcomes. This is called proxy discrimination: the protected characteristic is not in the model, but variables correlated with it produce equivalent disparate impact.

20×

Higher credit limit David Heinemeier Hansson received vs. his wife on Apple Card (2019)

2018

Year Amazon scrapped its AI résumé-screening tool after internal discovery of gender bias

58%

Proportion of mortgage algorithms studied in a 2019 UC Berkeley paper that charged Black and Latino borrowers more — even with equivalent credit risk

Algorithmic Redlining in Mortgage Lending

A 2019 study by economists at UC Berkeley, published in PLOS ONE, analyzed 30 million mortgage records. Fintech lenders — using algorithmic models — charged Black and Latino borrowers approximately 0.08% higher interest rates on purchase mortgages and 0.14% higher on refinances, compared to similar white borrowers with equivalent financial profiles. The researchers estimated this cost minority borrowers approximately $765 million per year collectively.

The Fair Housing Act prohibits discrimination in mortgage lending. But when the discriminatory output is generated by a model, with no human explicitly making a biased decision, enforcement becomes complex. The harm is real. The accountability is diffuse.

Key Terms

Proxy Discrimination:Using variables that correlate with a protected characteristic (race, gender) to achieve effectively discriminatory outcomes without explicitly using that characteristic.

Disparate Impact:When a facially neutral policy or algorithm produces outcomes that disproportionately harm a protected group — illegal under US civil rights law even without discriminatory intent.

Feedback Loop:When biased outputs become training data for future models, compounding the original bias over time.

Quiz — Lesson 2

Credit, Hiring & Housing Algorithmic Bias

1. Why did Amazon's AI résumé tool penalize applications from women's colleges?

Correct. Training on a decade of résumés from Amazon's historically male tech workforce taught the model to associate maleness with successful candidates.

No explicit rule existed. The bias emerged from training on historical hiring data that reflected gender imbalance.

2. The Apple Card credit limit controversy is an example of which type of algorithmic bias?

Correct. Goldman Sachs stated gender was not used, but variables correlated with gender may have produced gender-disparate outcomes — proxy discrimination.

This is proxy discrimination: gender wasn't directly in the model, but correlated variables produced gender-skewed credit limits.

3. The 2019 UC Berkeley mortgage study found that fintech algorithms charged Black and Latino borrowers how much more on refinances compared to equivalent white borrowers?

Correct. The study found approximately 0.14% higher rates on refinances for Black and Latino borrowers — modest per loan, but totaling ~$765 million annually across all borrowers.

The study found approximately 0.14% higher rates on refinances — small per loan, but massive in aggregate impact.

4. Under US civil rights law, disparate impact means a policy can be illegal even if:

Correct. Disparate impact doctrine holds that a facially neutral policy that produces discriminatory outcomes can violate civil rights law even without discriminatory intent.

Disparate impact law focuses on outcomes, not intent — a policy with no discriminatory purpose can still be illegal if it produces discriminatory results.

Lab 2 — The Proxy Problem

Trace how neutral-seeming variables become discriminatory instruments

Your investigation

You are an algorithmic fairness auditor reviewing automated decision systems in lending and hiring. Your task: interrogate the AI about how variables that seem neutral can function as discriminatory proxies, and what audit mechanisms could detect this.

Starter prompts: "What variables in a credit model might function as proxies for race without explicitly including race?" · "If Amazon's tool was never officially used, does it still matter that it was biased?" · "How would you audit a hiring algorithm for gender proxy discrimination?"

AI Research Assistant

Lab 2

Welcome to Lab 2. I'm here to help you think through proxy discrimination in automated decision systems — focusing on credit, hiring, and mortgage lending cases. What would you like to examine?

Module 4 · Lesson 3

Healthcare Algorithms: Who Gets Sick Care

A widely used health algorithm concluded Black patients were healthier than white patients with the same conditions — because it measured cost, not illness.

When an algorithm optimizes for the wrong variable, who pays the price?

In 2019, researchers at UC Berkeley and University of Chicago published a study in Science that exposed a significant racial bias embedded in a healthcare algorithm used by hospitals and insurers serving over 200 million people. The algorithm was built by Optum, a subsidiary of UnitedHealth Group. Its purpose: identify high-risk patients who would benefit from extra care management. The algorithm systematically ranked Black patients as healthier than equally sick white patients — reducing their access to additional care by roughly half.

The Measurement Problem

The researchers — Ziad Obermeyer, Brian Powers, Christine Vogeli, and Sendhil Mullainathan — found that the algorithm used healthcare cost as a proxy for healthcare need. This seems intuitive: sicker people cost more to treat. But the assumption breaks down when the healthcare system itself is racially unequal.

Because Black Americans historically have less access to healthcare, face more barriers to care, and on average have lower incomes and less insurance coverage, they generate lower healthcare costs for the same level of illness. The algorithm interpreted lower cost as lower need. It concluded Black patients were healthier — not because they were, but because the proxy variable was corrupted by systemic inequity.

At the same severity threshold, Black patients were given risk scores 3.5 percentile points lower than equivalent white patients. This meant Black patients were significantly less likely to be enrolled in care management programs that could improve their outcomes.

200M+

Americans whose care may have been affected by this algorithm at the time of the 2019 study

~50%

Reduction in Black patients flagged for extra care relative to white patients with equivalent medical need

46.5%

Share of Black patients who would have been enrolled in care management had the algorithm used health status directly, rather than cost — vs. the 17.7% who were enrolled

After the Study: Optum's Response

Optum acknowledged the findings and said it had begun working to reduce the disparity. The company stated it disagreed that the algorithm was biased but committed to recalibrate it. The researchers noted that replacing healthcare cost with health status as the target variable could reduce measured disparity by 84%.

The case is important because the algorithm was not built with malicious intent. The designers did not set out to reduce Black patients' care. But by using a historically skewed proxy, they reproduced systemic healthcare inequality at scale — and automated it into tens of millions of clinical decisions annually.

The Science Paper

Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). "Dissecting racial bias in an algorithm used to manage the health of populations." Science, 366(6464), 447–453. This is one of the most-cited empirical demonstrations of AI health bias and was instrumental in triggering industry and regulatory attention to algorithm auditing in healthcare.

The Broader Healthcare AI Landscape

The Optum case is not isolated. A 2021 analysis in JAMA Internal Medicine found that clinical AI tools for predicting sepsis, pneumonia readmission, and other conditions showed significant performance differences across racial groups — generally performing worse on Black patients. A 2022 study found pulse oximeters — a hardware/software combination — had been overestimating blood oxygen in Black patients for decades, leading to undertreatment of hypoxia. This error, compounded by COVID-19 severity decisions, had direct consequences for patient mortality.

From diagnostic algorithms to triage tools to care management scores, the pattern is consistent: systems trained or calibrated on non-representative data, or using proxies distorted by systemic inequality, produce less accurate and often more harmful outputs for Black patients.

Key Terms

Proxy Variable Corruption:When a variable used as a stand-in for the true target (e.g., cost for health need) is itself distorted by the very inequity the system aims to address.

Care Management Algorithm:Predictive tools used by hospitals and insurers to identify high-risk patients for additional support — consequential because they affect who receives proactive intervention.

Construct Validity:Whether a model is actually measuring what it claims to measure. The Optum algorithm claimed to measure health need but measured cost — poor construct validity.

Quiz — Lesson 3

Healthcare Algorithms & Systemic Harm

1. The Optum healthcare algorithm used what variable as a proxy for patient health need?

Correct. The algorithm used healthcare cost as a proxy for health need — a measure that was itself distorted by unequal access to care across racial groups.

The proxy was healthcare cost — which is corrupted by unequal healthcare access, making it a poor measure of true health need.

2. Why did using cost as a proxy systematically underestimate Black patients' health needs?

Correct. Systemic barriers to care — lower insurance rates, financial obstacles — mean Black patients spend less on healthcare for the same level of illness, making cost a racially biased proxy for need.

The issue is that systemic inequality means Black patients access healthcare less, spending less for equal illness — so cost is a biased proxy for health need.

3. What did the researchers find would happen if health status replaced cost as the algorithm's target variable?

Correct. Obermeyer et al. estimated that replacing cost with direct health status measures could reduce the racial disparity in algorithmic care management by roughly 84%.

The researchers found the fix was available: replacing cost with health status could reduce disparity by approximately 84%.

4. The pulse oximeter bias study found that pulse oximeters overestimated blood oxygen in Black patients. What was the consequence of this hardware/software bias?

Correct. Overestimated oxygen readings led clinicians to underestimate hypoxia severity in Black patients — a bias with potentially fatal consequences, especially during COVID-19.

The consequence was undertreatment — if oxygen appears sufficient, clinicians don't intervene. This had life-threatening consequences especially during COVID-19.

Lab 3 — Measuring What Matters

When a proxy is corrupted by the very inequity you're trying to address

Your investigation

You are a healthcare policy researcher advising a hospital system on algorithm auditing. The hospital uses a commercial care management tool. Your task: interrogate the AI about construct validity, proxy corruption, and what a fair healthcare algorithm would actually measure.

Starter prompts: "What would a healthcare algorithm need to measure directly to avoid the cost-as-proxy problem?" · "How can construct validity be tested for a clinical AI tool?" · "If the Optum fix was known in 2019, why would hospitals still be using biased algorithms in 2024?"

AI Research Assistant

Lab 3

Welcome to Lab 3. I'm here to help you think through algorithmic bias in healthcare — focusing on the Optum case, proxy variable corruption, and what fair clinical AI requires. What would you like to examine?

Module 4 · Lesson 4

Content Moderation & Surveillance: Who Gets Silenced, Who Gets Watched

Meta's systems removed Black Lives Matter posts. Clearview AI scraped 30 billion faces without consent. The power to see and silence is not distributed equally.

When AI decides whose speech is dangerous and whose face is a threat, what does it encode about power?

During the 2020 Black Lives Matter protests, civil rights organizations documented a pattern: Facebook's content moderation systems were removing or suppressing posts by Black users discussing their experiences with police violence, while posts by white supremacist groups remained up for days or weeks. The ACLU and Color of Change filed formal complaints. Internal Meta documents later leaked by Frances Haugen confirmed that Facebook was aware of differential enforcement across racial groups in its automated systems.

This was not a new problem. A 2017 ProPublica investigation had already found that Facebook's hate speech classifiers were twice as likely to flag posts from Black users as from white users for equivalent content. The algorithm had learned to associate certain vernacular language patterns with "hate speech" — but those patterns were disproportionately associated with African American Vernacular English (AAVE).

AAVE and the Vernacular Penalty

In 2021, a Stanford study published in Nature analyzed NLP models' performance on African American Vernacular English. The researchers found that state-of-the-art language models had error rates approximately 2.5 times higher on AAVE text than on Mainstream American English — and that automated speech recognition systems misidentified Black speakers' audio 35% more often than white speakers' audio. When content moderation systems fail to parse AAVE accurately, Black users face disproportionate removal, suppression, and suspension.

This is not hypothetical. The 2020 congressional testimony of civil rights groups before the House Judiciary Committee included documented accounts of Black activists being suspended during the precise period when organizing for social justice was at its peak. The bias in the moderation algorithm had direct political consequences.

Documented Case — ProPublica 2017

Julia Angwin and Hannes Grassegger at ProPublica tested Facebook's hate speech algorithm in 2017 and found it protected white men as a group but not Black children. Posts attacking Black children were left up; posts attacking white men were removed. The disparity traced to how the algorithm categorized "protected groups" and which vernacular patterns it associated with harm.

Clearview AI and Mass Facial Surveillance

In January 2020, the New York Times revealed that Clearview AI had scraped over 3 billion photos from Facebook, YouTube, Venmo, and millions of other websites to build the largest known facial recognition database. By 2023, Clearview claimed to have over 30 billion images. The company sold access to law enforcement agencies across the United States — at least 600 by 2020 — and internationally.

Clearview's database was built without the consent of the people photographed. Multiple US states and the FTC took action. The EU found Clearview in violation of GDPR and issued fines. But the core harm — that a private company had unilaterally enrolled tens of millions of Americans in a surveillance network — was not reversible. The data, once collected, could not be meaningfully erased.

The intersection with racial bias is direct: because facial recognition systems perform worst on Black faces, a mass surveillance system built on that technology subjects Black communities to both more surveillance and more misidentification — a dual burden. A 2021 investigation by the ACLU found that Clearview was being used by ICE, local police, and retail companies with essentially no oversight, training requirements, or error rate disclosure.

30B+

Facial images Clearview AI scraped without consent by 2023

2.5×

Higher NLP error rate on AAVE vs. Mainstream American English (Stanford, 2021)

2×

Higher rate at which Facebook's algorithm flagged Black users' posts as hate speech vs. white users' equivalent posts (ProPublica, 2017)

The Asymmetry of Power

Content moderation and surveillance are not neutral technical functions. They are exercises of power — power to amplify or silence, to identify or misidentify, to act or be acted upon. When the algorithms that perform these functions are less accurate and more punitive toward Black, Indigenous, and other marginalized communities, they do not simply malfunction. They reproduce the existing distribution of power in digital form.

What makes these cases distinctive from earlier lessons is their political valence. A biased credit algorithm costs money. A biased content moderation system can suppress political organizing. A biased surveillance system can subject entire communities to state scrutiny without recourse. The harm is not merely material — it is civic and democratic.

Key Terms

AAVE (African American Vernacular English):A linguistically distinct variety of English with its own grammatical rules, spoken primarily by African Americans. NLP models trained on standard English perform worse on AAVE, causing disparate moderation and misrecognition.

Mass Scraping:The automated collection of public data at scale without individual consent — the method Clearview used to build its database.

Civic Harm:Harm that affects a person's capacity to participate in political and social life — distinct from economic harm and often more difficult to quantify or redress.

Quiz — Lesson 4

Content Moderation & Surveillance Bias

1. A 2017 ProPublica investigation found Facebook's hate speech classifier was how much more likely to flag Black users' posts compared to equivalent posts from white users?

Correct. ProPublica's 2017 testing found Facebook's automated hate speech classifier flagged Black users' posts at roughly twice the rate of white users for equivalent content.

ProPublica found the algorithm was approximately twice as likely to flag Black users' posts as hate speech for equivalent content.

2. Why do NLP-based content moderation systems perform worse on AAVE (African American Vernacular English)?

Correct. NLP models trained predominantly on Mainstream American English have higher error rates on AAVE — a dialect with distinct grammatical rules that the model wasn't adequately trained to parse.

The issue is training data composition. Models skewed toward mainstream English fail to accurately parse AAVE's distinct grammatical patterns.

3. How did Clearview AI build its database of over 30 billion facial images?

Correct. Clearview scraped billions of photos from Facebook, YouTube, Venmo, and other public websites without consent — a practice challenged by regulators in the US and EU.

Clearview scraped photos from public websites at massive scale — no consent from the individuals pictured was obtained or sought.

4. What distinguishes "civic harm" from the economic harms of biased credit or hiring algorithms?

Correct. When biased content moderation suppresses political organizing, the harm is to democratic participation — civic harm that may be harder to quantify but affects fundamental rights.

Civic harm goes beyond financial loss — it affects the capacity to participate in political and social life, including organizing and free expression.

Lab 4 — Power, Speech & Surveillance

Interrogate who controls the algorithms that decide who is heard and who is watched

Your investigation

You are a digital rights policy analyst examining AI bias in content moderation and facial surveillance. Your task: interrogate the AI about the political dimensions of algorithmic power, the Clearview AI controversy, and what democratic oversight of these systems would require.

Starter prompts: "What oversight mechanisms could prevent companies like Clearview from building mass surveillance databases?" · "How does biased content moderation affect political organizing specifically?" · "What would platform accountability for AAVE-skewed moderation actually look like in practice?"

AI Research Assistant

Lab 4

Welcome to Lab 4. I'm here to help you think through algorithmic power in content moderation and facial surveillance — focusing on documented cases and what democratic accountability for these systems would require. What would you like to examine?

Module 4 — Test

Real People, Real Harm · 15 questions · 80% to pass

1. Robert Williams was wrongfully arrested in Detroit in January 2020. What was the basis for his arrest?

Correct. Williams was arrested after Detroit's facial recognition system matched him to a grainy surveillance image from a Shinola store — a match that was wrong.

The arrest was based solely on a facial recognition match — no corroborating evidence placed Williams near the store.

2. What common characteristic did all three Detroit facial recognition wrongful arrests share?

Correct. Williams, Oliver, and Woodson were all Black — consistent with documented higher false-positive rates for Black faces in facial recognition systems.

All three wrongfully arrested individuals in Detroit were Black — reflecting the documented accuracy gap in facial recognition for darker-skinned faces.

3. The NIST 2019 facial recognition evaluation tested how many algorithms?

Correct. NIST's FRVT 2019 evaluated 189 facial recognition algorithms and found the highest false-positive rates for African American and Asian faces.

NIST evaluated 189 algorithms in its 2019 Face Recognition Vendor Test (FRVT).

4. Amazon's AI résumé screening tool penalized graduates of all-women's colleges because:

Correct. Training on a decade of Amazon's male-skewed tech hires taught the model to associate maleness with "successful candidate" — penalizing markers of womanhood.

Training data bias was the cause — the model learned from historical hiring patterns that heavily favored men in tech.

5. The Apple Card credit limit controversy involving David Heinemeier Hansson and his wife is an example of:

Correct. Goldman stated gender wasn't used, but correlated variables may have functioned as gender proxies — producing discriminatory outcomes without explicit discriminatory intent.

This is proxy discrimination — gender wasn't directly in the model, but correlated variables may have produced gender-disparate credit limits.

6. The UC Berkeley 2019 mortgage study found fintech algorithms charged Black and Latino borrowers approximately how much more on purchase mortgages?

Correct. The study found approximately 0.08% higher rates on purchase mortgages — modest per loan, but collectively approximately $765 million per year in excess costs borne by minority borrowers.

The study found approximately 0.08% higher rates on purchase mortgages — small individually but massive at population scale.

7. The Optum health algorithm published in Science in 2019 — roughly how many Americans did it affect?

Correct. The researchers estimated the algorithm was in use in systems serving over 200 million Americans at the time of publication.

The study estimated the algorithm affected systems serving over 200 million Americans — making it one of the largest documented instances of AI health bias.

8. The Optum algorithm used healthcare cost as a proxy for health need. Why was this problematic for Black patients specifically?

Correct. Systemic healthcare inequality means Black patients, facing more barriers to care, spend less on healthcare for the same level of illness — making cost a racially biased proxy for health need.

The proxy was corrupted by inequality: Black patients face more barriers to care and thus generate lower costs for equivalent illness — the algorithm misread this as lower need.

9. What percentage reduction in racial disparity did Obermeyer et al. estimate was achievable by replacing cost with health status as the algorithm's target?

Correct. The researchers estimated an approximately 84% reduction in the racial disparity by using direct health status measures rather than cost as the optimization target.

The estimated reduction was approximately 84% — showing that much of the disparity was attributable to the choice of proxy variable, not fundamental limitations of prediction.

10. A 2021 Stanford study in Nature found that NLP models had error rates on AAVE approximately how much higher than on Mainstream American English?

Correct. The Stanford study found approximately 2.5× higher error rates on AAVE — meaning content involving African American language patterns is more often misclassified by automated systems.

The Stanford study found approximately 2.5× higher error rates on AAVE versus Mainstream American English.

11. How many facial images did Clearview AI claim to have in its database by 2023?

Correct. Clearview claimed over 30 billion images by 2023 — scraped without consent from public websites and social media platforms.

Clearview claimed over 30 billion images — all scraped without the consent of individuals photographed.

12. Which law did EU regulators cite when issuing fines against Clearview AI?

Correct. Multiple EU data protection authorities found Clearview in violation of GDPR and issued significant fines for scraping biometric data without consent.

EU regulators cited GDPR — the General Data Protection Regulation — which requires consent for processing biometric data.

13. "Disparate impact" in civil rights law means a policy can be found illegal even without discriminatory intent. This principle applies to:

Correct. Disparate impact doctrine under US civil rights law covers discriminatory outcomes produced by any policy or practice — including algorithms — regardless of whether discrimination was intended.

Disparate impact focuses on outcomes, not intent — and applies across employment, lending, and housing under various federal statutes.

14. Porcha Woodson, wrongfully arrested by Detroit's facial recognition system in 2023, was in what condition at the time of her arrest?

Correct. Porcha Woodson was eight months pregnant when Detroit arrested her on carjacking and robbery charges based on a facial recognition misidentification.

Woodson was eight months pregnant when she was arrested — the case drew particular attention as one of the most egregious examples of the technology's misuse.

15. Across all four lessons in this module — facial recognition, economic algorithms, healthcare, and content moderation — what is the most consistent pattern?

Correct. Whether in criminal justice, credit, healthcare, or speech — the consistent pattern is that AI systems ingest historical inequality and produce outputs that perpetuate and scale it, with Black and other marginalized communities bearing disproportionate harm.

The consistent pattern across all four domains: AI systems trained on historically unequal data reproduce that inequality at scale, with disproportionate harm to Black and marginalized communities.