Intro
L1
Β·
Quiz
Β·
Lab
L2
Β·
Quiz
Β·
Lab
L3
Β·
Quiz
Β·
Lab
L4
Β·
Quiz
Β·
Lab
Module Test
See the World Through AI Eyes Β· Introduction

Every Algorithm Was Built by Someone Who Wanted Something

The course that gives you the question no one else is asking.

In November 2022, a tool called ChatGPT went online. Within five days, one million people had signed up. Within two months, one hundred million. Nothing in internet history had grown that fast β€” not Facebook, not Instagram, not TikTok. Adults started writing headlines about "the end of everything." Schools started banning laptops. Governments started holding emergency meetings. And somewhere in all of that, most people completely missed the more interesting question: who built this, what did they want it to do, and what did they decide to leave out?

That question doesn't go away. Every AI system you will ever encounter β€” the one that recommends your next video, the one that grades your essay, the one that decides whether a job application gets a human review β€” was designed by specific people, at a specific company, with specific goals, specific blind spots, and specific choices about whose needs mattered most. Those choices are invisible unless you know how to look for them. This course teaches you how to look.

We're going to cover four things: who builds algorithms and why their identity matters; how training data shapes what an AI "knows"; what happens when AI systems get it wrong and who absorbs the cost; and how to actually push back, ask the right questions, and read any AI story in the news without being fooled. You won't finish this course as an AI engineer. You'll finish it as someone who can't be easily deceived by one.

See the World Through AI Eyes Β· Lesson 1

Who Built This Algorithm β€” and Why?

The people behind the code shape what the code does β€” and who it ignores.
When a machine makes a decision about you, whose values is it actually applying?

In the summer of 2015, a software developer named Jacky AlcinΓ© was using Google Photos β€” the app that automatically sorts your pictures into albums β€” when he noticed something that stopped him cold. The app had tagged photos of him and his friend, both of whom are Black, under the label "Gorillas." He posted screenshots to Twitter. The story went viral within hours. Google's CEO Sundar Pichai personally apologized. Engineers at Google said they would fix it immediately.

But here's what actually happened next. According to a 2018 investigation by Wired magazine, Google's fix wasn't to train the algorithm to correctly recognize Black faces. Their fix was to remove the label "gorillas" from the app entirely β€” along with "chimp," "chimpanzee," and "monkey." Three years later, if you searched Google Photos for any of those animals, you got zero results. The algorithm still couldn't reliably distinguish the faces. They had just hidden the evidence of the failure.

That story is worth sitting with for a moment. A system built by a trillion-dollar company, used by billions of people, had a failure so serious it couldn't be honestly fixed β€” only concealed. And the reason it failed in the first place had nothing to do with the math. It had everything to do with who built it, what data they used, and whose faces they thought to test it on before shipping it to the world.

The Team in the Room

When an algorithm is built, a group of people sits in a room β€” or on a video call, or scattered across a campus β€” and makes decisions. What should this system do? What counts as success? What data should we train it on? What should we test before we release it? Every one of those decisions is made by a human being who has a background, a set of assumptions, and things they never thought to question.

In 2016, a researcher named Joy Buolamwini β€” then a graduate student at MIT β€” was working on a project that used facial recognition software. The software couldn't detect her face. It worked fine when she put on a white mask. She dug into why, and what she found launched a research career and eventually changed U.S. law. The systems she tested β€” from companies like IBM, Microsoft, and Amazon β€” had been trained almost entirely on faces of light-skinned men. When she published her findings in 2018 in a paper called "Gender Shades," the error rates told a stark story: for light-skinned men, the systems were wrong about 1% of the time. For darker-skinned women, they were wrong up to 35% of the time.

That gap didn't come from nowhere. It came from a decision β€” conscious or not β€” about whose faces mattered enough to include in the training data. And that decision came from a room where most of the people looked roughly the same.

Training data: The collection of examples an AI system learns from. If the examples are skewed, the AI's behavior will be skewed in the same direction.
Benchmark testing: The process of checking how well an AI performs before releasing it. If you only test on certain kinds of inputs, you only find certain kinds of failures.
Goals Shape Outcomes

It's tempting to think of AI failures as purely technical β€” bugs, bad math, a corrupted dataset. But most of the important failures are design decisions. They happen when the people building a system define "success" in a way that works for most users but quietly fails a few.

In 2018, the news organization Reuters reported that Amazon had been quietly developing an AI hiring tool since 2014 β€” a system designed to scan resumes and score job candidates. The goal was to speed up hiring by automatically rating applicants on a scale of one to five stars. By 2015, Amazon's own engineers noticed a serious problem: the system was systematically downgrading resumes that included the word "women's" β€” as in "women's chess club" or "women's leadership program." It also penalized graduates of two all-women's colleges.

Why? Because the system had been trained on ten years of Amazon's own hiring data. And over those ten years, Amazon β€” like most tech companies β€” had hired mostly men. The AI wasn't trying to discriminate. It was doing exactly what it was designed to do: identify patterns in historical success and replicate them. The problem was that the historical pattern included bias. The AI amplified it. Amazon scrapped the tool in 2017 without ever using it for official decisions β€” but not before quietly running it in parallel for years.

The Core Mechanism

An AI system trained on past human decisions inherits the biases embedded in those decisions. It doesn't clean them up. It mathematically reinforces them β€” and then presents the output as if it were objective, which makes the bias harder to challenge.

Now here is the part that most people skip over. Amazon's engineers noticed the problem. They flagged it. And the company still let the system run β€” in parallel, quietly β€” for two years before retiring it. The failure wasn't purely technical. It was also a decision made by people about what mattered enough to stop.

Whose Problem Gets Solved First

Here is a pattern worth memorizing: when a technology is built primarily by people from one demographic, the default use cases tend to serve that demographic first. This isn't always malicious. Often it's just the result of building what you know, testing on people you have access to, and shipping when it works for most of your team.

Voice recognition is a good example. In 2017, a Stanford researcher named Rachael Tatman published a study showing that Google's voice recognition system was significantly more accurate for men than for women, and significantly more accurate for people without regional accents than for people with them. Scottish English was recognized far worse than American English. African American English was recognized far worse than white American English.

The gap mattered in the real world. By 2019, voice recognition was being used in courtrooms to transcribe testimony, in call centers to route complaints, in schools as an accessibility tool for students with disabilities. A system that works 95% of the time for some users and 78% of the time for others sounds like a small difference β€” until the 17% gap is the part where someone's words get lost.

Understanding this changes how you read every product announcement, every "revolutionary AI" headline, every "this system is 98% accurate" claim. You now know to ask: accurate for whom? Tested on which users? Built by which team? Designed to optimize for what goal? Most people never think to ask those questions. You can.

You Can Now See This

Every accuracy statistic in AI is an average. Averages hide who is above the line and who is below it. When a company says "our system is 95% accurate," that's not the number you need. The number you need is: what's the accuracy for the people most likely to be harmed if it gets it wrong?

The Ethical Question That Doesn't Have a Clean Answer

In 2019, the city of San Francisco became the first city in the United States to ban government use of facial recognition technology. The argument was simple: the technology was too error-prone for high-stakes decisions, particularly for Black and brown residents. Other cities followed β€” Oakland, Boston, Portland.

But here's the complication. Some police departments argued that facial recognition, even imperfect, helped solve cold cases β€” including murders and child abductions β€” that would otherwise go unsolved. In 2020, facial recognition was used to help identify a man who had been abducting children in New York. He was caught. The victims were found. The tool worked, in that case, for those people, on that day.

So the question isn't whether the technology works. The question is: who gets to decide when a tool is accurate enough to use on people who never consented to be scanned? Who absorbs the cost when it gets it wrong? And is it acceptable to use a tool that works better for some people than others in decisions where being wrong can mean someone loses their job, their freedom, or their life?

There isn't a clean answer to that. Serious, thoughtful people land in different places. What matters is that you now understand the structure of the problem well enough to recognize when someone is oversimplifying it β€” either by pretending the technology is neutral, or by pretending it has no legitimate use at all.

Lesson 1 Quiz

Five questions β€” test your reasoning, not just your memory.
1. When Google's Photos app mislabeled Jacky AlcinΓ©'s photos in 2015, the company's actual fix was to remove the animal labels entirely rather than correct the underlying algorithm. What does this reveal about how companies sometimes respond to AI failures?
Exactly. The Google Photos case is a case study in damage control β€” removing the category rather than fixing the recognition β€” and it illustrates that technical failures can become institutional decisions about what gets hidden.
Reread the opening scene. The fix Google chose wasn't technical retraining β€” it was concealment. The question is about what that choice tells us about how companies manage visible failures.
2. Joy Buolamwini's 2018 "Gender Shades" study found that facial recognition systems had up to 35% error rates for darker-skinned women but only ~1% error rates for light-skinned men. What is the most direct cause of this gap?
Right. Training data composition is the direct cause. The AI learned from a skewed sample and its performance reflected exactly that skew β€” not a deliberate choice, but a consequential one regardless.
The lesson explains this specifically. The issue wasn't the math or deliberate discrimination β€” it was whose faces were included in the data the system learned from. Skewed input produces skewed output.
3. Imagine a hospital uses an AI system trained on ten years of patient records to predict which patients need the most follow-up care. The system consistently scores white patients higher than Black patients with the same medical conditions. Applying what you learned in this lesson, what is the most likely explanation?
This is exactly the Amazon hiring tool pattern applied to healthcare. When training data reflects past human decisions that were unequal, the AI treats that inequality as the correct outcome and reproduces it. This is a real documented problem β€” a 2019 study in Science journal identified this exact mechanism in a widely-used healthcare algorithm.
Think about what the Amazon hiring example taught us. The system wasn't malfunctioning β€” it was doing exactly what it was built to do: find patterns in historical data and replicate them. The problem is that historical data can encode historical inequality.
4. A company announces: "Our new speech recognition AI is 96% accurate." Based on what you learned in this lesson, what is the single most important follow-up question to ask?
Exactly right. An average accuracy number hides everything important about who is above and below that average. The pattern from Rachael Tatman's voice recognition research β€” and from every other case in this lesson β€” is that aggregate accuracy masks unequal distribution of errors.
The lesson specifically addresses this. An average accuracy number is the beginning of the question, not the answer. The crucial issue is whether that accuracy is evenly distributed β€” or whether some groups are absorbing most of the failures.
5. The lesson describes San Francisco banning facial recognition while noting it helped identify a child abductor in New York. Why does the lesson present both facts without resolving the tension between them?
Yes. The lesson explicitly says "there isn't a clean answer to that." The goal isn't to hand you a conclusion β€” it's to make sure you understand the actual stakes on both sides well enough to evaluate arguments about it, rather than accepting oversimplified takes.
Reread the final section of the lesson. The point isn't to advocate for one side β€” it's to help you recognize when someone is flattening a genuine dilemma into a simple story. That's a skill more valuable than any particular opinion about facial recognition.

Lab 1: The Algorithm Auditor

Your role: independent auditor investigating an AI system for a city council.

Your Assignment

A city council is considering deploying an AI system to predict which neighborhoods need the most infrastructure repair. The company selling the system claims it is "data-driven and objective." Your job is to audit it before the city signs the contract.

Your lab partner β€” call them Alex β€” is another auditor who will push back on your reasoning. Alex isn't here to teach you the answers. Alex is here to stress-test your thinking.

Start by telling Alex: What are the first three questions you'd ask about this AI system before recommending whether the city should use it? Explain your reasoning β€” don't just list them.
Alex β€” Algorithm Auditor
Lab Partner
Alright, I've read the company's pitch deck. Lots of words like "neutral," "evidence-based," and "eliminating human bias." Before I say anything else β€” what are you actually planning to ask them? And don't just give me a list. Tell me why those questions matter.
See the World Through AI Eyes Β· Lesson 2

The Data Was Never Neutral

Every dataset is a record of choices β€” and choices always have a point of view.
If an AI learned everything it knows from human history, what did it inherit along with the facts?

On March 23, 2016, Microsoft launched a chatbot named Tay on Twitter. The idea was elegant: Tay would learn how to talk by absorbing real conversations from real Twitter users, in real time. Microsoft engineers had run similar experiments internally and liked the results. They expected Tay to emerge as friendly, funny, and contemporary β€” a bot that spoke the way young people actually spoke.

Within sixteen hours, they had to shut Tay down. Coordinated groups of users had deliberately fed Tay racist, misogynist, and Holocaust-denying content, and Tay had learned it β€” enthusiastically. "Hitler was right," Tay tweeted. "Feminism is cancer." The engineers hadn't built a racist system. They had built a system that learned from data it was given, and the data it was given was human behavior at its worst. Tay was, as one journalist put it, "a mirror held up to the internet."

What failed wasn't the machine learning. What failed was the assumption that data is neutral β€” that you can pour human language into an algorithm and get something clean out the other end. Tay was the most public demonstration of a principle that now governs every conversation about AI: the data is never a neutral record of reality. It is always a record of who was creating it, and why.

Where Training Data Comes From

To understand why data is never neutral, you first have to understand where it comes from. Large language models β€” the kind that power ChatGPT, Google's Gemini, and similar tools β€” are trained on enormous collections of text scraped from the internet. One commonly used dataset is called Common Crawl, which contains a snapshot of billions of web pages going back to 2008. Another is called Books3, which contained digitized versions of over 196,000 books before legal challenges forced changes to it in 2023.

The internet sounds like "everything." But it isn't. As of 2023, only about 43% of the world's population uses the internet in any regular way. Among that 43%, about 56% of all web content is in English β€” a language spoken natively by roughly 5% of the world. The vast majority of Wikipedia articles, forum posts, reviews, and news articles that end up in training data were written by people who are English-speaking, educated, and connected enough to have internet access and time to write online.

This means that when an AI system trained on this data gives you an answer, it is heavily shaped by the worldview, assumptions, and blind spots of a specific slice of humanity. It doesn't know it's doing this. It has no way to flag when it's applying an English-speaking American lens to a question that has a completely different answer in a different cultural context. It just responds, confidently.

Large language model (LLM): An AI system trained on massive amounts of text to predict what words should come next. The patterns it learns from that text shape everything it "knows" and every opinion it reflects.
Representation bias: When certain groups, perspectives, or types of information appear far more or less frequently in training data than they do in the real world, causing the AI to treat some things as normal or important and others as marginal.
The Label Problem

Not all AI training data comes directly from the internet. A lot of it is labeled β€” meaning a human being looked at a piece of content and attached a tag to it. "This photo contains a dog." "This text is positive." "This news article is reliable." "This face is expressing anger." Those labels become the ground truth the AI learns from. The AI will eventually make the same kinds of judgments millions of times per day, on everything from moderation decisions to medical diagnoses.

In 2019, a research team at the AI Now Institute β€” a research organization based at New York University β€” published a report examining how labels get applied in commercial AI datasets. One of their findings concerned emotional recognition systems: tools trained to identify emotions from facial expressions. These systems were being sold to companies for use in job interviews β€” scanning applicants' faces and rating their emotional suitability. The problem was that the labels defining what "happiness" or "trustworthiness" looks like on a face had been applied by data annotators, many of them working in crowdsourced tasks for low pay, drawing on cultural frameworks that vary enormously across the world.

In East Asian cultural contexts, for example, strong direct eye contact in a job interview can signal aggression rather than confidence. A system labeled primarily by Western annotators would score that cultural difference as a "red flag." The AI wouldn't know it was enforcing one culture's norms. It would just produce a score. The score would look authoritative. The applicant would never know why they were rejected.

The Hidden Power of Annotation

Every human judgment in a training label gets amplified across millions of decisions. The person who labeled your training data had more influence on the AI's behavior than almost any engineer β€” and you've almost certainly never heard their name.

Historical Data Is History

There is one more layer to this that changes how you read almost everything about AI. When we train AI systems on historical records, we are training them on a history that contains the full weight of past inequality. This isn't a glitch or an edge case. It is a structural feature of learning from the past.

In 2016, ProPublica β€” an investigative journalism organization β€” published a groundbreaking analysis of a tool called COMPAS (Correctional Offender Management Profiling for Alternative Sanctions). COMPAS was being used in courtrooms across the United States to predict the likelihood that a defendant would re-offend β€” a score called a "recidivism risk score." Judges were using this score to inform decisions about bail, sentencing, and parole.

ProPublica's analysis found that COMPAS was twice as likely to falsely flag Black defendants as "high risk" when they did not actually go on to reoffend β€” and twice as likely to falsely label white defendants as "low risk" when they did reoffend. The company that made COMPAS, Northpointe, disputed the methodology. Academics wrote papers arguing about which statistical measure of "fairness" should apply. Judges kept using the tool.

The data COMPAS learned from was criminal justice records β€” records shaped by decades of unequal policing, unequal prosecution, and unequal sentencing. An AI trained on those records learned those patterns as if they were natural law. Then it applied them in individual courtrooms, one person at a time, under the cover of an authoritative-looking number.

You now understand why some researchers say that "training on historical data" is sometimes just another way of saying "automating historical prejudice." That framing is deliberately provocative β€” but it's precise. And knowing it means you will never hear "data-driven decision" the same way again.

What You Can Now See

When someone tells you an AI system is objective because it uses data, you now know that "data" is not the same as "truth." Data is a record of what was measured, by whom, under which conditions, with which labels attached. The question to ask is always: what did the data leave out?

An Ethical Question Without a Clean Answer

Here is the complication. The COMPAS case β€” and cases like it β€” prompted some researchers to call for banning algorithmic risk scores in criminal justice entirely. Pure human judgment, they argued, is fairer than an opaque algorithm.

But research on human judicial decision-making tells a different story. Multiple studies β€” including a widely-cited 2011 paper in the Proceedings of the National Academy of Sciences β€” found that judges gave significantly harsher rulings right before lunch than right after it. Other studies found that judges with daughters gave more lenient sentences in certain cases than judges with only sons. Human judges are inconsistent, fatigued, and shaped by their own experiences in ways they can't fully control.

So we are left with a real dilemma: an algorithm that inherits and amplifies historical bias, versus human judgment that is inconsistent, emotional, and difficult to appeal. Neither is clean. Anyone who tells you the answer is obvious β€” in either direction β€” hasn't fully understood the question. You have.

Lesson 2 Quiz

Five questions on data, labels, and what "objective" actually means.
1. Microsoft's Tay chatbot was shut down in 2016 after it learned to post racist content on Twitter. What core principle about AI training data does this incident most directly illustrate?
Exactly. Tay is the clearest demonstration that "learn from real data" means "learn from everything in real data" β€” including the parts that reflect humanity at its most destructive. The AI didn't know what it was doing was wrong. It was executing its design.
Reread the Tay section. The engineers weren't malicious and the platform wasn't hacked in the traditional sense. The story illustrates something more fundamental: that data contains everything β€” and a system trained on everything learns everything.
2. Approximately 56% of all web content is in English, yet only about 5% of the world's population speaks English as a first language. What is the most significant consequence of this imbalance for AI language models?
Right. The danger isn't that the AI refuses other languages β€” it's that it quietly applies one cultural frame as the universal default. It doesn't flag this. Users in other cultures get answers shaped by a worldview that isn't theirs, presented with full confidence.
The lesson addresses this directly. The technical issue (language support) is less important than the cultural issue: a system that learned primarily from English-language content will treat the assumptions embedded in that content as simply "how things are."
3. A company builds an AI hiring tool to screen resumes for a customer service role. They label 10,000 past hiring decisions: "hired" or "not hired." The AI learns from these labels and is deployed. A year later, auditors find it consistently rejects qualified candidates from certain zip codes. What is the most likely explanation?
This is the COMPAS pattern in a hiring context. Historical decisions β€” whether in criminal justice or employment β€” carry the weight of past practices. An AI that learns to predict those decisions learns to replicate those practices, even discriminatory ones, with the appearance of algorithmic objectivity.
Think about the COMPAS case. The AI wasn't broken and the issue wasn't accidental inclusion of zip codes. The problem is that historical hiring decisions can encode bias β€” and labeling those decisions as "correct" teaches the AI to treat that bias as the target outcome.
4. ProPublica found that the COMPAS recidivism scoring tool was twice as likely to falsely flag Black defendants as high-risk. Some researchers responded by arguing this was a problem with the statistical definition of "fairness" being used β€” not with the data. Why does this debate matter for how AI is used in real institutions?
This is one of the most important points in AI ethics. "Fairness" is not a single thing β€” there are multiple mathematical definitions that can conflict. A system can be fair by one measure and unfair by another simultaneously. Deciding which measure to use is a values decision, not a technical one.
The lesson points to something more specific. The debate over COMPAS revealed that "fairness" in statistics has multiple incompatible definitions β€” and that choosing between them is a values question, not just a technical one. That's why it matters who makes those choices and whether they make them openly.
5. A researcher argues: "We should replace all human judges with AI sentencing tools, because AI is consistent while human judges are influenced by hunger, fatigue, and personal history." Based on the lesson, what is the strongest counter-argument?
Exactly. Consistency is presented as a virtue, but the lesson shows that a consistent algorithm can consistently replicate a biased pattern across thousands of cases simultaneously. Scale makes the problem worse, not better β€” and the appearance of objectivity makes it harder to challenge.
The lesson ends with exactly this dilemma. The counter-argument isn't "humans are better" β€” it's more specific: consistency is only good if what's being consistently applied is fair. An algorithm that consistently amplifies historical bias at scale may produce worse outcomes than inconsistent human judges, despite looking more objective.

Lab 2: The Dataset Detective

Your role: investigator reviewing a training dataset before it's used in a high-stakes system.

Your Assignment

A school district wants to use an AI system to predict which students are at risk of dropping out, so teachers can intervene early. The system will be trained on five years of attendance records, grades, disciplinary records, and free-lunch eligibility data from the district's 12 schools.

Your lab partner Sam is also reviewing the dataset. Sam thinks this sounds like a good use of AI. Your job is to engage with Sam's optimism β€” seriously and specifically.

Start by responding to Sam's opening position below. Don't just say "there might be bias" β€” identify one specific concern about this particular dataset and explain why it matters for the specific use case.
Sam β€” Dataset Reviewer
Lab Partner
I actually think this dataset is pretty solid. Attendance, grades, disciplinary records β€” that's real behavioral data, not just demographics. The system is just learning patterns that actually predict dropout. What's the problem with that?
See the World Through AI Eyes Β· Lesson 3

When It Gets It Wrong, Who Pays?

Algorithmic errors are not evenly distributed β€” the costs land somewhere specific.
If a machine makes a mistake that ruins someone's life, who is responsible?

In January 2020, a man named Robert Williams was arrested in his driveway in Farmington Hills, Michigan, in front of his wife and daughters. He was handcuffed, driven to a police station, and held overnight for questioning about a watch store robbery that had occurred in 2018. Williams had been at work at the time of the robbery. He had no criminal record. He had never been near the store in question.

The reason he was arrested: a facial recognition algorithm had flagged him as a match for a blurry surveillance video from the robbery. A detective had submitted a still image from the video to the Michigan State Police, which had run it through a facial recognition database. The algorithm returned Williams' name. No human being rigorously verified the match before an arrest warrant was signed. Williams spent nearly 30 hours in detention before investigators realized they had the wrong man. He was the first documented case of a wrongful arrest caused directly by a facial recognition error in the United States β€” though almost certainly not the last.

In 2021, the ACLU filed a lawsuit on Williams' behalf against the Detroit Police Department. Williams later testified before the United States Congress. Detroit's Police Department eventually announced restrictions on how the technology could be used. But those 30 hours β€” the arrest in front of his daughters, the overnight detention, the paperwork that now exists in his file β€” cannot be given back.

How Errors Become Harm

Every AI system makes errors. The engineers who build them know this. When you hear a system described as "96% accurate," what that means is: it is wrong 4% of the time. For some applications, 4% errors are acceptable. If Netflix recommends a movie you don't like, the error cost is low β€” you watch something else. If a system is making decisions about your employment, your loan application, your medical care, or your freedom, a 4% error rate means something entirely different.

In Robert Williams' case, a facial recognition system made an error, a human detective failed to independently verify it, and a warrant was signed. At least three people made decisions along that chain β€” and every one of them could have caught the error. The algorithm is one part of a system that includes human choices, institutional practices, and assumptions about when a machine output is trustworthy enough to act on without verification.

This is called the human-in-the-loop problem: when AI systems are deployed at scale, humans theoretically remain in the decision-making chain, but in practice the speed, volume, and apparent authority of AI outputs causes people to defer to them rather than scrutinize them. The machine says it, so it must be checked out. This is especially true when the humans in the loop are overworked, under-resourced, or operating under institutional pressure to clear cases quickly.

Automation bias: The documented tendency for humans to over-trust automated outputs and reduce their own critical scrutiny when a machine produces a confident-seeming answer, even when verification would be straightforward.
The Distribution of Errors

Robert Williams is Black. All three of the individuals wrongfully arrested in documented U.S. facial recognition cases as of 2023 are Black men. This is not a coincidence. It is the direct consequence of the accuracy gap documented by Joy Buolamwini's research β€” the same gap described in Lesson 1. When a system has higher error rates for darker-skinned faces, and that system is used in contexts involving criminal investigation, the errors concentrate among people who are already disproportionately scrutinized by law enforcement.

This is the pattern that makes some AI failures genuinely different from ordinary technology failures. A bug in a navigation app might add ten minutes to everyone's commute. An accuracy gap in a facial recognition system used in policing concentrates its worst outcomes on communities that already face elevated risk. The word researchers use for this is disparate impact β€” when a policy or tool that doesn't explicitly target a group produces systematically worse outcomes for that group.

In 2021, the National Institute of Standards and Technology (NIST) β€” a U.S. government agency β€” published a massive study of 189 facial recognition algorithms from vendors around the world. Their finding: false positive rates for African American and Asian faces were often 10 to 100 times higher than for Caucasian faces in one-to-many searches. NIST is not a civil rights organization. It is a technical standards body. The accuracy gaps it documented are engineering facts, not political opinions.

The Stakes at an Institutional Level

By 2023, facial recognition was being used by hundreds of law enforcement agencies in the United States, most without public disclosure and without external audit. The errors are not theoretical. They are happening in cases that affect real people's lives right now β€” and the decisions about whether to continue using these tools are being made by city councils, police departments, and federal agencies.

The Invisible Harm

Wrongful arrests are visible. They produce lawsuits, news coverage, congressional testimony. But most AI errors are invisible β€” not because they don't cause harm, but because the people harmed don't know why a decision was made or have no mechanism to challenge it.

In 2018, the Dutch government deployed a fraud detection algorithm called SyRI (System Risk Indication) to identify residents likely to commit welfare fraud. The system analyzed data on income, employment, housing, debts, and tax records, and flagged individuals for investigation. A Dutch court ruled in 2020 that SyRI violated European human rights law because it was being deployed almost exclusively in lower-income neighborhoods, disproportionately targeting people already facing economic precarity. The people flagged had no way to know they were being flagged, no way to see what data led to the flag, and no practical path to appeal.

This is the invisible harm: a government algorithm making decisions about your life, based on data you never consented to share, using criteria you can't see, producing a result you'll never be told about β€” until a government official shows up at your door. The Dutch SyRI case was one of the first successful legal challenges to an automated government profiling system in Europe. It set a precedent that directly influenced the European Union's AI Act, passed in 2024.

You now know something that shapes every conversation about AI governance: the visibility of harm depends on whether the harmed person has the resources, knowledge, and access to legal mechanisms to make the harm visible. Most people don't. That asymmetry is not an accident of the technology β€” it's a consequence of how power works, and AI systems can amplify it.

What You Can Now See

The next time you read about an AI system with an impressive accuracy rate, ask: what happens to the people who fall into the error margin? Where do the errors concentrate? And does the person being decided about have any way to know the decision was made β€” or to challenge it?

An Ethical Question Without a Clean Answer

Robert Williams' case led some advocates to demand that no government agency ever use facial recognition for identification in criminal investigations. Others β€” including some civil liberties advocates β€” argued for regulation rather than prohibition: require verification steps, require disclosure, require external auditing. Others pointed out that without any technological assistance, investigators rely on eyewitness identification, which research consistently shows is even less reliable than most facial recognition systems for many kinds of cases.

Here is the question you should sit with: if a technology causes demonstrable harm to specific communities, but removing it would also cause demonstrable harm β€” including to members of those same communities β€” whose voice should be most important in deciding what to do? The people whose communities are most affected? The engineers who understand the technical limits? The elected officials who set policy? The courts that weigh rights? There is no clean hierarchy here. But the fact that we're even asking the question represents progress. Most societies never asked it about technologies that affected far more people.

Lesson 3 Quiz

Five questions on errors, harm, and who absorbs the cost.
1. Robert Williams was wrongfully arrested in 2020 after a facial recognition algorithm flagged him as a match. Multiple humans β€” detectives, supervisors β€” were involved in the decision chain. What does his case most clearly demonstrate about AI errors in high-stakes systems?
Right. The Williams case is precisely a case study in automation bias. The detective didn't independently verify the match before seeking a warrant β€” the machine said it, so action followed. The lesson isn't "ban AI" or "blame the detective" β€” it's that systems and processes around AI matter as much as the AI itself.
The lesson focuses on a specific mechanism here: automation bias. The humans in the loop didn't fail because they were bad people β€” they deferred to a confident-seeming machine output rather than independently verifying it. That pattern is well-documented and happens across many domains, not just policing.
2. NIST's 2021 study found that facial recognition false positive rates for African American faces were sometimes 10 to 100 times higher than for Caucasian faces. Given what you know about how these systems are built, why does this accuracy gap specifically exist?
This connects lessons 1 and 3 directly. The root cause of the NIST-documented accuracy gap is the training data composition described in Lesson 1 β€” underrepresentation in training datasets leads to worse performance on underrepresented groups. The consequence, in policing contexts, is disparate harm.
This connects directly to Lesson 1's core finding from Joy Buolamwini's research. The accuracy gap isn't about facial features or camera quality β€” it's about whose faces appear frequently enough in training data for the system to learn to recognize them reliably. NIST's study is a government technical finding, not a disputed claim.
3. A city uses an AI algorithm to decide which households receive automatic approval for energy assistance benefits. Auditors later find that households in certain zip codes are being systematically denied even when they meet all the published criteria. Applying the concept of "disparate impact" from this lesson, what should the auditors investigate first?
Disparate impact is precisely about this: outcomes that fall unequally on specific groups even when no explicit targeting was programmed. The investigation needs to trace whether seemingly neutral variables β€” like zip code or data patterns associated with zip codes β€” are proxies for demographic characteristics that correlate with unequal outcomes.
Disparate impact doesn't require deliberate discrimination. The Dutch SyRI case is a perfect model here β€” the system wasn't programmed to target poor people, but it was deployed in ways that concentrated its scrutiny on them. The auditors need to look at whether neutral-seeming criteria produce demographically unequal outcomes.
4. The Dutch court ruled against the SyRI welfare fraud algorithm in 2020, citing human rights law. The European Union's AI Act was passed in 2024, influenced partly by that ruling. What does this sequence tell us about how AI governance actually develops?
This is the institutional-level pattern. Individual cases β€” one lawsuit, one court ruling β€” become the scaffolding for broader policy. Robert Williams' testimony to Congress and the Dutch court's SyRI ruling both follow this pattern: a specific harm becomes visible, legal mechanisms engage, precedent is set, and policy eventually reflects it.
The lesson describes this sequence specifically. The SyRI ruling β†’ EU AI Act connection is an example of how AI governance actually develops in practice: through the legal visibility of specific harms, not through proactive design. Individual cases have outsized institutional consequences when they get to court.
5. A company argues: "Our content moderation AI makes mistakes, but so do human moderators β€” and our AI is faster and more consistent." What is the most important thing this argument leaves out?
This is the core lesson applied to a new domain. "Faster and more consistent" doesn't tell you who absorbs the errors. If the AI consistently fails to moderate hate speech targeting a specific language community, or consistently over-removes content from certain political perspectives, the consistency is a problem, not a virtue. And if affected users have no real appeals path, the harm is invisible.
Apply the lesson's framework. The argument defends speed and consistency β€” but the lesson showed us that those qualities don't address the most important questions: where do errors land? Are they evenly distributed? Can harmed users challenge decisions? Those gaps are what the company's framing conveniently skips.

Lab 3: The Harm Mapper

Your role: policy advisor assessing a proposed AI system for a government agency.

Your Assignment

A state government wants to deploy an AI system that scores landlord-tenant dispute claims and automatically routes high-scoring complaints to priority review. Low-scoring complaints get a form letter and are closed. The system was trained on five years of past dispute resolutions.

Your lab partner Jordan is a government efficiency advocate who thinks this is a good idea. Engage with Jordan's position β€” and take a stance. Don't just raise abstract concerns.

Tell Jordan: Would you recommend approving this system, rejecting it, or approving it with specific conditions? Give a concrete reason for your position β€” not just a list of risks.
Jordan β€” Policy Advisor
Lab Partner
I've reviewed the proposal. The system processes 40,000 complaints a year and we have six human reviewers. Without automation, the average wait time is eight months. This system would cut that to three weeks for priority cases. Tenants are being evicted while they wait. What's your call?
See the World Through AI Eyes Β· Lesson 4

How to Push Back

Reading AI stories critically, asking the questions that matter, and knowing what "accountability" actually requires.
When an AI system affects your life, what would you actually need to know to challenge it?

In the spring of 2021, over 1,500 students at Robert Gordon University in Aberdeen, Scotland received algorithmically-generated exam grades instead of sitting actual exams β€” a consequence of COVID-19 pandemic policies. A separate but related situation had unfolded in England in 2020, when the UK government used an algorithm called the A-Level Standardisation Model to calculate grades for 700,000 students whose exams had been cancelled. The algorithm downgraded nearly 40% of teacher-predicted grades. Students from private schools β€” which historically perform well β€” saw fewer downgrades. Students from state schools in disadvantaged areas were downgraded at significantly higher rates.

The backlash was immediate. Students organized. Researchers published rapid analyses. Newspapers ran the data. Within nine days, the UK government reversed course entirely: Education Secretary Gavin Williamson announced on August 17, 2020 that all algorithmically-determined grades would be replaced with teacher predictions. The algorithm was scrapped. The reversal happened that fast because enough people understood what was happening quickly enough to make it politically impossible to maintain.

What made that reversal possible wasn't just anger. It was specific, informed, public pressure β€” researchers who could explain what the algorithm was doing and why it was unfair, journalists who could translate that for general audiences, and students who knew enough to recognize that what had happened to them wasn't inevitable. That combination β€” technical literacy plus public voice plus institutional pressure β€” is what AI accountability actually looks like when it works.

The Five Questions

Every time an AI system makes a decision that affects someone's life, there are five questions that, if answered honestly, reveal whether the system is operating appropriately. You can apply these to any AI story you read, any system that affects you directly, or any proposal a government or school or company puts in front of you.

1. What is it actually optimizing for? AI systems are designed to maximize something β€” a score, an accuracy rate, an efficiency metric, a profit measure. The thing it optimizes for is not always the thing it's described as doing. A college admissions algorithm described as "identifying talented students" might be optimizing for tuition revenue, retention rates, or alumni donation patterns. The optimization target shapes everything.

2. Who was in the room when the goals were defined? The lesson you've just completed covers this in depth. Who defines success? Whose needs were considered in the design phase? Which communities were consulted, and which weren't? These questions often reveal more about likely failure modes than any technical audit.

3. What data was it trained on, and what does that data leave out? Training data is never a complete picture of reality. It is a snapshot of what was recorded, by whom, in which contexts. Understanding the gaps in the data reveals what the system will fail to handle β€” and who will absorb those failures.

4. Where do the errors go? Average accuracy hides error distribution. Ask specifically: who is in the tail of the error distribution? Which groups are more likely to be incorrectly flagged, denied, or misclassified? Is the error rate consistent across demographic groups?

5. What is the recourse path? When the system gets it wrong β€” and it will β€” can the affected person find out? Can they appeal? To whom? At what cost? Systems with no real recourse path are not accountable systems, regardless of what their marketing materials say.

These Are Not Hypothetical Questions

All five of these questions were asked about the UK A-Level algorithm in 2020. The answers β€” published within days by researchers β€” made the reversal possible. The algorithm couldn't survive public examination of its actual optimization target, its data, and where its errors landed.

Reading AI Headlines Without Being Fooled

AI generates an enormous amount of media coverage, and most of it falls into predictable patterns of either uncritical enthusiasm or unfocused alarm. Learning to read past both of those patterns is one of the most practical skills this course can give you.

In January 2023, Google announced a new AI medical imaging system called MAIA that could detect tuberculosis in chest X-rays with accuracy exceeding many radiologists in one specific study. Headlines read: "Google AI Surpasses Doctors." The study was real. But the study also involved a specific patient population in South Africa, a specific type of X-ray equipment, and a specific definition of "accuracy" that measured sensitivity (catching real cases) rather than specificity (avoiding false positives). A radiologist reading the same headline would immediately ask: how does it perform on patients with other concurrent conditions? What are its false positive rates? Was it tested on the equipment actually available in low-resource clinics?

These aren't nitpicky questions. They are the difference between a research result and a deployable medical tool. The headline collapses that difference. Most readers don't have the framework to rebuild it. You now do.

Here are the specific warning phrases that should trigger your five questions:

"Our system is X% accurate" β€” accurate on what population, by what measure, compared to what baseline?

"AI outperforms humans" β€” in what specific task, under what conditions, measured how, with what error distribution?

"Objective, data-driven decision-making" β€” what data, collected by whom, with what gaps, optimizing for what goal?

"We take privacy seriously" β€” what specific data is collected, retained for how long, shared with whom, with what user controls?

What Accountability Actually Requires

The word "accountability" appears constantly in discussions of AI policy. It is used so broadly that it has almost stopped meaning anything. Here is what it actually requires in practice, based on the cases covered in this course:

Transparency means more than publishing a press release. It means making the optimization targets, training data composition, performance metrics by demographic group, and error rates accessible to people outside the company β€” including people who might be harmed. In 2022, the state of Illinois passed the Artificial Intelligence Video Interview Act, which requires employers who use AI to screen video interviews to tell candidates that AI is being used and to explain what it assesses. That is a meaningful transparency requirement. A company blog post describing its AI as "fair and robust" is not.

Contestability means that affected individuals can actually challenge a decision. Not theoretically challenge it by filing a lawsuit they can't afford β€” actually challenge it through a real process with real consequences. The Dutch court that ruled against SyRI established that affected people have a legal right to understand why an automated system flagged them. That is contestability. A form email saying "this decision was made by our system" is not.

External audit means that people with no financial interest in the system can examine it, test it, and publish findings. Joy Buolamwini's "Gender Shades" research was an external audit. ProPublica's COMPAS analysis was an external audit. Both produced findings that companies disputed β€” but the findings entered the public record and could not be erased. Systems that resist external audit are systems whose operators know they cannot survive examination.

The Minimum Standard

A truly accountable AI system is one where: affected people know it's being used; they can see the general criteria; they have a real path to challenge wrong decisions; and independent external parties can examine and publish findings about it. Most systems deployed today meet none of these four criteria. Knowing that changes what you should accept when a company, a school, or a government uses "AI" to make decisions about you.

An Ethical Question Without a Clean Answer β€” and What to Do With It

In 2022, the city of New York began requiring employers with 15 or more workers to use "automated employment decision tools" only if those tools had undergone a "bias audit" β€” and to tell candidates when such a tool is being used. It was the most comprehensive local AI employment regulation in the United States at the time.

Critics pointed out that the auditing requirements were written broadly enough that companies could conduct their own audits or pay audit firms they selected β€” raising obvious conflicts of interest. Advocates said that even an imperfect audit requirement was better than none, because it created a legal hook for challenging systems that produced discriminatory outcomes. Both sides were right. The New York law was a genuine step forward and a genuine compromise that left significant gaps.

This is what progress in AI governance looks like up close: imperfect, contested, partial. A regulation that falls short of what researchers want but moves the baseline forward. A ruling that sets precedent but doesn't solve everything. A company policy change that addresses the visible problem while leaving the structural one intact.

The question for you is not: "is this enough?" Nothing will ever be enough quickly enough. The question is: do you understand the difference between a system that is genuinely more accountable and one that is performing accountability while changing nothing? Because if you do, you can participate in the conversations that shape these decisions β€” and you can recognize when someone is trying to convince you that window dressing is reform. That is a skill that will matter every year of your life from here forward.

Lesson 4 Quiz

Five questions on accountability, critical reading, and pushing back effectively.
1. The UK A-Level algorithm was reversed within nine days in August 2020. What combination of factors made that speed possible?
Exactly. The lesson is explicit about this: the reversal was made possible by technical literacy plus public voice plus institutional pressure β€” not any single actor. This is the model for how AI accountability works when it succeeds.
Reread the opening scene carefully. No single actor caused the reversal β€” it was a combination of researchers who could articulate what the algorithm was doing, journalists who could communicate it broadly, and students who recognized that what happened to them wasn't inevitable. That combination was the mechanism.
2. A company releases an AI system for diagnosing skin conditions from photos. Their press release states: "Our AI achieves 94% accuracy, outperforming dermatologists in controlled tests." Applying the lesson's framework, what is the most important thing this statement leaves out?
Applying the lesson directly: an aggregate accuracy number hides error distribution. A skin condition AI trained primarily on lighter-skinned patients β€” which is a documented problem with most dermatology datasets as of 2023 β€” will perform worse for darker-skinned patients, who are also the patients most likely to have conditions that go underdiagnosed by human dermatologists. The accuracy gap compounds existing healthcare inequality.
Think about the five questions from the lesson. An accuracy claim tells you the average β€” not where the errors land. For a medical system, the critical question is whether accuracy is consistent across the demographic groups most likely to use it and most likely to be harmed if it gets it wrong.
3. A school district deploys an AI tutoring system and describes it as "accountable" because it publishes an annual report on student outcomes. Using the lesson's definition of accountability, why does this fall short?
Right. The lesson defines accountability as requiring three things: transparency (which the report partially addresses), contestability (can affected students challenge decisions?), and external audit (can independent parties examine the system?). Publishing a self-produced report satisfies the first requirement minimally and the other two not at all.
The lesson draws a specific distinction between performing accountability and being accountable. Publishing a report is a gesture toward transparency β€” but the lesson identifies three requirements: transparency, contestability, and external audit. A self-published annual report satisfies only one, and only partially.
4. New York City's 2022 law requiring bias audits for AI hiring tools was criticized because companies could hire their own auditors. What broader principle does this limitation illustrate?
This is a sophisticated point the lesson makes explicitly. The New York law was a real step forward and a real compromise. The gap it left β€” companies selecting their own auditors β€” is an example of accountability theater: the structural conditions that created the problem (no external scrutiny) are partially preserved within the framework designed to address them. Knowing this doesn't mean the law was worthless β€” it means you understand where the next fight is.
The lesson points directly at this. The New York law was both a genuine step forward and a genuine compromise. The specific gap β€” self-selected auditors β€” illustrates a general principle: accountability mechanisms can be designed to look robust while preserving the conditions that make them toothless. Understanding that gap is what lets you push for stronger versions.
5. You're reading a news article that says: "A new AI system will replace the current standardized testing process in schools, making admissions 'more objective and fair.'" Using everything from this module, what is the single most important question you would want answered before accepting that claim?
This synthesizes the entire module. "Objective and fair" are claims that require evidence across four dimensions: what the system optimizes for, whose data it learned from, where its errors go, and whether affected students have real recourse. Without answers to those questions, "objective and fair" is marketing, not a description. You can now see through it.
Apply the module's full framework. The word "objective" should always trigger: what is it actually optimizing for? The word "fair" should always trigger: fair by which measure, verified how, with error rates examined for which groups? Those are not hostile questions β€” they're the minimum standard for a claim of this magnitude in an educational context.

Lab 4: The Accountability Critic

Your role: independent critic reviewing an AI company's accountability report.

Your Assignment

A company has published what they call an "AI Ethics and Accountability Report." It includes sections on: model accuracy statistics, a diversity statement about their engineering team, a list of use cases they have chosen not to pursue, and a sentence saying "users can contact our support team with any concerns."

Your lab partner Casey works in communications and thinks this report is a solid example of corporate accountability in action. Your job is to evaluate it seriously β€” using the lesson's specific definitions β€” and engage with Casey's defense of it.

Start by telling Casey: Using the three-part definition from the lesson (transparency, contestability, external audit), how does this report actually score? Be specific β€” don't just say it's bad or good in general.
Casey β€” Communications Advisor
Lab Partner
Look, they published accuracy stats, they have a diversity statement, they listed things they won't do, and they gave users a way to contact them. That's more than most companies do. What exactly is your objection? Be specific β€” I need to know what to tell the leadership team.

Module Test

15 questions across all four lessons. Score 80% or higher to pass.
1. In 2015, Jacky AlcinΓ© discovered that Google Photos had labeled photos of him and his friend using a deeply offensive animal category. According to a 2018 Wired investigation, what did Google actually do to "fix" this?
Correct. The "fix" was concealment β€” removing the categories so the failure couldn't be triggered again β€” not a genuine improvement to the system's ability to recognize faces accurately.
According to the 2018 Wired investigation, Google removed the label categories ("gorilla," "chimp," "monkey") from the app entirely rather than fixing the underlying recognition problem. The failure was hidden, not solved.
2. Joy Buolamwini's 2018 "Gender Shades" study found error rates of up to 35% for darker-skinned women in facial recognition systems from IBM, Microsoft, and Amazon. The most direct cause was which of the following?
Training data composition is the root cause. The system learned what it was given β€” and what it was given was predominantly light-skinned male faces, producing reliable performance for that group and poor performance for everyone else.
The cause was training data composition β€” the systems were built from datasets that underrepresented darker-skinned faces, so the models never developed reliable performance for those individuals.
3. Amazon's AI hiring tool, developed from 2014 onward, downgraded resumes containing the word "women's." This happened because the system was trained on historical hiring data. What does this case most clearly demonstrate?
The Amazon case is the clearest example of how historical data carries historical bias forward. The system wasn't programmed to discriminate β€” it learned to discriminate by learning which patterns had led to hires in the past, in a company that had historically hired mostly men.
Amazon's engineers didn't program the discrimination explicitly β€” the system learned it from ten years of hiring data that reflected a historically male-dominated hiring pattern. The AI treated that historical pattern as the correct target to optimize for.
4. Microsoft's Tay chatbot was shut down in 2016 after sixteen hours. The key principle it illustrated about training data is:
Tay demonstrates that "learning from data" means learning from everything in the data β€” including coordinated bad-faith inputs. The system had no moral reasoning; it had pattern matching. That distinction is central to understanding what AI systems can and cannot do.
The key insight is more fundamental than platform choice. Tay illustrates that an AI learning from real human behavior inherits that behavior's worst aspects because it has no moral framework to evaluate what it's learning β€” only pattern recognition.
5. A large language model trained primarily on English-language web content is used globally. What is the most significant risk that most users will never be aware of?
Representation in training data shapes what the system treats as normal, default, or true. A model trained predominantly on English content will treat English-speaking cultural frames as universal β€” silently, and with the same confidence it applies to everything else.
The technical issue (language support) is less important than the cultural framing issue. A model trained on predominantly English content will treat English-speaking cultural assumptions as the universal default β€” without any signal to users from other cultural contexts that this is happening.
6. ProPublica's 2016 analysis of the COMPAS recidivism scoring tool found that it was twice as likely to falsely flag Black defendants as high-risk. The company disputed the findings, pointing to a different statistical definition of fairness. What does this debate reveal?
This is a crucial point about AI ethics. Different statistical fairness criteria are genuinely incompatible β€” optimizing for one can make another worse. That means the choice of fairness metric is a values decision with real-world consequences, not a technical detail that experts can resolve objectively.
The debate between ProPublica and Northpointe illustrated a genuine mathematical problem: different definitions of fairness can conflict. Choosing which definition to apply is a values decision, not a technical one. That decision should be made openly and with input from affected communities.
7. Robert Williams was wrongfully arrested in January 2020 after a facial recognition algorithm flagged him as a match. The concept from the lesson that best explains why human reviewers failed to catch the error is:
Automation bias is the specific mechanism: humans in the decision chain deferred to the algorithm's confident output rather than independently verifying it. This isn't unique to this case β€” it's a documented pattern across many domains where AI systems are integrated into human workflows.
The concept the lesson introduces for this case is automation bias β€” the well-documented tendency for people to defer to machine outputs rather than scrutinize them independently. This is distinct from deliberate malice and is specifically what allowed an unverified algorithmic match to become an arrest warrant.
8. NIST's 2021 study of 189 facial recognition algorithms found false positive rates for African American faces were sometimes 10 to 100 times higher than for Caucasian faces. NIST is a U.S. government technical standards agency, not a civil rights organization. Why is that detail significant?
The source matters in public debates. NIST's neutrality and technical authority makes its findings substantially harder to dismiss as advocacy β€” the accuracy gaps are engineering measurements, not political opinions. This is relevant to how accountability arguments are made in policy contexts.
The lesson notes this specifically. NIST's findings carry more weight in policy debates because it is a technical standards body without an advocacy position β€” meaning the accuracy gaps it documented are harder to dismiss as politically motivated. They are engineering measurements.
9. The Dutch government's SyRI welfare fraud algorithm was ruled unlawful in 2020 partly because it was deployed primarily in lower-income neighborhoods, targeting people who could not see the data used against them and had no practical path to appeal. This is an example of:
SyRI illustrates both concepts simultaneously: disparate impact (a neutral-seeming system concentrating its effects on a specific group) and invisible harm (affected people couldn't see the data used against them or challenge the results). Both elements were part of the court's ruling.
SyRI is primarily an example of disparate impact (concentrated effect on lower-income communities) combined with invisible harm (no transparency about the data or criteria, no real appeals path). The Dutch court ruled on both elements.
10. The UK A-Level standardisation algorithm was reversed in August 2020 after nine days of public pressure. What element was most essential to making that reversal happen quickly?
The lesson is explicit: this is the model for successful AI accountability β€” technical literacy (researchers), communication (journalists), and informed affected parties (students) combining to create pressure that was specific enough to be politically unsustainable for the government to resist.
The lesson identifies this specifically. The reversal was fast because it combined technical expertise, journalistic communication, and informed affected students β€” not any single factor. That combination made the criticism specific and credible rather than just emotional.
11. An AI system for loan applications is described as "objective because it uses only financial data β€” no race, no gender, nothing protected." Why might this claim be misleading?
This is the proxy variable problem β€” a real and documented issue in financial AI. Variables that seem neutral can be highly correlated with protected characteristics because of historical patterns of inequality. Using "objective" financial data doesn't eliminate discriminatory outcomes if that data encodes the history of unequal access.
Neutral-seeming variables can be proxies for protected characteristics. Zip code correlates strongly with race due to historical segregation. Employment history and credit history reflect historical patterns of unequal economic access. An algorithm that uses these "objective" variables can produce racially disparate outcomes without ever explicitly using race.
12. The lesson defines "contestability" as one of the three requirements for genuine AI accountability. Which of the following is an example of real contestability?
Contestability requires that affected individuals have a real, accessible path to challenge a decision β€” not a theoretical one. The Dutch SyRI ruling established that affected people have a legal right to understand the basis for automated decisions. That is contestability. An informational email or a general policy statement is not.
Contestability, as defined in the lesson, means affected individuals can actually challenge a decision through a process with real consequences. Emails, web pages, and terms of service don't provide that. The Dutch SyRI ruling β€” which established a legal right to understand and challenge automated decisions β€” is the model.
13. New York City's 2022 AI hiring law required bias audits but allowed companies to hire their own auditors. According to the lesson, this limitation illustrates which broader pattern?
The lesson calls this "accountability theater" β€” and the New York law is a real-world example. The law was a genuine step forward and a genuine compromise. The self-auditor gap shows how accountability requirements can be structured in ways that address the symptom while preserving the structural problem.
The lesson uses the New York law as an example of how accountability mechanisms can be designed to look substantive while preserving conditions that allow harm to continue. That's not a statement that cities can't regulate AI or that audits are useless β€” it's a specific observation about how the design of oversight mechanisms matters.
14. A government announces it will use an AI system to allocate disaster relief funds, claiming the system is "efficient and unbiased." Using the five questions from Lesson 4, what is the most important question to ask first?
The first of the five questions β€” "what is it actually optimizing for?" β€” is the right starting point. In disaster relief, different definitions of "need" produce very different outcomes. An optimization target built around prior tax records would favor wealthier applicants. One built around documented property damage might exclude renters. The optimization target shapes everything.
The lesson's five questions begin with: what is it actually optimizing for? In disaster relief, "need" can be defined in multiple ways β€” and the definition used in the optimization target will determine who gets aid and who doesn't. That question comes before everything else.
15. Across all four lessons of this module, the most consistent pattern in AI failures has been which of the following?
This is the through-line of the entire module. Google Photos, Amazon hiring, COMPAS, SyRI, Robert Williams' arrest β€” in every case, the failure reflects a human choice that was made without sufficient consideration of who would be affected, and the cost fell on the people least equipped to challenge it. That pattern is not accidental, and recognizing it is the lens this course was designed to give you.
Across every case in this module β€” Google Photos, Amazon, COMPAS, Tay, SyRI, Williams, the UK A-Levels β€” the pattern is consistent: failures reflect the assumptions, blind spots, and choices of builders, and the costs land hardest on people who had the least input into those choices. That is the core lesson, and it applies to every AI system you will encounter.