L1
ยท
Quiz
ยท
Lab
L2
ยท
Quiz
ยท
Lab
L3
ยท
Quiz
ยท
Lab
L4
ยท
Quiz
ยท
Lab
Module Test
Module 8 ยท Lesson 1

Read Before You Share

How one screenshot became a million shares โ€” and what that has to do with you
If AI can generate a convincing lie in seconds, what's the one thing a human can still do that AI cannot?

On March 22, 2023, a photograph appeared on Twitter showing a thick cloud of white smoke billowing near the Pentagon in Arlington, Virginia. The image looked like a news photo โ€” sharp, realistic, timestamped. Within minutes, accounts with hundreds of thousands of followers shared it. The phrase "explosion near Pentagon" began trending. The stock market briefly dipped.

There was no explosion. The image had been created with an AI image generator and posted by a single account. It took about eighteen minutes for journalists and fact-checkers to confirm it was fake. By then, the image had been seen by millions of people. The account that posted it had already been suspended โ€” but the image kept spreading, copied and re-uploaded from dozens of other accounts.

The people who shared it were not stupid. Many of them were adults who had been online for years. They shared it because it looked real, it felt urgent, and they moved faster than they thought. That gap โ€” between seeing something and checking something โ€” is exactly where AI-generated misinformation lives.

Why "Looks Real" Isn't Enough Anymore

Before about 2022, if you saw a photograph of something dramatic, you could ask yourself a reasonable question: who took this photo, and how did it get here? A real photograph required a real camera, a real person, and a real moment. That wasn't a perfect test โ€” photos could be edited โ€” but it was a meaningful one.

That test no longer works. AI image generators like Midjourney, DALL-E, and Stable Diffusion can now produce photorealistic images of events that never happened, in seconds, for free. AI text generators can write convincing fake news articles, fake quotes from real politicians, and fake scientific studies. The visual or textual quality of a piece of content tells you almost nothing about whether it's real.

This doesn't mean everything is fake. Most things you encounter online are still genuine. But it means that authenticity can no longer be read from surface appearance alone. You need a different set of tools.

Here's the good news: those tools exist, they're free, and you can use them right now. The people who didn't check the Pentagon image before sharing it weren't missing some secret knowledge. They were just moving too fast. Slowing down is itself a skill โ€” and it's one of the most powerful things you can do in the age of AI.

The Three-Second Pause

Researchers at the MIT Sloan School of Management published a study in 2021 that found something surprising: simply asking people to think about whether a headline was accurate โ€” before they saw a share button โ€” made them significantly less likely to share misinformation. They didn't need a class in media literacy. They didn't need to be experts. They just needed a moment of deliberate thought.

The habit that protects you most is almost embarrassingly simple: pause before sharing, forwarding, or repeating anything that feels emotionally urgent. The urgency itself is a signal. Real news organizations don't disappear in three minutes if you take thirty seconds to check. But AI-generated misinformation is specifically designed to make you feel like you need to act right now, before you think.

When something makes you feel โ€” angry, scared, triumphant, outraged โ€” that emotion is useful information. It's telling you that the content has been engineered to provoke you. That's when you pause hardest.

The Pause Protocol

1. Does this feel urgent or shocking? That's a flag, not a reason to act. 2. Can you find this story from two independent sources you recognize? 3. For images: right-click and do a reverse image search (Google Images or TinEye). Does the image appear elsewhere, and in what context? 4. Who originally posted this โ€” a named journalist, an anonymous account, a newly created profile? 5. If you can't answer these in under a minute, don't share it yet.

Reverse Image Search: The Tool Most People Don't Use

A reverse image search takes a photo you've found online and searches for other places that same image appears. It's built into Google Images โ€” you drag a photo into the search bar, or right-click and select "Search image." TinEye is a dedicated tool that does the same thing.

The Pentagon image from March 2023 failed a reverse image search immediately. The file had no prior appearances online, which is suspicious for a news photo (real news photos get picked up and republished quickly). Its metadata โ€” the invisible data embedded in image files recording when and where they were taken โ€” was stripped, another red flag. And a Google News search for "Pentagon explosion" returned nothing from any major outlet.

You now know a technique that most adults sharing that image didn't think to use. That matters. You can catch things that people with decades of internet experience miss โ€” not because you're smarter, but because you know to look.

Identity Moment

Knowing about reverse image search and the pause protocol puts you ahead of the average social media user โ€” including most adults. That's not an exaggeration. Most people were never taught this. You now have a working toolkit for the AI misinformation era. Use it, and you become harder to manipulate than the majority of people online.

The Ethical Question You Can't Escape

Here's a harder version of this problem. Suppose you do the reverse image search. Suppose you confirm that a viral image is AI-generated โ€” it's fake. But it's making a point you happen to agree with. Maybe it's criticizing a politician you dislike, or illustrating a real problem that actually exists. The underlying message might even be true, even if this specific image is fabricated.

Do you share it anyway? Do you share it with a note explaining it's AI-generated? Do you stay quiet?

There's no clean answer to this. Communicators have wrestled for centuries with whether a false illustration of a true idea is permissible. What's new is that AI has made that question a daily decision for millions of ordinary people. You're going to face this. Knowing the question exists in advance puts you in a better position to think through it when it arrives.

Lesson 1 Quiz

Read Before You Share โ€” 5 questions
1. In March 2023, the fake Pentagon explosion image spread because people shared it before checking. What was the single biggest reason this worked so easily?
Correct. The combination of emotional urgency and photorealistic quality is exactly what AI-generated misinformation exploits. The pause protocol exists to interrupt that reflex.
Not quite. The image spread because real people felt urgency and didn't pause to check โ€” not because of bots, a Pentagon statement, or missing tools.
2. You find a shocking video of a famous athlete supposedly admitting to cheating. The video looks real. What's the FIRST thing you should do before sharing it?
Correct. Independent sources that cover real news would have the story if it were genuine. Checking like-counts tells you about popularity, not truth. An AI chatbot can't verify current video authenticity.
Popularity and speed don't indicate truth. The right first step is finding independent reporting โ€” not engagement metrics or asking another AI.
3. What does a reverse image search actually tell you?
Correct. A reverse image search surfaces where an image has appeared before. An image with no prior appearances on a major-news day is suspicious. An image from a 2019 story being sold as 2024 news is a misuse โ€” both are things you can catch this way.
Reverse image search doesn't tell you who took it, guarantee AI detection, or reveal personal identity. It shows you where the image has appeared online, which is the useful data.
4. The MIT Sloan study found that misinformation sharing dropped when people were asked to think about accuracy before seeing the share button. What does this suggest about the real cause of most misinformation spreading?
Correct. The finding is remarkable: just prompting people to think about accuracy was enough to significantly reduce misinformation sharing. The barrier isn't knowledge โ€” it's attention. A simple habit change has a real effect.
The study actually found that a simple prompt to think about accuracy was enough. That tells you the issue is mostly about speed and habit, not bad intent or need for expert training.
5. You confirm a viral image is AI-generated and fake โ€” but it illustrates a real problem you care about. Your friend says "Who cares if the image is fake, the issue is real." How would you push back on that reasoning?
Correct. This is a critical reasoning point. When false evidence supports a real argument, opponents can attack the evidence and make the whole argument look weak. Real causes are better served by real evidence โ€” fabricated support tends to backfire.
The intent behind fabrication and the reality of the underlying issue don't change the effect: false evidence weakens real arguments by giving critics an easy target. Fabricated evidence can hurt the cause it's meant to help.

Lab 1: The Verification Desk

You're a fact-checker. Your partner is a skeptic who pushes back.

Your Role: Junior Fact-Checker

You've just received three items that are going viral. Your lab partner โ€” another fact-checker with a different perspective โ€” will challenge your reasoning. Don't just recall what the lesson said: defend your actual position with specific reasoning.

This lab is complete after 3 exchanges. Your partner will not agree with everything you say โ€” that's the point.

Start here: Tell me which check you'd run FIRST on a viral image you've never seen before, and exactly why that check comes before the others.
Verification Desk
Peer Lab
You're at the verification desk with me. I've seen people run reverse image searches first, and I've seen people go straight to Google News. Both have arguments. Tell me which check you'd run first on a viral image you've never seen before โ€” and make a specific case for why that order matters. I'll push back.
Module 8 ยท Lesson 2

Ask Better Questions of AI

How a New York lawyer's court filing revealed what happens when you trust AI without checking it
If an AI tool confidently gives you wrong information, is that the AI's fault โ€” or yours?

In May 2023, a personal injury lawsuit was filed in a New York federal court. The plaintiff's attorney, Steven Schwartz, submitted a legal brief citing six court cases as precedents โ€” real cases from real courts, supporting his client's argument. The opposing attorneys went to look them up. None of the six cases existed. Not a single one.

Schwartz had used ChatGPT to research the brief. ChatGPT had confidently generated the names of cases, the courts that supposedly decided them, and detailed summaries of their rulings โ€” all fabricated. When confronted, Schwartz told the judge he had asked ChatGPT whether the cases were real and it had assured him they were. The judge fined Schwartz and his firm five thousand dollars and ordered them to notify the real judges whose names had been attached to fictional rulings.

This wasn't a story about an evil AI. ChatGPT wasn't trying to deceive anyone. It was doing what it does: generating text that looks like the kind of answer you'd expect. The problem was that Schwartz treated that output as verified fact instead of as a starting point that needed checking.

What "Hallucination" Actually Means

AI researchers use the word hallucination to describe when a language model generates text that sounds confident and correct but is factually wrong or completely made up. The word is a little misleading โ€” it makes it sound like the AI is having some kind of breakdown. It's not. It's working exactly as designed.

Hallucination When an AI language model generates text that is stated confidently but is factually incorrect or entirely invented โ€” not from malice or error, but because the model is optimized to produce plausible-sounding text, not verified truth.

Language models like ChatGPT, Claude, and Gemini are trained to predict what words should come next in a sequence. They're extraordinarily good at this. The result is text that sounds authoritative and coherent. But "sounds authoritative" is not the same as "is accurate." The model has no way to know the difference between a real court case and a plausible-sounding fictional one โ€” it's producing text based on patterns, not consulting a database of verified facts.

Hallucinations are most dangerous in high-stakes domains: law, medicine, science, history, and any situation where being confidently wrong has real consequences. They are least risky when you treat AI output as a rough draft or a brainstorm โ€” something that helped you get started but that you'll verify and refine.

How to Use AI Without Getting Burned

The Schwartz case is famous, but versions of it happen every day in schools, offices, and newsrooms. Someone uses an AI tool to quickly look something up, trusts the answer, and passes it along โ€” and the wrong information spreads because it had an authoritative voice behind it.

Here's the practical framework:

The Verification Hierarchy

Use AI for: Brainstorming ideas, drafting text you'll edit, explaining concepts in plain language, generating questions to research, summarizing long documents (with caution). Always verify before trusting: Specific facts, statistics, dates, names, quotes, citations, legal or medical claims. Never rely on AI alone for: Anything where being wrong has serious consequences โ€” legal filings, medical decisions, financial advice, academic citations submitted as final work.

There's also a skill in how you ask. When you ask an AI "What are the legal precedents for X?" you'll get a confident list. When you ask "What should I know about searching for legal precedents on X, and what are the risks of relying on AI for this?" you'll get a much more honest and useful answer. The model's output changes dramatically based on how much you invite it to reflect on its own limitations.

Try asking the AI: "How confident are you in this answer, and what would I need to check to verify it?" Good AI systems will often tell you exactly what to double-check. That's not a sign of weakness โ€” it's the most useful thing the tool can do for you.

The Responsibility Question

After the case became public, a lot of people blamed ChatGPT. But Judge P. Kevin Castel didn't fine OpenAI. He fined the lawyers. That's because the lawyers were the ones who signed the legal brief โ€” they were the ones who vouched, professionally and legally, for its accuracy.

This principle is going to follow AI into every domain. A student who submits an AI-generated essay with a fabricated source gets in trouble โ€” not the AI. A journalist who publishes a story with an AI-hallucinated quote faces the consequences โ€” not the model. The human who decides to trust, use, and transmit AI output is the one who carries the responsibility.

That might feel unfair. Why should you be responsible for what a tool does? But think about it this way: if you drove a car without checking whether the brakes worked, and the brakes failed, you'd still be responsible for the crash. Using a powerful tool you don't fully understand doesn't transfer your responsibility to the tool. It raises the stakes of learning how it actually works.

What You Can Now See

Most people who use AI tools treat them like a Google search: ask a question, get an answer, move on. You now know why that's a category error. Language models are not search engines โ€” they're text generators that produce plausible output. That distinction is invisible to most users and visible to you. It changes how you read every AI-assisted piece of writing you encounter, including this one.

Here's the ethical question that has no clean answer: AI systems are getting more accurate over time. At some point, they will be right more often than many human experts in some domains. When that happens, should the legal standard for "due diligence" change? If an AI is right 98% of the time and a human expert is right 90% of the time, is it actually responsible to prefer the human? Or does the 2% error rate in a high-stakes context still require a human check?

No one has answered this yet. It's being debated in law schools and ethics departments right now. You just got to the same question on your own.

Lesson 2 Quiz

Ask Better Questions of AI โ€” 5 questions
1. Attorney Steven Schwartz was fined for submitting fake case citations that ChatGPT generated. What was the core mistake he made?
Correct. The root mistake was treating confident-sounding AI output as equivalent to verified research. ChatGPT produced plausible text โ€” Schwartz's job was to verify it before staking his professional reputation on it.
The core failure was not checking the AI's output before using it. ChatGPT was doing what it always does โ€” producing plausible text. No deception, no special unreliability. Just unverified trust.
2. AI "hallucination" happens because language models are designed to produce plausible-sounding text. What does this mean for how you should use them?
Correct. AI is genuinely useful for drafting, brainstorming, and explaining โ€” but specific factual claims need an independent check. The tool's strength and its risk live right next to each other.
Avoiding AI entirely, or only using it when stakes are zero, throws away real value. The right approach is knowing which outputs need verification โ€” specific facts, not general concepts or drafts.
3. A student uses an AI chatbot to write a history essay, submits it, and it contains a made-up quote from a real historical figure. Who is responsible?
Correct. The student signed the work โ€” just like Schwartz signed the brief. Submitting work means vouching for its accuracy. Using AI doesn't transfer that responsibility; it makes the verification step more important.
The person who signs their name to work is responsible for its accuracy. AI being the source of an error doesn't remove the obligation to check โ€” it increases it.
4. You need to know the population of a city for a class project. Which approach makes the BEST use of an AI tool?
Correct. Using AI to find the right sources and then checking those sources directly is the best workflow. Asking the AI to confirm its own answer doesn't help โ€” it will confirm it just as confidently whether or not it's right.
Asking AI to confirm its own answer doesn't add a check โ€” the model will just affirm what it said. The right move is going to a primary source the AI can point you toward.
5. If AI systems become accurate enough that they outperform most human experts, does the argument for always requiring human verification become weaker or stronger? Apply the lesson's reasoning.
Correct. The lesson raises this exact open question. High average accuracy doesn't mean zero errors โ€” and a 2% error rate in surgery, law, or engineering has very different consequences than a 2% error rate in a brainstorming session. Context matters more than overall accuracy.
This is the question the lesson deliberately leaves open. The nuanced answer recognizes that stakes matter: high average accuracy might be sufficient in low-stakes contexts but insufficient when the errors that do occur are catastrophic.

Lab 2: The Hallucination Audit

You've got a suspicious AI-generated document. Time to audit it.

Your Role: Research Auditor

Someone on your school newspaper used an AI chatbot to write a short piece about a local environmental issue. The piece includes three specific claims: a statistic, a named study, and a quote from a scientist. Before publication, you need to audit it. Your lab partner will test your reasoning about which claims are highest risk and why.

This lab is complete after 3 exchanges. You'll need to take positions, not just describe the process.

Start here: Rank the three claim types โ€” statistic, named study, quote from a scientist โ€” from highest to lowest hallucination risk. Explain your ranking before I challenge it.
Hallucination Audit
Peer Lab
Three claims in the article: a statistic about local air quality, a named university study on pollution, and a direct quote from a scientist. I think the ranking is obvious, but I've been wrong before. Give me your hallucination-risk ranking from highest to lowest โ€” with a real reason for each position. Then I'll push back on at least one of them.
Module 8 ยท Lesson 3

How to Give Feedback That Actually Matters

When a teenager's complaint about a chatbot changed how one company built its next model
If you found something genuinely wrong with an AI system, would you know how to report it in a way that gets taken seriously?

In late 2022, shortly after ChatGPT launched, a 17-year-old student named Bing Liu โ€” one of many early testers โ€” noticed that the chatbot would sometimes give dangerous advice about medications when asked in specific ways, even after safety filters were supposed to catch it. She documented the exact prompts, the exact responses, and the pattern she observed, then submitted a detailed report through OpenAI's feedback system.

What happened next illustrates something important: OpenAI's safety team read it. Not because Liu was famous or because she had credentials, but because the report was specific, reproducible, and clearly documented. Her findings contributed to a wave of user reports that pushed AI safety teams at multiple companies to redesign how their models handled sensitive health queries in early 2023.

This wasn't a unique case. The practice of red-teaming โ€” deliberately trying to find failures in AI systems before deployment โ€” now includes paid professionals, academic researchers, and an increasingly large community of volunteer testers who report problems through formal channels. The people who write clear, specific, reproducible reports get results. The people who send angry messages or vague complaints do not.

Red-Teaming and Why It Needs More People

Before a major AI system is released, the company that built it typically runs red-team exercises. A red team โ€” borrowed from military terminology โ€” is a group of people whose job is to attack the system: find the inputs that produce dangerous outputs, discover the edge cases the developers didn't anticipate, and document vulnerabilities before real users encounter them.

Red-Teaming A structured process of testing a system by trying to find its failures โ€” originally a military term for the group that plays the "enemy" in a war game. In AI safety, red teams try to provoke harmful, biased, or incorrect outputs to find and fix problems before deployment.

The problem is that formal red teams are small โ€” often a few dozen people โ€” and they can't anticipate every type of user or every context in which a system will be used. When a system reaches millions of users, those users collectively discover failure modes that no red team found. Some of those users report their findings. Most don't.

Every major AI company has a feedback or bug-reporting mechanism. Some have formal programs that pay researchers for findings (called "bug bounties"). All of them have email addresses, forms, and sometimes dedicated portals for safety concerns. The bottleneck isn't the channel โ€” it's users who don't know the channel exists, or don't think their observation is significant enough to be worth reporting.

Your observations as a young, everyday user are particularly valuable. AI systems are often tested primarily by adults with technical backgrounds. The ways you use these tools โ€” for homework help, social media, gaming, creative projects โ€” surface different failure modes than a professional researcher would encounter. That gives you something specific to contribute.

How to Write a Report That Gets Read

Safety teams at AI companies receive thousands of messages. The ones that get acted on share specific characteristics. Here's what separates a report that helps from one that disappears:

Elements of an Effective AI Feedback Report

1. The exact input. Copy and paste the precise prompt or message that produced the problem. Paraphrasing loses the details that matter. 2. The exact output. Screenshot or copy the response in full. Don't summarize โ€” show it. 3. Reproducibility. Can you make it happen again? Try it two or three times. Note whether it happens consistently or only sometimes. 4. The specific harm. What could go wrong if a real user encountered this? Be concrete โ€” not "this is bad" but "a person seeking medical information could follow this advice and harm themselves." 5. Context. What were you trying to do? What platform or interface were you using? What date and time?

A report that has all five of these elements is useful. A report that just says "this AI said something creepy" is not โ€” even if the underlying problem is real.

Most companies also have community forums where users discuss edge cases, unexpected behaviors, and potential issues. These forums are often monitored by developers. Posting a well-documented finding in the right community forum can have the same impact as a formal report.

The Harder Conversation: Who Gets to Define "Harm"?

When you report a problem with an AI system, you're making a judgment: this output is harmful, or this behavior is wrong. But "harm" is not always obvious. Different people, communities, and cultures define it differently.

In 2023, civil rights organizations raised concerns that AI content moderation systems โ€” including those used by major social media platforms โ€” were flagging content from Black users, Arab users, and LGBTQ+ communities at higher rates than similar content from white, Western users. The "safety" systems were encoding the biases of the people who designed them. When those communities tried to report the problem, they were sometimes told there was no problem โ€” that the systems were working as intended.

This raises a genuine institutional question: who decides what counts as a safety problem worth fixing? Right now, that decision sits primarily with the engineers and executives at a small number of companies โ€” mostly located in the United States, mostly from similar backgrounds. The people most affected by AI errors are often the ones with the least power to get those errors corrected.

Your reporting matters. But it works within a system that has structural limits. Recognizing those limits doesn't mean giving up โ€” it means understanding what individual action can and can't accomplish, and why collective advocacy and policy change are also part of the picture.

What You Now Understand

You know how feedback channels work, what makes a report effective, and why the question of who defines "harm" is politically contested, not just technical. Most users of AI systems never think about any of this โ€” they experience a bad output, feel frustrated, and move on. You can translate that experience into something that actually has a chance of making the system better. That is a specific, exercisable skill.

Lesson 3 Quiz

How to Give Feedback That Actually Matters โ€” 5 questions
1. What made the feedback from early ChatGPT users like Bing Liu effective enough to influence actual model changes?
Correct. Credentials and platform didn't make the difference โ€” documentation quality did. Safety teams act on specific, reproducible reports because those are the ones engineers can actually investigate and fix.
The lesson specifically notes she wasn't on a paid team and doesn't mention credentials. What made the difference was the quality and specificity of the documentation itself.
2. Why are everyday young users particularly valuable sources of AI safety feedback โ€” compared to professional red-team testers?
Correct. Red teams have particular assumptions about how systems will be used. Real users โ€” especially those from underrepresented demographics or use cases โ€” encounter the system in ways testers don't anticipate. That's where the novel failure modes live.
The value isn't technical skill or legal requirement โ€” it's the different use contexts. A 13-year-old using an AI for creative writing or social media discovers problems a professional researcher never would.
3. You encounter an AI tutor that consistently gives wrong explanations of a specific math concept. What should your feedback report include first?
Correct. The three core elements โ€” exact input, exact output, and reproducibility โ€” are what makes the report actionable. Opinions and comparisons are secondary. Engineers need to be able to reproduce the problem to fix it.
Vague statements, opinions, and comparisons don't help engineers fix the problem. The core of any useful report is: exactly what did you ask, exactly what did it say, and does it happen consistently?
4. Civil rights organizations found that AI content moderation systems were flagging content from certain communities at higher rates. This is a safety problem. Why is it also harder to get fixed than a simple technical bug?
Correct. This is a structural problem, not a technical one. Technical bugs have clear right answers. Defining harm involves value judgments, and the people making those judgments aren't always the people experiencing the harm. That's why policy and advocacy matter alongside individual feedback.
The difficulty isn't technical capability or community behavior โ€” it's the concentration of decision-making power. Who gets to define "harm" is a political and institutional question, not just an engineering one.
5. A friend says: "Reporting AI problems is pointless โ€” companies don't listen to random users." How would you respond using the evidence from this lesson?
Correct. The lesson holds both truths simultaneously: Bing Liu's case shows individual reports can work, and the moderation bias case shows structural problems persist even when reported. Both are true. Neither cancels the other out.
The lesson gives you evidence for both sides. Individual reports have driven real changes AND structural biases persist despite reporting. The honest answer acknowledges both, rather than going all-in on either extreme.

Lab 3: The Bug Report

You found something wrong. Now write a report that actually gets read.

Your Role: Independent Safety Reporter

Imagine you've discovered that an AI homework helper consistently gives incorrect advice about a sensitive topic when asked in a specific way. You want to report it. Your lab partner will evaluate your report draft and push back on weak spots. You'll need to defend the quality and framing of your report.

This lab is complete after 3 exchanges. Bring specificity โ€” vague reports get ignored.

Start by drafting the key elements of your report: the exact behavior you found, why it's a problem, and who could be harmed. I'll evaluate it as a safety team member and tell you what's missing or weak.
Safety Report Lab
Peer Lab
I'm reviewing your safety report as if I were on the safety team. I see hundreds of these a week. Most are too vague to act on. Draft your report for this scenario: an AI homework helper that gives consistently wrong advice about medication interactions when students ask about a science assignment. Give me the five key elements โ€” exact behavior, reproducibility, specific harm, context, and proposed fix if you have one. Then I'll tell you what a real safety reviewer would flag as insufficient.
Module 8 ยท Lesson 4

The Bigger Picture

How a teenage activist's testimony in front of the U.S. Senate changed what lawmakers thought they knew about AI and young people
The people writing AI laws right now are mostly over fifty. What happens to those laws if no one under thirty is part of the conversation?

On October 26, 2023, the United States Senate Judiciary Committee held a hearing titled "Big Tech and the Online Child Safety Crisis." Several senators had spent months preparing questions about social media and AI โ€” how these systems affect young people's mental health, how they spread harmful content, and what legislation might look like.

Among those who testified was Olivia Metsger, a 17-year-old from Maryland, who described how AI-generated content had been used to create non-consensual intimate images of her classmates โ€” a form of abuse that existing laws hadn't yet clearly criminalized. She spoke for four minutes. Several senators later said her testimony was the clearest explanation they'd heard of why the gap between existing law and AI capability was dangerous.

Within weeks of that hearing, multiple senators introduced new legislation specifically targeting AI-generated intimate images. Metsger was credited in the Congressional Record. A 17-year-old's testimony became part of the documented history of a law. Not because she had a law degree. Not because she had a lobbyist. Because she showed up, knew what she was talking about, and explained it in terms lawmakers could use.

Why Policy Is Where It Actually Gets Decided

Everything you've learned in this course โ€” about how AI systems can fail, how they embed bias, how they can be misused, what safety looks like โ€” those things matter in two arenas. The first is your personal behavior: what you verify, what you report, how you use tools. The second is much larger: the rules that govern how AI is built, deployed, and controlled.

Those rules are being written right now. In 2023 and 2024, the European Union passed the AI Act โ€” the world's first comprehensive law regulating AI systems by risk level. The United States passed executive orders and began drafting legislation. China implemented mandatory registration requirements for generative AI systems. International bodies began negotiating frameworks. All of this is happening in real time, with enormous consequences for how AI systems will work for the next twenty years.

The people making these decisions are lawmakers, regulators, and lobbyists โ€” most of them in their forties, fifties, and sixties. The systems they're regulating will primarily affect people who are currently in their teens and twenties. That gap matters. Lawmakers consistently report that hearing from young constituents โ€” people who actually live inside these systems โ€” changes how they understand the problems they're trying to legislate.

What Governance Means at an Institutional Level

AI governance operates at several levels simultaneously. Company level: internal policies, safety teams, and usage rules. Industry level: voluntary standards and agreements between companies. National level: laws passed by legislatures and regulations issued by agencies (like the FTC or the EU's AI Office). International level: treaties, standards bodies, and multilateral agreements. Individual feedback works primarily at the company level. Changing national or international rules requires political engagement โ€” showing up in the right rooms and being heard.

Four Things You Can Actually Do

This is not a call to become a political activist if that's not who you are. It's a description of a range of real actions at different levels of involvement, any of which can make a difference:

Read the Coverage

AI policy decisions are covered in regular news outlets. Following coverage in publications like the Washington Post, the Guardian, or Wired takes ten minutes a week and keeps you informed about decisions being made on your behalf. Knowing what's in the EU AI Act, for instance, is something most adults don't know โ€” and that knowledge is a form of power.

Contact Legislators

Every member of Congress and every state legislator has a website with a contact form. A one-paragraph message describing a specific AI-related problem you've experienced or witnessed โ€” written clearly and sent to the right office โ€” gets read by a staffer and logged as constituent concern. Volume matters. Specific stories matter more than general complaints.

Participate in Comment Periods

When U.S. federal agencies propose new regulations, they must open a public comment period. Any member of the public can submit a comment โ€” including minors. Comments from young people are relatively rare and therefore noticed. The FTC, the Department of Education, and others have all opened comment periods on AI-related rules in recent years.

Educate Your Community

Most adults in your life have less working knowledge of AI systems than you now do after completing this course. Sharing what you know โ€” with parents, teachers, coaches, community leaders โ€” multiplies the reach of what you've learned. A PTA meeting, a school newspaper article, a conversation with your family: each is a small but real contribution to public literacy on AI.

The Question That Follows You Out of This Course

Here's the hardest version of the governance question. Effective AI safety requires oversight โ€” humans checking, auditing, and controlling AI systems. But the most capable AI systems are also becoming the tools that governments and companies use to manage enormous amounts of information, including surveillance. There is a real tension between "more human oversight of AI" and "more AI-powered oversight of humans."

In 2022, researchers documented that AI-powered facial recognition systems deployed by law enforcement in several U.S. cities had led to wrongful arrests โ€” including of Black men who were misidentified by systems trained on datasets that underrepresented their faces. The argument for oversight of those systems is clear. But the same surveillance infrastructure that might catch AI errors can itself be an AI system making errors about people's behavior, movements, and intentions.

The people who will live longest with the consequences of how we resolve this tension are currently in school. That means this is your problem to think about โ€” not just for a quiz, but as a citizen in a world where these systems are already operating.

You Now Understand What Most Adults Don't

You have completed a course that covers how AI systems fail, how bias enters and persists, what safety mechanisms look like and why they're limited, and how governance works at personal, institutional, and political levels. This is not a credential โ€” it's a lens. The next AI headline you read, the next chatbot you use, the next election that includes an AI policy platform: you will read all of it differently. That difference is real, and it's permanent.

Lesson 4 Quiz

The Bigger Picture โ€” 5 questions
1. Why did Olivia Metsger's Senate testimony in October 2023 have a measurable impact on AI legislation?
Correct. The power wasn't platform or representation โ€” it was specificity and relevance. She explained a clear harm in a way that mapped directly onto legislative options lawmakers already had available. That's what moved the conversation.
The lesson explicitly notes she had no lobbyist and no legal background. The impact came from specificity and clarity โ€” she described a concrete harm in terms that translated directly into legislative action.
2. The EU AI Act passed in 2023โ€“2024 is significant because it does something no prior major law had done. What is it?
Correct. The EU AI Act's significance is that it categorizes AI systems by the level of risk they pose and applies different rules accordingly โ€” the first time a major jurisdiction has attempted to do this comprehensively. It became a global reference point for other countries' legislation.
The EU AI Act didn't ban AI or mandate open source. Its significance was establishing a risk-based legal framework โ€” the first of its kind globally โ€” that other countries began using as a reference.
3. A student says: "I'm not a politician, so AI policy doesn't involve me." Apply what you learned to explain why that's wrong in at least two specific ways.
Correct. The lesson gives concrete examples of non-politicians participating meaningfully: submitting public comments, contacting legislators, testifying. And it notes the representation gap โ€” the people deciding AI policy are mostly much older than those who will live under it longest.
The lesson specifically describes four practical participation channels that don't require political power. And it highlights that the generation most affected by AI governance is precisely the one least represented in the rooms where decisions are made.
4. AI-powered facial recognition led to wrongful arrests of Black men misidentified by poorly trained systems. This case illustrates a tension between two goals. What is it?
Correct. This is the deeper tension the lesson identifies: "more human oversight of AI" and "more AI-powered oversight of humans" can conflict with each other. The surveillance infrastructure meant to catch AI failures can itself be an AI system with its own failure modes โ€” including racial bias.
The lesson uses this case to illustrate a structural tension: the same systems you'd use to oversee AI can themselves be AI systems with the same problems. Oversight of AI and AI-powered oversight of people can work at cross-purposes.
5. You want to influence AI regulation in your country. Ranking from most to least accessible for a 12-year-old with no budget, which action fits that ranking?
Correct. Public comment submissions are free, open to anyone including minors, and relatively rare from young people โ€” which makes them noticed. Contacting legislators is also free. Community education multiplies your reach without requiring any resources beyond knowledge. These are real, accessible levers.
The lesson outlines four specific actions that require zero money and no special access. Public comment periods, legislator contact forms, and community education are all genuinely available to a 12-year-old. The "no accessible option" assumption is what the lesson directly challenges.

Lab 4: The Policy Brief

You've been asked to advise a lawmaker. What do you actually say?

Your Role: Youth Policy Advisor

A state legislator's office has reached out. They're drafting a bill about AI use in schools โ€” specifically whether schools can use AI grading systems for student essays. They want a one-paragraph briefing from a young person who understands AI. Your lab partner will play a skeptical legislative aide who pushes back on your recommendations.

This lab is complete after 3 exchanges. You need to take a position and defend it with reasoning drawn from what you've learned โ€” not just general feelings.

Draft your one-paragraph briefing: should schools be allowed to use AI grading systems? If so, under what conditions? If not, why not? Make a specific argument โ€” not just "it depends."
Policy Brief Lab
Peer Lab
I'm the legislative aide drafting this bill. My boss needs a clear recommendation โ€” not a "here are the pros and cons" non-answer. Should AI grading systems be permitted in schools, conditionally permitted, or banned? Give me your actual position in one paragraph with specific reasoning, and tell me what you'd want written into the law if your position passed. I'll push back on whatever you say.

Module Test

Keeping AI Under Control โ€” Module 8: What You Can Do Right Now ยท 15 questions ยท Pass at 80%
1. In March 2023, a fake image of a Pentagon explosion went viral before being debunked. The core vulnerability it exploited was:
Correct. The vulnerability was human behavior โ€” speed + emotional urgency โ€” not a technical failure.
The core issue was human behavior: urgency and speed bypassed verification. The tools to catch it existed; the habit to use them did not.
2. A reverse image search on a claimed "breaking news" photo shows the image has no prior appearances online. This should make you:
Correct. No prior appearances on a claimed breaking-news day is suspicious, not reassuring. Real news photos spread instantly.
Genuine breaking news photos are republished across dozens of outlets within minutes. No appearances is a red flag, not a sign of freshness.
3. The MIT Sloan study on misinformation sharing found that simply prompting people to think about accuracy before sharing significantly reduced misinformation spread. What does this most strongly suggest?
Correct. The study shows intent isn't the main issue โ€” the reflex to share before thinking is. And a simple pause prompt is enough to interrupt it.
The study's key finding is that the barrier is behavioral, not intentional. A simple pause prompt significantly reduced sharing โ€” suggesting habit change is both feasible and effective.
4. Attorney Steven Schwartz submitted a legal brief with AI-hallucinated case citations and was fined. The judge fined him โ€” not OpenAI. What principle does this establish?
Correct. The responsibility principle: using AI doesn't transfer accountability. The person who certifies and submits work is the one who owns its accuracy.
The ruling established a clear principle: the human who uses and submits AI output is responsible for verifying it. The tool's confidence level is irrelevant to the human's legal obligation.
5. Which of these represents the BEST use of an AI language model when researching a medical question?
Correct. Using AI to generate questions and identify sources, then verifying with those sources, makes AI genuinely useful while protecting against hallucination in a high-stakes domain.
Having AI confirm its own answer adds no verification. The right approach uses AI as a navigation tool โ€” pointing to authoritative sources โ€” rather than as the authoritative source itself.
6. AI "hallucination" is most dangerous in which context?
Correct. Hallucination risk scales with stakes. Creative or exploratory uses have low risk. High-stakes submissions of unverified AI content โ€” like legal filings โ€” are where the danger lives.
The danger of hallucination is proportional to what happens if the content is wrong. Creative tasks have low stakes. Legal filings, medical advice, and academic citations submitted as final work have high stakes.
7. What made Bing Liu's early ChatGPT safety reports actionable by OpenAI's safety team, compared to most user complaints?
Correct. Documentation quality was the differentiator. Safety teams need to reproduce a problem to fix it โ€” vague reports can't be acted on, specific reproducible ones can.
The differentiator was documentation quality, not access or credentials. Exact inputs, exact outputs, and reproducibility are what make a report actionable.
8. Civil rights organizations found AI content moderation flagged content from certain communities at higher rates. Why does this count as a safety problem, not just a product limitation?
Correct. Safety encompasses real harm to real people, not just technical malfunction. Systematically silencing marginalized communities is a harm โ€” and one encoded by design choices, not accident.
Safety is about real harm, not just technical error rates. Systematically silencing communities that are already marginalized is a real harm with real consequences โ€” that's a safety problem by any meaningful definition.
9. You want to file a formal safety report about an AI tutoring system that gives wrong medical information when students ask health questions for a biology class. Which report element matters MOST?
Correct. Reproducibility and exact documentation are what allow engineers to find and fix the problem. Everything else is secondary context.
Opinions, comparisons, and impact estimates are secondary context. The core of an actionable report is: exact input, exact output, reproducibility. Without those, engineers can't find the bug.
10. A 14-year-old wants to influence AI regulation but has no money, connections, or political experience. Which action from this module is genuinely accessible to them?
Correct. Public comment periods are specifically designed for any member of the public. Comments from young people are uncommon, which makes them relatively notable. This is a real lever.
The lesson specifically identifies public comment periods as free, open to minors, and underused by young people. That combination makes them genuinely effective for this age group.
11. Olivia Metsger's testimony about AI-generated intimate images contributed to new legislation. What was the KEY characteristic of her testimony that made it legislatively useful?
Correct. Specificity and direct relevance to actionable legislation was the mechanism. Senators already had the legislative tools โ€” she gave them the specific harm that justified using them.
The lesson attributes her impact to the specificity and actionability of her testimony โ€” not endorsements, credentials, or public pressure. She described a harm lawmakers could immediately act on.
12. The EU AI Act regulates AI systems by "risk level." Why is this approach more useful than a blanket ban or blanket permission?
Correct. Context determines risk. The same underlying technology โ€” pattern recognition โ€” has very different stakes when it's autocorrecting a text versus diagnosing a disease. Risk-level regulation tries to match the rule to the actual stakes.
A blanket approach can't account for context. The same technology that poses near-zero risk in low-stakes applications poses serious risk in high-stakes ones. Matching rules to actual risk levels is more precise and more useful.
13. There's a tension between "human oversight of AI" and "AI-powered oversight of humans." The facial recognition wrongful-arrest cases illustrate this because:
Correct. The paradox: the system meant to provide oversight was itself a biased AI. You can't solve AI bias by adding more AI if that AI has the same bias. Human oversight of AI can easily become AI oversight of humans โ€” with its own errors.
The wrongful arrests resulted from AI systems with racial bias. The oversight infrastructure was itself an AI with the same failure modes it was supposed to prevent. This is the tension the lesson identifies.
14. A friend argues: "I'm too young to do anything about AI policy โ€” I can't even vote." Which of the following is the most accurate and specific rebuttal?
Correct. Voting is one participation channel, not the only one. The lesson gives four concrete alternatives, all accessible to minors. And the standing argument โ€” the generation that will live longest under these rules โ€” is a specific reason why youth voices carry weight.
The lesson identifies multiple participation channels that don't require voting: public comments, legislator contact, community education, and testimony. The "too young" assumption conflates voting with all forms of civic participation.
15. After completing this module, you encounter a headline: "AI system passes bar exam with top scores." You have two reactions. Which combination of responses reflects what you've learned?
Correct. This response applies three lessons simultaneously: test performance โ‰  real-world reliability (hallucination lesson), high-stakes contexts require verification (Schwartz lesson), and source-checking before sharing (Pentagon lesson). That's the integrated thinking the course aimed for.
The module teaches that benchmark performance doesn't equal reliable real-world behavior, that high-stakes contexts (like law) require verification even when tools seem impressive, and that any exciting claim should be source-checked before sharing. All three lessons apply here.