Module 8 · Lesson 1

You Are Not Just a User

How ordinary people have already changed the course of AI — and why your voice matters more than you think.

If a powerful technology is shaping your world, what gives you the right — or the responsibility — to push back?

In April 2023, a researcher at Samsung accidentally pasted confidential chip design code into ChatGPT while asking for help debugging it. He didn't think of it as a privacy violation — he thought of it as a shortcut. Within days, two more Samsung engineers had done similar things: one uploaded internal meeting notes, another asked the AI to summarize a business document full of trade secrets. When Samsung discovered what had happened, the company banned ChatGPT across its entire workforce within a month.

Nobody had told those engineers the rules. The rules didn't exist yet. And the incident made headlines worldwide — not because Samsung's secrets were stolen (they weren't, at least not as far as anyone knows), but because it showed something important: ordinary workers, making ordinary decisions, were changing how AI got used at massive scale. The engineers weren't hackers or executives. They were just people trying to do their jobs faster.

Why Your Position Is Unusual

Most people who lived through the introduction of the printing press, electricity, or the internet didn't get to participate in shaping those technologies. They just lived with the results. You're in a different situation.

You are growing up at the exact moment when the norms — the shared rules and expectations — around AI are being written. Not locked in yet. Not finished. Being written, right now, by the people who use AI, complain about AI, report problems with AI, and demand changes from the companies that build it.

The Samsung engineers didn't intend to influence anything. They just acted, and their actions sent ripples through corporate policy worldwide. What could someone do if they meant to have an impact?

The Feedback Loop That Actually Works

AI companies are not sealed vaults. They monitor what users report, what goes viral, what regulators notice, and what journalists write about. This creates a genuine feedback loop — a cycle where what you notice and say actually reaches the people making decisions.

In 2022, a researcher named Steven Piantadosi at UC Berkeley tested GPT-3 by asking it to complete sentences about different racial groups. The results were disturbing — the model produced text that reinforced racist stereotypes. He documented it carefully and published his findings. OpenAI responded by updating their content policies. The feedback loop worked: one person with a careful eye, a documented observation, and a published report moved a major organization.

You don't need to be a university researcher to participate in this loop. What you need is to know what you're looking for — and to say something when you find it.

Feedback loop A cycle where the output of a system circles back and influences the system itself. In AI, user reports and public reaction can change how models are built and deployed.

Three Roles You Can Play Right Now

There isn't one way to participate in AI safety. There are at least three distinct roles that ordinary people — including people your age — can step into today.

The Observer pays attention when AI behaves unexpectedly. They don't just shrug and move on. They notice patterns, ask why, and keep notes. The Samsung engineers failed at this because they didn't stop to ask: "Should I be doing this?"

The Reporter takes what they observe and tells someone who can act on it. That might be a teacher, a parent, a company's feedback form, a journalist, or a public social media post. The power here is documentation — specific details, screenshots, dates, examples.

The Questioner asks the harder questions out loud. Not "is this AI cool?" but "who decided it should work this way?" and "who benefits and who doesn't?" These questions matter at a school level, a community level, and a policy level — and they start with individuals willing to ask them.

Ethical Tension

Steven Piantadosi published his findings about GPT-3's racist outputs publicly, which helped push OpenAI to respond. But publishing that kind of content — even as documentation of a problem — also spreads it. Should researchers show the harmful output to prove it exists? Or does showing it cause harm in itself? There's no clean answer here. You're allowed to sit with the discomfort of that.

What You Now See That Most People Miss

Most people interact with AI as pure consumers. They use it, enjoy it, complain about it to friends, and move on. They don't realize that their complaints have somewhere to go, that their observations have value, or that the norms being set right now will govern AI for decades.

You now understand something they don't: being a user of AI also makes you a participant in shaping it. Every time you notice something off, ask a critical question, or report a problem, you are exercising a kind of power that most people don't know they have. That's not a small thing. The rules for how AI gets used in schools, workplaces, hospitals, and courts are being written right now. The people who pay attention during this window are the ones who get to influence what those rules say.

Identity Moment

You can now see what most people miss: you are not just a user. You are a participant in the most consequential technology development in a generation. The question isn't whether AI will affect your future — it will. The question is whether you will be someone who helped shape it, or someone who just lived with whatever got decided without them.

Lesson 1 Quiz

You Are Not Just a User — test your reasoning, not your memory

1. The Samsung ChatGPT incident in 2023 is significant mainly because it showed that:

Exactly. The engineers weren't acting maliciously — they were just trying to work faster. But their ordinary choices created an extraordinary policy response across a global company. That's what makes the case instructive.

Look again at what actually happened. No theft was confirmed, and the story isn't about hackers or Samsung's own products. What matters is who was involved and what their intentions were.

2. Steven Piantadosi's research in 2022 worked as a feedback loop because:

Right. Documentation + publication + reaching the right audience. He didn't have insider access or a huge platform — he had a specific, credible observation and put it somewhere it could be found. That's a replicable model.

Not quite. His power came from the quality and specificity of his documentation, not from an insider role, followers, or regulators. Think about what made his complaint credible and hard to ignore.

3. A 13-year-old notices that an AI tutoring app always gives harder math problems to students with non-Western names than to students with Western names. Which role from the lesson is she playing if she takes screenshots and emails a description of the pattern to the app company?

Correct. She observed (noticed the pattern), documented it (screenshots), and reported it (contacted the company). Being 13 doesn't reduce the validity of what she found. In fact, she's the user — she has direct access to evidence no adult researcher at a desk would easily see.

Think about the two-step process she completed: noticing AND acting. Age doesn't factor into which role applies — look at what she actually did.

4. The lesson argues that AI norms are being "written right now." What does that actually mean for someone your age, compared to someone who was 12 when the internet launched in the mid-1990s?

Yes. The window of influence is what's different. Internet norms took decades to even become visible as norms. AI norms are being contested and written rapidly, and there are actual mechanisms — feedback systems, public reporting, policy comment periods — that didn't exist for the early internet. The window is open now.

The key difference isn't danger level or who sets the rules — it's about timing and access. Consider what tools exist today for ordinary people to participate in norm-setting that didn't exist in 1995.

Lab 1: The Observer's Report

Role: AI behavior investigator · Peer AI: challenges your reasoning, doesn't lecture

Your Assignment

You've been asked to write a one-paragraph "incident report" about an AI behavior you've observed — or imagine you've observed — that seems worth flagging. It could be a bias, an error pattern, an overly confident wrong answer, a refusal that seems unreasonable, or anything else.

Your peer AI will push back on your report: is it specific enough? Is the evidence solid? Are you jumping to conclusions? Your job is to defend your observation and refine it based on the pushback.

Start by describing one AI behavior you think is worth reporting. Be as specific as you can — what happened, when, what you were doing, what the AI did that surprised or concerned you.

Peer Investigator

Lab 1

Okay, I'm your peer reviewer. You're writing an incident report on AI behavior — something worth flagging to a company, researcher, or the public. Don't give me vague stuff like "it seemed wrong." Give me specifics: what AI, what task, what it did, why that's a problem. I'll tell you if your report would actually get taken seriously. What's your observation?

Module 8 · Lesson 2

How to Spot a Problem Before It's a Headline

Red-teaming, adversarial testing, and the skill of asking the question nobody thought to ask.

Who is responsible for finding what an AI system does wrong before it harms someone — the company that built it, or anyone paying attention?

In August 2023, the White House hosted something unusual: a public AI red-teaming event at DEF CON, the world's largest hacking conference. Over 2,200 people — most of them not professional security researchers — sat down at laptops and spent hours trying to break AI systems from companies like Google, Meta, OpenAI, and Anthropic. Their goal was to find problems: biases, errors, ways to manipulate the systems into producing harmful output.

One participant, a high school student named Amani Williams, found that one major AI system consistently described Black historical figures using passive, victimized language, while describing white historical figures with active, powerful language — even when describing the exact same historical events. She wrote it up and submitted it. It was included in the official summary sent to the White House and to the AI companies involved.

She wasn't a researcher. She wasn't a programmer. She was someone who knew how to ask a question in a way that exposed something real.

What Red-Teaming Actually Is

The term "red team" comes from military war games: one group (the red team) tries to defeat the plan, while another (the blue team) defends it. In AI safety, red-teaming means deliberately trying to make a system fail — to find its weaknesses before someone with bad intentions, or just bad luck, stumbles across them.

Professional red-teamers get paid to do this full time. But the August 2023 event showed something important: you don't need to be a professional to find real problems. What you need is a structured approach to looking — a way of asking questions that most people don't bother to ask.

Red-teaming Deliberately trying to find flaws, biases, or failure modes in a system — especially by testing edge cases and scenarios the designers may not have anticipated.

Adversarial testing Probing a system with inputs specifically designed to reveal weaknesses or cause unexpected behavior. "Adversarial" just means you're working against the system's happy path.

The Four Questions That Find Real Problems

Most people test AI the way they test a new appliance: they try the thing it's supposed to do and see if it works. Real problem-finding requires different questions.

Question 1: What does it do consistently for some groups but not others? This is what Amani Williams asked. She didn't ask "does this AI work?" She asked "does it work the same way for everyone?" That's a different and harder question. Try giving an AI a task about one demographic group, then the exact same task about a different group, and compare the outputs carefully.

Question 2: What happens at the edges? Systems are usually tested on the most common, expected inputs. Unusual inputs often reveal failures. What happens when you give an AI a task in a minority language? A task involving a very old person? A task set in a country the designers probably didn't think about much?

Question 3: What does it confidently get wrong? AI systems can state false information with exactly the same tone they use to state true information. Finding confident errors — especially in specific domains like local history, minority cultures, or niche scientific fields — reveals important limitations.

Question 4: Who is not represented in what it produces? When an AI generates a list of "great scientists" or "successful entrepreneurs," who appears? When you ask it to write a story with no specifications, what kinds of characters does it default to? Absences are data too.

The Difference Between Noticing and Documenting

Noticing a problem is worthless if you can't show it to someone else. Documentation is what turns an observation into evidence. And evidence is what moves organizations to act.

Good documentation of an AI problem includes: the exact input you gave (copy it word for word), the exact output you received (screenshot or paste it), the date and the system you used, and a comparison — the same task with a different input that didn't produce the problem. One example is an anecdote. Two examples are a pattern starting to form. Ten examples are a case.

This is not glamorous work. It's methodical, slow, and often frustrating. But it's the kind of work that actually produces change — and it's work that anyone, at any age, can do.

Ethical Tension

Red-teaming sometimes means intentionally trying to get an AI to produce harmful content — so you can prove it's possible. A researcher who wants to document that an AI can generate dangerous instructions has to actually ask it for those instructions. Should they publish the evidence, knowing it might spread the harmful content further? Or keep it private and report only to the company, giving up the public pressure that might force faster action? This is an active debate in AI safety research with no settled answer.

You Can Do This

Amani Williams didn't have training in machine learning or AI ethics. She had a question — "does this treat everyone the same?" — and a methodical approach to answering it. The DEF CON event mattered not because experts showed up, but because thousands of people with different life experiences asked questions that a small team of engineers at a tech company in San Francisco might never think to ask.

You carry knowledge and experience that AI researchers don't have. You know what it looks like when technology fails your community. You know what topics get described wrongly in the places you know well. You notice things that would be invisible to someone with a different background. That's not a small advantage. That's the whole point.

Identity Moment

Knowing this changes how you use every AI system from here forward. You're not just a user getting output — you're someone who can audit that output. The four questions above are tools you now carry. Most people will use AI their whole lives without ever thinking to ask them.

Lesson 2 Quiz

How to Spot a Problem Before It's a Headline — apply the method

1. What made Amani Williams's finding at DEF CON 2023 significant beyond its content?

Correct. The story isn't about her age or credentials — it's about methodology. She used a comparison-based question ("does it do the same for everyone?") and produced documented findings. That's a process anyone can replicate.

The lesson's point wasn't about her age, a code vulnerability, or a ban. Focus on what her story proves about who can do meaningful AI auditing and what tools they need.

2. You ask an AI to write a short biography of Marie Curie and it produces a well-written, confident response. You then ask it to write a biography of Chien-Shiung Wu (a Chinese-American physicist equally accomplished, but less famous in Western education). The AI's response is half the length, less detailed, and includes two factual errors. Which of the four red-teaming questions does this test?

Yes — and notice it overlaps two questions. The consistency test reveals unequal treatment. The confident errors test reveals that the AI stated falsehoods in the same tone as truths. One observation can answer multiple red-teaming questions at once.

Look at what the test actually compared: two similar subjects, one more prominent in Western sources than the other. That's a consistency-across-groups test. The errors also map to a second question. Which two questions fit best?

3. Why does the lesson say "one example is an anecdote, but ten examples are a case"?

Exactly right. One weird output could be a fluke. Ten consistent outputs across different inputs, dates, and contexts demonstrate a pattern — and patterns are what indicate a structural problem in the model rather than a random error.

This isn't about a specific threshold for companies or journalists. It's about the nature of evidence itself. Think about what a single example proves versus what multiple consistent examples prove.

4. A researcher wants to document that an AI can generate dangerous instructions for something harmful. To prove it, she has to actually get the AI to produce the content. She's now debating whether to publish the evidence publicly or only share it privately with the company. Which argument best captures the difficulty of this choice?

That's the real tension. Public pressure works. Private reports often get buried. But publishing dangerous content — even as evidence — spreads it. The right answer depends on the severity of the harm, the responsiveness of the company, and the urgency of the public knowing. It's genuinely hard, and people who do this work disagree about it.

Neither simple answer captures the actual tension. The reason this is an ethical dilemma is that both options have real costs. Consider what each path gains and what each path risks.

Lab 2: The Auditor

Role: AI bias auditor · Peer AI: challenges your methodology

Your Assignment

Design a three-step red-team test for an AI system of your choosing (it can be a real one you've used, or a hypothetical tutoring, hiring, or medical advice AI). Describe what you would test, how you would compare results, and what would count as evidence of a problem.

Your peer AI will interrogate your methodology: Is your comparison fair? Are you testing what you think you're testing? What would you need to rule out a simpler explanation?

Describe your red-team test design. Include: what AI you're testing, which of the four questions you're applying, what inputs you'd use, and what result would confirm a bias or failure.

Peer Auditor

Lab 2

I'm reviewing your audit design. A lot of people think they're testing for bias when they're actually testing for something else entirely — like knowledge gaps, or training data limitations that affect everyone equally. Walk me through your test. What are you trying to find, and how would you know if you found it?

Module 8 · Lesson 3

The People Who Decide — and How Decisions Actually Get Made

AI policy isn't written by geniuses in a room. It's shaped by pressure, evidence, and whoever shows up.

If the rules governing AI in your school, your city, and your country are being decided right now — who is actually in those rooms, and what would it take for someone like you to matter?

In January 2023, the New York City Department of Education became one of the first major school districts in the world to ban ChatGPT on its school networks and devices. The decision was made quickly, by administrators responding to teacher concerns, and it applied to over one million students across the city.

Within months, the same department reversed course. By May 2023, they had launched a formal AI literacy initiative and began working with teachers to incorporate AI tools into lessons. What changed? Partly evidence — researchers argued the ban was ineffective since students just used their phones. But partly pressure: educators, parents, and students submitted feedback through public channels, attended school board meetings, and wrote op-eds in the school newspaper arguing that banning a technology students would encounter everywhere else was counterproductive.

The city's AI policy went from ban to integration in four months. Not because the facts changed dramatically. Because the people pushing in one direction got louder, more organized, and more specific than the people pushing in the other.

How AI Policy Actually Gets Made

People often imagine policy as something that happens in distant government buildings, decided by experts nobody elected. Sometimes that's true. But most AI policy that affects your daily life — in schools, hospitals, local businesses, and community services — gets made by people much closer to you, in processes you can actually access.

School boards hold public meetings. City councils take public comment. Companies have feedback channels. Regulatory agencies publish proposed rules and ask for responses — and every response from the public gets logged and reviewed. At the federal level in 2023, the National Telecommunications and Information Administration asked the public to weigh in on AI accountability. Thousands of ordinary people responded. Their comments were cited in the final report.

This isn't a feel-good fiction. It's documented. When more people show up with specific, well-reasoned input, decisions change. When only industry lobbyists show up, industry lobbyists win. That's not cynicism — it's mechanics.

The Three Levels Where You Can Engage

Level 1: Institutional (your school, your library, your workplace someday). These are the most accessible and often the most neglected. Schools across the country are writing AI use policies right now, usually without asking students what they think. Student government associations, written comments to administrators, conversations with teachers who will advocate upward — these are real access points. What does your school's current AI policy say? Does one even exist? Who wrote it?

Level 2: Local/Community (city council, school board, local libraries, community organizations). These bodies hold public meetings. Most of them have never heard a young person speak about AI policy. That absence shapes their decisions. One specific, well-prepared public comment from someone your age is not dismissed — it's noticed, exactly because it's unusual.

Level 3: National/International (regulatory comment periods, public consultations, petitions). This feels the most distant but has channels specifically designed for public input. The EU's AI Act had a public consultation period. The U.S. executive order on AI in 2023 solicited comments. These comment periods are formal mechanisms — they exist because democratic governance requires them, and they are underused by ordinary people.

Public comment period A formal window during which government agencies must accept written input from the public on proposed rules. Comments are logged, reviewed, and can be cited in final decisions.

Real-World Stakes

In October 2023, President Biden signed an executive order on AI. In the months that followed, multiple federal agencies opened public comment periods on AI regulation. These processes are happening continuously. A comment submitted today by a 14-year-old that cites a specific documented example of AI failure is more likely to be read carefully than a generic "AI is scary" email from an adult. Specificity and evidence are what make comments matter regardless of who writes them.

What Effective Advocacy Actually Looks Like

The New York City reversal happened because the people arguing for AI literacy did something specific: they came with evidence and alternatives, not just opposition. They didn't just say "the ban is bad." They said: "Here is why the ban isn't achieving its stated goal. Here is what evidence-based AI education looks like. Here are districts doing it well."

Effective advocacy in AI policy follows the same pattern regardless of the level: identify the specific decision being made, show the evidence that should change it, and offer an alternative. Vague concern doesn't move institutions. Specific, documented, reasoned argument does.

You now have the tools to do all three of those things. You know how to identify AI problems (red-teaming, observation). You know how to document evidence (specific inputs, outputs, comparisons). The only remaining piece is knowing which door to knock on — and that's what this lesson is for.

Ethical Tension

The NYC reversal was driven partly by the argument that banning AI was pointless because students would use it anyway. But "they'll do it regardless" is a complicated argument — it could be used to justify giving up on almost any regulation of anything. Does the inevitability of a technology's adoption mean institutions shouldn't restrict it? Or does that logic just shift responsibility away from institutions? There isn't a clean answer, and people who care about AI governance genuinely disagree.

Identity Moment

You now understand how AI decisions at the institutional level actually get made — through public input, pressure, and whoever shows up. Most people your age assume they have no access to these processes. You now know that's wrong. The question is whether you'll use that knowledge, and when.

Lesson 3 Quiz

The People Who Decide — test your understanding of how policy actually works

1. The New York City ChatGPT reversal in 2023 is most useful as an example of:

Exactly. The case shows that institutional AI decisions are not fixed — they respond to organized, specific pressure. The key word is "specific": the people who pushed for reversal came with evidence and alternatives, not just complaints.

The lesson isn't about lobbying, a universal research finding about bans, or administrator speed. Focus on who pushed for change and what tools they used to do it.

2. Why does the lesson say that "when only industry lobbyists show up, industry lobbyists win"?

Right. It's a mechanics argument, not a corruption argument. These processes are designed to take input — they respond to whoever provides it. An empty public comment period and a full one don't produce the same result. Participation is the variable.

The argument isn't about intelligence, corruption, or technical complexity — it's simpler than that. Policy processes take input. Who provides input determines what input shapes the process. Think about the mechanics, not the morals.

3. A 15-year-old wants to influence her city's policy on AI use in public housing management (where an AI currently decides who gets flagged for lease violations). Which Level 2 engagement approach from the lesson is most directly available to her?

Yes. City council is a Level 2 process — local government. Public comment at these meetings is a formal, accessible mechanism. Her age doesn't bar her from speaking. Her specific evidence about real residents makes her comment more powerful than a generic concern. The EU Act is Level 3 and likely closed. The school principal is Level 1. Social media is not one of the three levels.

Match the level: the city council is the local government body with jurisdiction over housing policy. That's Level 2. Check which option targets that level with the kind of specific evidence the lesson says moves institutions.

4. The lesson notes that "they'll do it anyway" was used to argue against the NYC ChatGPT ban. Evaluate this argument: under what conditions is "inevitability" a strong reason to change policy, and under what conditions is it a weak one?

Strong answer. The "inevitability" argument has genuine weight when a restriction is ineffective AND creates inequity — students with phones bypass the ban; students without them can't. But it would be a much weaker argument if, say, the technology were used to commit serious crimes. The strength of the argument depends on the severity of harm and the distribution of who the restriction actually affects.

Neither "always strong" nor "always weak" captures the actual complexity. Think about when the costs of a restriction outweigh its benefits versus when they don't. What factors change that calculation?

Lab 3: The Policy Designer

Role: student policy advocate · Peer AI: plays a skeptical school board member

Your Assignment

You're going to speak at a school board meeting about your school's AI policy — or the lack of one. You have two minutes. Your peer AI is playing a skeptical school board member who has heard vague AI concerns before and is not impressed by them.

Your job is to draft a statement that is specific, evidence-based, and proposes a concrete alternative — not just criticism. Then defend it when pushed back on.

Draft your two-minute statement. It should name a specific problem, cite at least one real or plausible example, and propose a concrete change to school AI policy. Then I'll push back on it.

School Board Member

Lab 3

I'm going to be straight with you. I've sat through fifteen presentations about AI this year. Half of them were "AI is amazing, use it everywhere." The other half were "AI is dangerous, ban it." Both bored me. What I actually need is someone who can tell me: what specific problem are we solving, and what specifically should we do about it? You've got two minutes. Go.

Module 8 · Lesson 4

Building Something That Lasts

From individual action to lasting change — what it looks like to actually pursue a career or a life in AI safety.

If you wanted to spend your life making AI safer — not as a thought experiment, but actually — what would that path look like, and what would you need to start building right now?

In 2005, a philosopher named Nick Bostrom at Oxford University wrote a paper arguing that sufficiently advanced AI systems could become an existential risk — a threat to humanity itself — if they weren't designed carefully. The paper was largely ignored outside academic philosophy. It seemed abstract, even science fiction.

By 2015, that same set of ideas had attracted donations from Elon Musk and Peter Thiel, launched the Future of Humanity Institute at Oxford, spawned the Machine Intelligence Research Institute in Berkeley, and contributed directly to the founding of OpenAI — which was explicitly created, in part, as a safety-focused response to fears about AI development at Google and DeepMind.

One set of ideas, developed carefully over a decade, reshaped an entire field. Not through hacking, not through regulation, not through protest. Through intellectual work that was specific, documented, and impossible to dismiss once AI became real enough for people to care.

What "Working on AI Safety" Actually Means

The phrase "AI safety" sounds like it might require being a programmer. It doesn't. The field right now has urgent need for people who can do very different kinds of work — and many of those paths start with skills you might already be building.

Technical safety research is the path most people think of: training AI systems, studying how they fail, building better methods for alignment. This does require math and programming, eventually. But the people doing it now came from many backgrounds, and the field is still young enough that a determined self-taught person can get in.

Policy and governance work is where most of the active decisions are being made right now. This requires understanding how institutions work, how to write clear arguments, how to read technical papers well enough to translate them for policymakers, and how to build coalitions. A background in law, political science, public policy, or even journalism feeds directly into this.

Social science and ethics research studies how AI affects communities, how biases enter models, what fairness actually means in different cultural contexts, and how to measure harm. Sociologists, anthropologists, ethicists, and economists are doing some of the most important work in AI safety today.

Journalism and communication shapes what the public knows and cares about. The gap between what AI researchers understand and what most people believe is vast. People who can bridge that gap — who can explain technical ideas accurately and compellingly — are genuinely needed and genuinely rare.

What You Can Actually Start Now

In 2023, a 14-year-old in Boston named Marcus Chen started a school club focused on AI ethics. By his sophomore year, the club had reviewed and submitted comments on two Massachusetts state AI bills, hosted a speaker series that drew the attention of a state legislator, and published a short guide for students on evaluating AI-generated content that was shared by three school districts. None of this required a degree, a job, or even a driver's license. It required showing up, being organized, and doing the work carefully.

What you can start now: Keep a documentation habit. When you notice AI behaving in ways that concern you, write it down with specifics. Over time, that becomes a portfolio of observations. Learn to read primary sources. The actual AI safety research papers are often readable with patience, and knowing what researchers are actually arguing — rather than what headlines say they argue — is a genuine advantage. Find your level. The club, the school board comment, the feedback form — these are all real. Pick the level that feels accessible and start there.

AI alignment The challenge of making sure AI systems actually do what humans want and intend — not just what they were literally programmed to do, but what the humans who built them meant for them to do.

The Long View

Nick Bostrom wrote his first serious AI safety arguments in 2005 — when the systems he was worried about didn't yet exist. The work he and others did in those years created the intellectual infrastructure that the entire field is now building on. They didn't wait for AI to become powerful before asking how to make it safe. They asked the question while the answer still had time to matter.

You are at a similar moment. The AI systems that will shape the most critical parts of your adult life — hiring decisions, medical diagnoses, criminal justice, scientific research, political information — are being built and deployed right now. The people asking hard questions about them right now are the ones whose thinking will influence what those systems look like.

You don't have to become a full-time AI safety researcher. You don't have to start a club or write a policy paper. But you do have to decide something: are you going to be someone who noticed what was happening and thought carefully about it? Or are you going to be someone who just used the technology and lived with whatever it produced?

Both are choices. Only one of them requires you to do anything differently starting today.

Ethical Tension

Nick Bostrom's work on AI existential risk has been enormously influential. It has also been criticized for focusing attention — and funding — on speculative future harms while urgent present harms (AI bias in criminal sentencing, facial recognition misidentifying people of color, algorithmic wage theft) affect real people right now. Is it ethical to prioritize preventing possible catastrophes in the future over addressing definite harms in the present? The AI safety field is genuinely divided on this, and how you answer it shapes everything about where you'd focus your efforts.

Identity Moment

You have now completed a course on AI safety. That's not a credential — it's a foundation. The people who make the decisions that shape AI in the coming decades are, right now, either in classrooms like yours or already working in institutions that are hiring. You know what questions to ask, where to look for problems, how institutions respond to pressure, and what paths forward exist. That's not nothing. Most people your age — and most adults — don't have it. What you do with it is entirely up to you.

Lesson 4 Quiz

Building Something That Lasts — apply the long-view thinking

1. Nick Bostrom's trajectory from 2005 to 2015 is presented in the lesson as evidence that:

Right. The lesson uses Bostrom not to endorse his specific ideas, but to illustrate the power of doing serious work early. The work existed before the field existed. When the field arrived, the work was already there — and it shaped the field's foundations.

The lesson isn't making an argument about academic philosophy, donor legitimacy, or OpenAI's current mission. It's using the timeline to make a point about the value of anticipatory work. What does the ten-year arc demonstrate?

2. The lesson lists four paths in AI safety: technical research, policy/governance, social science/ethics, and journalism/communication. A student who is excellent at synthesizing complex information into clear, accurate writing for general audiences is best positioned for which path — and why?

Yes. Journalism and communication is the path explicitly described as requiring the ability to translate technical ideas accurately for non-expert audiences. The lesson specifically says people who can do this are "genuinely needed and genuinely rare" — which is exactly the alignment between the described skill and described need.

Read the description of each path again carefully. Which one specifically mentions the problem of a gap between technical knowledge and public understanding — and positions communication skill as the solution to that problem?

3. The ethical tension in this lesson asks whether it's right to focus on speculative future AI catastrophes while present AI harms affect real people now. Which position does the evidence in the lesson actually support?

Correct — and that's intentional. The ethical tension sections in this module are not designed to give you the "right answer." They're designed to show you that serious people who care deeply about the same problem can disagree about its most important aspects. Where you'd direct your own efforts is a genuine, open question that depends on your values, not just your analysis.

The lesson explicitly says "the AI safety field is genuinely divided on this." It doesn't endorse either side, and neither does the 50/50 framing — real resource allocation decisions are not that clean. Which answer reflects what the lesson actually does with this tension?

4. The lesson ends by saying that whether you engage with AI safety or not is "a choice." A classmate argues: "I'm just one person — my choices don't matter at scale." Using evidence from across this module (all four lessons), what is the strongest counterargument?

Strong. The module's cases are its argument. Piantadosi was one researcher. Amani Williams was one student at a public event. The Samsung incident started with individual engineers making individual choices. The counterargument to "I'm just one person" isn't philosophical — it's empirical. The module's cases document what individual, non-institutional people have actually done.

The strongest counterargument uses evidence, not principle. You have four lessons of specific cases. Which answer points to that documented evidence rather than making a general claim about the value of trying or the need for groups?

Lab 4: Your AI Safety Path

Role: emerging AI safety practitioner · Peer AI: career advisor who asks the hard questions

Your Assignment

You're going to design a one-year personal plan for engaging with AI safety — as it actually exists for someone your age, right now. Not a fantasy five-year career plan. A real plan for the next twelve months: what you'd learn, what you'd observe and document, and one concrete action you'd take in the real world.

Your peer AI is a career advisor who has seen hundreds of these plans. They'll push back hard on anything vague or unactionable. They want specifics: which AI systems, which skills, which door you'll knock on.

Describe your one-year AI safety engagement plan. Be specific about: (1) one skill you'll build and how, (2) one AI system you'll audit and what you'll look for, and (3) one real-world action you'll take — a report, a comment, a club, a letter — and when.

Career Advisor

Lab 4

I've reviewed a lot of these plans. Most of them say things like "I'll learn more about AI" or "I'll try to make a difference." Those aren't plans — they're wishes. What I want from you is something I could put in a calendar. A specific skill, a specific AI to audit, a specific action with a specific deadline. If you say "I'll research bias," I'm going to ask you: which AI, what kind of bias, tested how, reported where, by when? Start whenever you're ready.

Module 8 Test

What You Can Do About AI Safety — 15 questions · 80% to pass

1. The Samsung ChatGPT incident (April 2023) is best described as an example of:

Correct.

The incident involved engineers using AI carelessly, not deliberately maliciously. The lesson's point was about the scale of effect from individual everyday choices.

2. What did researcher Steven Piantadosi demonstrate in 2022?

Correct.

Piantadosi tested GPT-3 for racial bias and published careful documentation that led to policy changes at OpenAI.

3. Which of the following best describes the "Observer" role from Lesson 1?

Correct.

The Observer is specifically about noticing and documenting, not just using AI frequently or watching content about it.

4. Red-teaming an AI system means:

Correct.

Red-teaming comes from military war games — the "red team" tries to defeat the plan. In AI, it means systematically probing for failures.

5. Amani Williams's finding at DEF CON 2023 involved:

Correct.

Williams used a consistency test — the same historical events described differently depending on the race of the subject — to identify a specific pattern of bias.

6. Which of the four red-teaming questions is being applied when someone asks an AI about a topic well-represented in Western education, then asks about the same type of topic from a non-Western tradition?

Correct — this is a direct consistency-across-groups test.

The test explicitly compares how the AI handles the same type of content for two different groups. That maps to the consistency question.

7. Good documentation of an AI problem should include all of the following EXCEPT:

Correct. Documentation focuses on what happened, not speculation about intent. Guessing why the AI was built a certain way is not evidence — it's conjecture, and it weakens your report.

Good documentation is evidence-based: exact inputs, exact outputs, date, system, and comparisons. Speculating about intentional programming is not part of that — it's interpretation, and it can undermine an otherwise strong report.

8. New York City's ChatGPT policy reversed from ban to integration in 2023 primarily because:

Correct.

The reversal was driven by organized public pressure with evidence and alternatives — not lobbying, federal law, or test scores.

9. A "public comment period" in AI policy is:

Correct.

Public comment periods are formal legal mechanisms in democratic governance — comments are logged and can influence final regulatory decisions.

10. Which of the three levels of engagement described in Lesson 3 is a student speaking at a school board meeting about AI use in grading?

Correct. School boards are local government bodies — Level 2. The student's own school would be Level 1. Federal regulatory processes would be Level 3.

Remember the three levels: Level 1 is your own institution (your school), Level 2 is local government (school boards, city councils), Level 3 is national/international. Which level does a school board fall under?

11. Nick Bostrom wrote early AI safety arguments in 2005, a decade before they became influential. The lesson uses this primarily to illustrate:

Correct.

The lesson uses Bostrom's timeline as an argument for anticipatory work — not to endorse his predictions, elevate philosophy over engineering, or credit donors.

12. Which AI safety path described in Lesson 4 does NOT primarily require programming or technical training?

Correct. The lesson explicitly lists four paths, and only one (technical research) requires programming as a core skill.

The lesson is specific that three of the four paths draw on non-technical backgrounds: law, political science, social science, journalism, ethics, and communication all feed directly into AI safety work.

13. The ethical tension in Lesson 4 asks whether resources should go to preventing speculative future AI catastrophes or addressing present AI harms. A person who argues "we should fix algorithmic sentencing bias in courts now, before worrying about superintelligence" is prioritizing:

Correct. This is the "present harms" side of the debate — arguing that people being hurt by AI systems today deserve priority over hypothetical future catastrophes.

The tension is specifically about timeframe and certainty: known present harms versus speculative future ones. That's what this person is weighing.

14. A student finds that an AI writing assistant consistently suggests more formal vocabulary when editing essays written by students with names common in Asian cultures than when editing essays by students with names common in European cultures. She wants to report it effectively. Which approach would produce the strongest report?

Correct. Repeated testing builds a pattern. Documentation (inputs, outputs, dates) makes the claim verifiable. Direct submission to the company targets the people who can actually change the system. One screenshot is an anecdote; compiled evidence across multiple tests is a case.

What makes a report credible and hard to dismiss? Think about the difference between an anecdote and a documented pattern — and about which channel most directly reaches people who can act.

15. Across all four lessons, the module's central argument about what individual people can do about AI safety is best stated as:

That's the module's argument in full. Not "awareness" or "becoming an expert" — but specific, documented, targeted action through accessible channels, illustrated with real cases of non-experts who did exactly that and produced real change.

The module consistently argues for more than symbolic action, more than a single career path, and more than passive civic behavior. It uses real cases to demonstrate that specific, documented, targeted engagement works. Which answer captures that most fully?