Module 4 · Lesson 1

When Schools Tried to Ban Their Way Out

What happened when New York City blocked ChatGPT — and what it tells us about the difference between a policy and a rule.

Can you ban a tool that students already have in their pockets?

The school year had barely started after winter break when David Banks, Chancellor of New York City Public Schools — the largest school district in the United States, serving over one million students — issued a network-wide block. As of that first week of January 2023, ChatGPT was cut off from every school WiFi network and every school device in NYC's 1,600-plus schools. The decision was announced quietly, almost as if speed was the goal rather than deliberation.

The reasoning was straightforward: students might use it to cheat. The solution was equally straightforward: block access. Within days, education reporters were calling it the first major US school district ban on AI. Within weeks, critics — including many of the district's own teachers — were pointing out that the block did almost nothing. Students still had phones on cellular data. The tool was still reachable from any home network. The ban had blocked the school's WiFi; it had not blocked ChatGPT.

By May 2023, fewer than five months later, New York City reversed course entirely. The same administration that issued the ban announced a new initiative: a pilot program to explore how AI could be used responsibly in classrooms. Chancellor Banks told reporters the initial ban had been "wrong" and "the knee-jerk reaction." The city that had moved fastest to block AI was now moving to teach students how to use it.

What a Ban Actually Does (And Doesn't Do)

Here's the first thing you need to understand about the NYC story: the ban was not irrational. The people who made the decision weren't foolish. They were responding to a real problem — students submitting AI-generated work as their own — with the fastest tool they had available, which was a network filter. It's the same logic a school uses when it blocks gaming sites during class hours. Block the distraction, restore focus.

The difference is that a gaming site and a general-purpose AI tool are not the same kind of thing. You can block a gaming site and students lose access to that game. You block ChatGPT on the school network and students switch to their phone's data connection in about four seconds. The tool is too accessible, too embedded in ordinary life outside school walls, for a network block to meaningfully restrict it. What the block actually communicated was not "you may not use AI" — it communicated "we, your school, are not going to help you navigate this."

That gap — between what a rule signals and what it actually achieves — is the central design problem of any classroom AI policy. A rule that cannot be enforced is not a policy. It is a statement about what the administration wishes were true.

Policy A set of principles and procedures that guide behavior over time, across different situations. A policy explains why certain choices are made, not just what is prohibited.

Rule A specific instruction about what to do or not do. Rules are easier to state but harder to enforce when the thing being ruled is everywhere.

The New York City situation sits at the exact boundary between these two things. The January decision was a rule — "block it." The May reversal was the beginning of a policy — "figure out how to integrate it responsibly." The first was fast and felt decisive. The second was slower and required actually thinking through what responsible use means in a classroom.

The District Isn't the Only Player

While NYC was reversing its ban, something else was happening across the country that didn't make as many headlines. Individual teachers — without waiting for district guidance — were already building their own classroom-level approaches. Some were banning AI use outright in their specific assignments, redesigning assessments so they required things AI genuinely couldn't produce: in-class handwritten responses, oral defenses of written work, documented revision processes. Others were doing the opposite: explicitly building AI into assignments, requiring students to submit both their AI-assisted draft and a written reflection on what the AI got wrong or missed.

Neither of these teacher-level decisions appeared on any official policy document. They were responses to a gap — the school or district hadn't decided what to do, so individual teachers made judgment calls. This is normal in education. Teachers constantly make micro-policy decisions in their classrooms. The difference in 2023 was that those decisions now had major implications for whether students were developing skills or bypassing them, and whether a student's work could be fairly graded against a peer's work made with different tools.

The Equity Problem

If one teacher bans AI use and another allows it, students in different classrooms are developing different skills and learning different norms — even within the same school. A policy that exists only at the classroom level creates a lottery: your relationship with AI tools depends on which teacher's room you end up in.

This is why designing AI policy is a genuinely hard problem. It's not hard because the technology is complicated. It's hard because the decisions interact with fairness, with what counts as authentic learning, with what it means to develop a skill, and with who gets to decide all of the above. Those questions don't have clean answers, and anyone who tells you they do is probably selling something.

What Makes a Policy "AI-Safe"?

Before you can design something, you have to know what problem you're solving. The phrase "AI-safe classroom" sounds like it means "a classroom where AI is blocked." But after the NYC story, you should be skeptical of that framing. Blocking AI doesn't make a classroom safe from AI's effects — it just makes the school invisible to how students are actually working.

A better definition: an AI-safe classroom is one where students know what AI can and can't do, understand why certain uses undermine their own development, and have clear norms that make it possible for honest work to be recognized as honest. That's a lot more complex than a network filter. It requires students to understand something, not just comply with something.

You Now See What Most Adults Miss

Most of the people writing AI policies for schools right now are focused on detection — how do we catch students using AI? You can now see why detection is the wrong frame. Detection is reactive. A real policy is proactive: it defines what honest work looks like before the assignment starts, not after suspicion arises. Knowing this puts you ahead of most of the current policy debate.

Over the next three lessons, you're going to build that understanding piece by piece. We'll look at what "academic honesty" actually means in an AI era — because the traditional definition doesn't quite fit anymore. We'll examine how other institutions (universities, law firms, newsrooms) have tried to write AI policies, and what worked versus what fell apart. And in Lesson 4, you'll design a complete policy framework of your own.

The ethical question you should be sitting with as you move forward: If a student uses AI to help organize their ideas but writes every word themselves, have they cheated? What if the AI suggested which ideas to include? What if it wrote the transitions? At what point does help become replacement — and does that point move depending on the subject, the grade level, or the purpose of the assignment? There is no universal answer. But the policy you design will have to pick a position.

Lesson 1 · Quiz

When Schools Tried to Ban Their Way Out

5 questions — apply what you read to new situations.

1. New York City reversed its ChatGPT ban in May 2023. What most directly showed that the ban had failed to achieve its goal?

Correct. The block only covered school WiFi and devices — students used phones on cellular to access the same tool, meaning the ban's technical mechanism could not reach the actual problem.

Not quite. The key issue was a technical one: the ban's mechanism (network filtering) couldn't stop access through cellular data or home networks. Review the "What a Ban Actually Does" section.

2. A school announces: "Students must not use AI for any school assignments." According to the lesson, what distinguishes this from an actual policy?

Correct. A rule states what is prohibited. A policy explains the reasoning and has realistic enforcement mechanisms. A blanket prohibition on a tool students carry in their pockets every day fails on both counts.

That's not the distinction the lesson draws. Think about the difference between a "rule" and a "policy" — one states a prohibition, the other explains reasoning and has realistic enforcement. Which does this announcement do?

3. A student in Ms. Chen's class is allowed to use AI for brainstorming. A student in Mr. Patel's class, down the hall, is not allowed to use AI at all. What equity problem does this create?

Correct. When individual teachers make AI decisions independently, a student's relationship with these tools — and the skills they develop — depends on classroom assignment, not any fair or deliberate policy.

The equity concern here isn't about grades or strictness — it's about consistency. Students in different rooms are developing different skills and operating under different norms. What does that mean for fairness?

4. A school redesigns its writing assignments to require in-class handwritten first drafts, followed by a revision session students must explain aloud. Which problem is this approach solving that a network ban cannot?

Correct. Assessment redesign targets the underlying problem: can the student actually do the thinking? A network ban targets the tool. The redesign targets the demonstration of learning itself.

Think about what the assignment redesign is actually measuring. It's not about internet access — it's about whether the student can demonstrate their own understanding in a format AI can't complete for them.

5. The lesson defines an "AI-safe classroom" differently from how most adults define it. Which of these best matches the lesson's definition?

Correct. The lesson argues that "AI-safe" must mean students understand the stakes, not just that tools are blocked or monitored. Understanding produces lasting behavior; blocking produces workarounds.

The lesson explicitly challenges the idea that "AI-safe" means "AI-blocked." Review the final section of the lesson for its specific definition, which is centered on student understanding, not detection or restriction.

Lesson 1 · Lab

The Policy Auditor

You're reviewing a real school's draft AI policy. Your job isn't to fix it — it's to expose its assumptions.

Your Role: Policy Auditor

Your school's principal has shared a draft AI policy and asked you to poke holes in it before it goes to the school board. You've been brought in because you've studied how the NYC ban played out. The AI assistant here is a fellow student auditor — skeptical, direct, and not going to accept vague answers.

The draft policy reads: "Students may not use AI-generated text in any submitted schoolwork. Violations will be treated as academic dishonesty."

Your opening move: Tell the auditor what the single biggest flaw in this draft policy is — and back up your reasoning with something specific from the NYC case.

Policy Auditor AI

Lab 1

Alright. I've read the draft. "Students may not use AI-generated text in any submitted schoolwork." Before we tear it apart — what's the first flaw that jumps out at you? Give me something concrete, not just "it's too strict."

Module 4 · Lesson 2

What "Academic Honesty" Means Now

The old definition was written for a world where the only thing that could write your essay was you. That world is gone.

If a tool does the work, but you understand the result — is that cheating?

In the spring of 2023, a high school student in the Boston suburb of Wellesley, Massachusetts submitted a five-paragraph essay for an AP English class. The teacher ran it through Turnitin's AI detection feature, which had been released just weeks earlier in February 2023. The detection software flagged the essay as 94% likely to have been AI-generated. The teacher reported it as academic dishonesty. The student was called into the principal's office.

The student insisted the essay was theirs. They had used ChatGPT, they said — but only to help them organize their outline. Every sentence in the submitted draft had been written by the student themselves. The AI had helped them figure out what to say, but not how to say it. The teacher said the detection score spoke for itself. The student's parents hired a lawyer.

Cases like this one played out in variations across the country throughout 2023. By August of that year, Turnitin itself acknowledged that its AI detection tool had a false positive rate of roughly 4% — meaning that for every hundred essays it flagged as AI-written, four were almost certainly written entirely by humans. In a school of a thousand students, that's forty wrongly accused students per submission cycle. The company recommended that the tool be used as one signal among many, not as definitive proof. Entire school districts had already been using it as definitive proof.

The Definition Problem

Academic honesty policies at most schools were written decades ago, updated periodically, and designed around a specific threat: one student copying another student's work, or copying from a published source without citation. The concept at the center of those policies is called plagiarism — representing someone else's words or ideas as your own without attribution.

AI creates a genuinely new problem for that definition. When a student uses ChatGPT to write a paragraph, whose words are they copying? ChatGPT doesn't have an author. Its output isn't taken from a specific source that can be cited. It was synthesized from millions of sources simultaneously. Traditional plagiarism detectors, which work by matching text to a database of existing documents, often find nothing — because the AI output doesn't match any single document. The concept of plagiarism, as written, may simply not apply.

Plagiarism Representing someone else's specific words or ideas as your own without attribution. Traditionally requires an identifiable source being copied from.

AI Ghostwriting Having AI generate text that you then submit as your own work. Different from plagiarism in legal/technical structure but similar in educational effect — the submitted work does not represent the student's own thinking.

AI-Assisted Work Using AI as a tool during the work process — for brainstorming, feedback, editing — while the core intellectual effort remains the student's. The ethical status of this varies enormously by context.

These distinctions matter because they have to appear somewhere in a real policy. A policy that simply says "no AI" doesn't capture the difference between a student who had AI write their whole essay and a student who asked AI "does this paragraph make sense?" Those are not the same act. Any honest policy has to distinguish between them.

The Detection Trap

The Turnitin story reveals a second problem: detection technology is unreliable, and schools that treat detection scores as verdicts are creating a system where an algorithm can end a student's academic record. That's worth sitting with. Turnitin's own documentation says its detection tool is not designed to be used as evidence of wrongdoing. It is designed to flag work for human review. But when administrators or teachers under time pressure receive a 94% AI score, the temptation to treat that as a verdict is real.

Several researchers have documented that AI detection tools flag non-native English speakers at disproportionately high rates. Students who write in clear, structured prose — a skill many students spend years developing — are sometimes penalized by detection software that reads clarity as machine-like. This is not a minor edge case. It's a systematic bias embedded in a tool that schools are using to make consequential decisions.

Ethical Question — No Clean Answer

A school uses AI detection software that it knows has a 4% false positive rate. School leadership decides this is acceptable because it catches most cheaters. If 40 innocent students per semester are falsely accused, and the school knows this, is using the tool ethical? Does it matter how severe the consequences of a false accusation are? Does it matter if there's no better alternative?

This question doesn't have a clean answer, and you shouldn't expect one. But it has to be answered somehow — because every school that uses detection software has implicitly answered it, whether they've thought about it carefully or not.

Rebuilding the Definition

Several universities moved faster than high schools on this problem. MIT, in a policy guidance document released in August 2023, distinguished between AI use that undermines the learning objectives of an assignment and AI use that does not. The key question wasn't "did you use AI?" — it was "does your submitted work demonstrate your own mastery of the skills this assignment was designed to develop?"

This reframe is significant. It shifts the question from "what tools did you use?" to "what can you actually do?" An oral defense of written work answers that second question directly. A revision log showing how the student's thinking developed answers it. A written reflection on what the AI got wrong, submitted alongside the AI-assisted draft, answers it. These are not tricks to catch cheaters — they're pedagogical tools that make the student's thinking visible regardless of what tools they used.

The Insight You Now Carry

When you read a school's academic honesty policy now, you can immediately ask: was this written before 2022? Does it mention AI specifically? Does it distinguish between AI ghostwriting and AI assistance? Does it address detection reliability? Most policies you encounter will fail all four questions. You can see the gap that most adults haven't noticed yet — the definition of "honest work" needs to be rebuilt for an era when AI can produce text on demand.

The goal of an AI-era academic honesty policy isn't to catch cheaters after the fact. It's to design assignments and norms that make cheating less useful and honest work more visible. That's a design problem, not a policing problem. And design is what you're going to practice in the labs ahead.

Lesson 2 · Quiz

What "Academic Honesty" Means Now

5 questions — reason through new scenarios using what you've learned.

1. A Wellesley student's essay was flagged as 94% AI-generated by Turnitin in 2023. The student claimed they only used AI for outlining. Which fact from the lesson most directly challenges the school's ability to treat the detection score as proof of cheating?

Correct. The detection tool's own creators said it should not be used as a verdict — only as a signal for further human review. Using a 94% score as proof of cheating exceeds what the tool was designed to claim.

The lesson specifically addresses this with a documented fact about the tool itself. Review the "Detection Trap" section and think about what the tool's creators said about its reliability.

2. Traditional plagiarism policies require an identifiable source that was copied. Why does AI-generated text create a structural problem for these policies?

Correct. Plagiarism, as traditionally defined, requires copying from an identifiable source. AI doesn't have one — its output is synthesized. The old definition may not fit the new situation.

Think about how the lesson defines plagiarism — it requires an identifiable source. What is AI's "source"? That structural difference is the core of the problem.

3. MIT's 2023 policy guidance reframed the central question about AI use. Which option best captures that reframe?

Correct. MIT's reframe focuses on what the student can demonstrate, not what tools they used. This shifts the problem from policing to assessment design.

Review the "Rebuilding the Definition" section. MIT's guidance shifted focus away from the tools used and toward what the student can actually demonstrate about their own learning.

4. A student who learned English as a second language writes very clear, structured prose — a skill she worked on for years. An AI detection tool flags her essay as likely machine-generated. According to the lesson, what broader problem does this illustrate?

Correct. The lesson documents that detection tools flag non-native English speakers at higher rates — meaning the tool embeds a bias that can harm students who have worked hardest on their writing skills.

Think about what this case reveals about who gets disproportionately flagged. The lesson addresses this directly in the "Detection Trap" section as a systematic issue, not an individual one.

5. A student submits an essay along with a written reflection explaining what the AI suggested, what the AI got wrong, and what changes the student made. Which assessment purpose does this approach best serve?

Correct. A reflection on AI use requires the student to demonstrate critical thinking about the tool's output — which shows their own understanding, independent of how much text the AI generated.

Think about MIT's reframe: the goal is to make the student's thinking visible. How does a reflection accomplish that, even if AI helped write the original draft?

Lesson 2 · Lab

The Definition Rewriter

The school's old "academic honesty" definition doesn't cover AI. You're rewriting it — one clause at a time.

Your Role: Policy Language Designer

You've been handed a school's existing academic honesty statement: "Students shall not represent the work of others as their own. Violations include copying, plagiarism, and submitting work produced by other persons."

Your task: propose specific new language that addresses AI use — distinguishing AI ghostwriting from AI assistance — without making the policy so complicated that no one can follow it. The AI here will push back on vague phrasing and ask you to be more precise.

Start by proposing one new clause you would add to this policy. Be specific about what it allows, what it prohibits, and how a teacher would know which is which.

Policy Language AI

Lab 2

I've read the existing policy. It mentions "work produced by other persons" — but AI isn't a person. That's already a problem. Before you write your new clause, tell me: what's the single most important distinction your clause needs to capture? And how will a teacher actually apply it?

Module 4 · Lesson 3

How Other Institutions Tried — And What Broke

Universities, newsrooms, and law firms all wrote AI policies in 2023. Some held. Most are already being revised. Here's what the wreckage teaches.

What does it take for a policy to actually change behavior — not just prohibit it?

In the spring of 2023, Harvard Law School faced a problem it hadn't anticipated. Students were using AI tools to draft legal memos — foundational assignments that teach students how to construct legal arguments from scratch. The school's honor code prohibited submitting work "not your own," but legal memo writing had always existed in a gray zone: students could ask classmates to review their drafts, visit writing centers, and incorporate feedback from instructors. The question was whether using AI crossed a line that these other forms of help did not.

Harvard's initial response was a faculty committee. The committee spent most of the spring semester deliberating. Meanwhile, individual professors made individual calls: one banned AI from all course submissions; another required students to disclose any AI use in a footnote; a third designed a new assessment format — a timed in-person exercise — and stopped worrying about take-home assignments altogether. By the end of spring 2023, Harvard Law had a draft policy framework but no final policy. The professors had effectively each built their own.

Across town, the Boston Globe newsroom was navigating something similar. In March 2023, the Globe issued internal guidance that journalists could use AI for research and background summarization but not for drafting published text. Within weeks, reporters discovered that the line between "research summarization" and "drafting" was impossible to enforce in practice: if a journalist asked an AI to summarize five sources and then wrote from those summaries, was the resulting text AI-influenced or not? The guidance created a compliance gray zone that nobody knew how to navigate consistently.

Why "Gray Zone" Policies Fail

The Harvard and Globe cases share a structural problem: they drew a line where the activity on both sides of the line looks almost identical from the outside. "Using AI for research is okay; using AI for drafting is not" sounds meaningful. But the moment you try to apply it to real work, you discover that research and drafting blur together constantly. Writers research by drafting. They draft by synthesizing research. The categories weren't designed for a tool that can do both simultaneously.

Policies that draw lines in blurry places fail for a specific reason: they rely on every person making consistent judgment calls about where the line is, and judgment is precisely what varies most from person to person. When a policy requires consistent judgment that the policy itself hasn't defined, it essentially outsources the policy to individual interpretation — which is what you had before the policy existed.

Gray Zone Policy A policy that draws distinctions between permitted and prohibited behavior without providing clear criteria for telling them apart. These policies often produce inconsistent enforcement and create anxiety for people trying to comply in good faith.

Bright-Line Rule A rule with a clear, objective threshold — either you've crossed it or you haven't, with no ambiguity. Easier to enforce, but harder to write for complex situations where context matters.

Neither gray-zone policies nor pure bright-line rules solve the AI problem perfectly. A bright-line rule ("no AI text in any submission, ever") is enforceable in theory but ignores important distinctions between uses. A gray-zone policy ("use AI responsibly") acknowledges those distinctions but can't actually enforce them. The institutions that have had the most success are the ones that redesigned what they were measuring — rather than trying to regulate the tool.

What Actually Worked: Three Cases

Not everything broke. Three institutional responses from 2023 are worth examining because they produced policies that held up, at least initially.

Case 1: University of Michigan — The Transparency Framework

In August 2023, the University of Michigan released a faculty guide that organized AI policies into three clear tiers: AI-prohibited (the assignment is specifically designed to assess unaided human capability), AI-disclosed (AI use is allowed but must be documented and cited like any other source), and AI-integrated (AI is part of the assignment design; students are expected to use and evaluate it). The innovation was that faculty had to declare which tier applied at the start of each assignment — students weren't left guessing.

Case 2: The Associated Press — The Function-Based Rule

The AP, one of the world's major news agencies, issued guidance in August 2023 that avoided the "research vs. drafting" distinction that broke the Globe's policy. Instead, AP's guidance focused on function: AI could not be used to generate content that would be published under a journalist's byline. What the journalist did with AI in their own research process was their professional judgment, but the published output had to represent the journalist's own reporting and writing. This worked because it created a clear, verifiable endpoint — the published text — rather than trying to regulate an invisible process.

Case 3: A Rural Montana High School — The Conversation-First Approach

In September 2023, Corvallis High School in Corvallis, Montana (enrollment: approximately 350 students) piloted a different approach entirely. Rather than issuing a policy and enforcing it, teachers spent two weeks at the start of the school year running classroom discussions about what AI was, what it could do, and why certain uses would undermine students' own development. Students then co-created classroom agreements — not rules handed down from administrators, but norms the students articulated themselves. Teacher Rebecca Hanson reported that honor code violations dropped significantly and that students were more likely to ask openly when they weren't sure what was allowed.

The Pattern in What Worked

All three successful cases share something: they gave the person making the decision — the student, the journalist, the teacher — a clear way to know where they stood before they acted. Michigan's tiers were declared upfront. AP's bright-line was about the published endpoint. Montana's co-created norms were internalized, not just announced. Compare this to the policies that failed: they all required people to make ambiguous calls after the fact, under pressure, without clear criteria.

The Institutional vs. Classroom Scale

There's one more thing the Harvard and Michigan cases reveal when you put them side by side. Harvard Law School — with its enormous resources, its dedicated faculty committees, its long tradition of rigorous policy deliberation — still ended in a draft framework and no final policy after a full semester. A rural Montana high school with 350 students, no committee, and two weeks of classroom conversation produced something that worked. Scale and prestige did not predict effectiveness.

What predicted effectiveness was specificity of context. The Montana teachers knew their students. They could run a real conversation. They could iterate when something wasn't working. Harvard's committee was writing for a hypothetical average student, in hypothetical average conditions. The gap between policy-as-written and policy-as-lived is always largest when the people writing the policy are far from the people living under it.

What You Now Understand About Institutional Policy

When a school, company, or government body releases an AI policy, you can now read it with a specific set of questions: Does it draw clear lines or blurry ones? Does it regulate the process or the output? Did the people who will live under it have any voice in writing it? Most policies you'll encounter in the next few years will fail at least one of these. Knowing the failure modes in advance is what lets you design something better.

In Lesson 4, you're going to put all of this together. You'll design a full policy framework for a real-ish scenario. The question you should be holding now: If the people most affected by a policy have the most accurate information about what will actually work — but they have the least institutional power to write it — whose job is it to close that gap?

Lesson 3 · Quiz

How Other Institutions Tried — And What Broke

5 questions — apply the failure patterns to new scenarios.

1. Harvard Law School's spring 2023 AI policy effort ended without a final policy. According to the lesson, what most directly caused this outcome?

Correct. The committee's deliberation took so long that the vacuum was filled by individual faculty decisions — producing exactly the inconsistency a unified policy was meant to resolve.

The lesson describes a specific practical outcome: while the committee deliberated, individual professors acted independently. What does that mean for the purpose of having a unified policy?

2. The Boston Globe issued guidance that AI could be used for "research summarization" but not for "drafting." What specific problem caused this guidance to fail almost immediately?

Correct. The Globe's policy drew a line inside a process that doesn't actually have a clear dividing point — making consistent compliance impossible even for journalists acting in good faith.

Think about what a "gray zone policy" is. The Globe's distinction between research and drafting sounds clear but isn't. Why not? What do writers actually do when they research and when they draft?

3. The Associated Press chose to focus its AI guidance on the "published text" rather than the research process. What advantage did this approach have over the Boston Globe's guidance?

Correct. The AP's approach regulated the output — something observable — rather than the process — something invisible. This gave the policy a clear enforcement point the Globe's guidance lacked.

Compare what the two guidelines tried to regulate. The Globe tried to regulate a process. The AP regulated an outcome. Which is easier to verify consistently?

4. Corvallis High School's approach involved students co-creating classroom norms rather than receiving top-down rules. The lesson says this approach reduced honor code violations. Which explanation best fits the lesson's reasoning for why this worked?

Correct. The lesson's logic is about internalization vs. compliance. When people understand why a norm exists and participated in creating it, they apply it from conviction rather than from fear of getting caught.

The lesson's explanation isn't about access or surveillance — it's about how norms get internalized. What's the difference between following a rule because you'll be caught and following it because you understand why it exists?

5. A large tech company issues an AI policy for its 10,000 employees, written entirely by the legal and HR departments without consulting any of the workers who will use AI daily. Based on the lesson's pattern of what works vs. what fails, what is the most likely outcome?

Correct. The lesson's pattern: policies written far from the people who live under them tend to draw distinctions that don't survive contact with actual work. The gap between policy-as-written and policy-as-lived grows with institutional distance.

The lesson directly addresses this: "The gap between policy-as-written and policy-as-lived is always largest when the people writing the policy are far from the people living under it." What does that predict for this scenario?

Lesson 3 · Lab

The Policy Failure Analyst

You've been handed three real policy texts. One will work. Two have hidden failure modes. Find them.

Your Role: Policy Failure Analyst

A school board is choosing between three draft AI policies for their district. They've asked you to identify which policies have hidden failure modes before the vote. Study each one, then tell the AI which you think will fail and why. The AI will challenge your reasoning — and may not agree.

Policy A: "AI use in schoolwork must be disclosed in a footnote."

Policy B: "AI may not be used on any assignment designated as an AI-prohibited assessment by the teacher at the start of the unit."

Policy C: "Students should use AI responsibly and in ways that support their learning."

Pick the policy you think is most likely to fail in practice. Explain the specific failure mode — not just "it's too vague," but how and when it will break down for a real student or teacher.

Policy Analyst AI

Lab 3

I've read all three. Before you tell me which fails, I want to know: what's your test for failure? What would have to happen in a real school for you to say "this policy broke"? Define that first, then apply it to whichever policy you think is most vulnerable.

Module 4 · Lesson 4

Design Your Own AI-Safe Classroom Policy

You now know the failure modes. You've read the cases. This is the lesson where you build something.

If you were the teacher, what would your policy say — and how would you know if it was working?

In February 2024, Auckland Grammar School in New Zealand took an unusual step. Rather than issuing an AI policy from the top down, the school's senior leadership team — Principal Timothy O'Connor among them — convened a working group that included students alongside teachers. The students weren't there to rubber-stamp an already-written policy. They were given actual drafting authority over several clauses. They were asked: what AI uses do you think are genuinely helpful to your learning? Which ones do you feel are replacing learning you should be doing yourself? What would it take for you to trust that a policy was fair?

The resulting document — released in March 2024 — was unusual in several ways. It specified, by subject area, which AI uses were permitted in each kind of assessment. It included a student "right to ask" — any student could ask their teacher before an assignment whether a specific AI use was permitted, and the teacher was required to give a written answer. And it contained a built-in review clause: the policy would be revisited every six months because the technology was changing fast enough that a policy written today might be irrelevant by the end of the year.

Was it perfect? No. Some teachers found the subject-by-subject specificity burdensome. Some students felt the "right to ask" created awkward dynamics when peers assumed asking meant you planned to cheat. But the school had built something that its community understood, had partly authored, and was designed to evolve. The policy's biggest success was not that it got every decision right — it was that it gave everyone a common language for the conversation.

The Five Elements of a Durable Policy

The Auckland example — along with everything you've read in the previous three lessons — points to five elements that appear in every AI policy that has actually held up over time. A policy that's missing any of these will usually develop a specific, predictable problem.

Element 1: A Clear Statement of Purpose

Not "AI is prohibited" or "AI is allowed." The purpose statement answers: what is this policy trying to protect? Usually that's something like: the development of specific skills, the fairness of assessment, or the integrity of feedback loops between teacher and student. Without this, every specific rule is disconnected from its reason — and rules without reasons get followed resentfully and abandoned easily.

Element 2: Tiered Permissions by Context

Different assignments have different purposes. A timed in-class writing assessment is measuring something different from a month-long research project. A policy that treats all assignments the same will either be too restrictive for projects or too permissive for assessments. The University of Michigan's three-tier system (AI-prohibited / AI-disclosed / AI-integrated) works because it maps to the actual diversity of academic work.

Element 3: Observable Enforcement Points

Whatever the policy prohibits or requires must be checkable at a specific, visible moment — not buried in an invisible process. The AP's "published byline text" is an observable enforcement point. A required disclosure footnote is observable. "Using AI responsibly" is not observable at any specific moment. Design your enforcement around what can actually be seen.

Element 4: A Conversation Mechanism

Auckland's "right to ask" was not just a courtesy — it was a signal that ambiguity was expected and that the proper response to ambiguity is a conversation, not a violation. Policies that assume every situation is already covered by the written rules create a culture where students who are genuinely unsure choose not to ask — because asking feels like admission of intent. A formal mechanism for clarifying gray zones reduces this pressure.

Element 5: A Built-In Review Cycle

AI capabilities are changing every few months. A policy written in September 2023 was partially obsolete by March 2024. Any AI policy written without a review date is building in its own obsolescence. Six months is a reasonable review interval. Annual review is the bare minimum. A policy that cannot be updated is not a living document — it's a historical artifact waiting to become irrelevant.

What Your Policy Has to Choose

Designing a policy means making genuine choices where reasonable people disagree. There is no technically correct answer to these questions — but every policy has to answer them, either explicitly or by default.

Choice 1: Detection or Trust?

Will your policy rely on AI detection software? If so, how will you handle false positives? If not, what establishes that submitted work is authentic? You cannot fully avoid this choice — even a "no detection" policy is choosing trust as the default mechanism, with all the vulnerabilities that implies.

Choice 2: Restrict the Tool or Redesign the Task?

Some AI uses can be blocked at the assignment level by changing what's being assessed. Others can't. A blanket prohibition requires enforcement; an assessment redesign builds authenticity into the task itself. Most effective policies use some combination, but you have to decide the ratio.

Choice 3: Who Has Voice?

Will students participate in creating the norms? Research from the Montana case and Auckland case suggests participation increases compliance and reduces resentment. But it also takes time, creates negotiating complexity, and requires teachers to share some authorship of classroom norms. That trade-off is real.

The Position You Now Hold

You have now read more carefully about AI policy design than most administrators who are currently writing these policies. You understand why bans fail, why detection is unreliable, why gray-zone policies produce inconsistency, and what five elements make a policy durable. When you encounter a school AI policy — your own school's, a college you're considering, a workplace in the future — you can evaluate it with the same framework you'd bring to any other institutional design question. That's not a small thing. Most people never learn to read policy this way.

The final ethical question this module leaves open: A student who uses AI to compensate for a documented learning disability — getting help with organization, sentence structure, or expression of ideas they genuinely have — is using AI very differently from a student who simply doesn't want to do the work. Should a single classroom policy treat these two uses the same way? If not, how does a teacher know which is which — and what happens when they guess wrong? There is no clean answer. But the policy you design in the lab will have to take a position.

Lesson 4 · Quiz

Design Your Own AI-Safe Classroom Policy

5 questions — evaluate policy designs using the five-element framework.

1. Auckland Grammar School's March 2024 policy included a built-in six-month review clause. Which of the five elements does this most directly address?

Correct. The review clause directly implements Element 5 — the recognition that AI capabilities change fast enough that any policy written today may need significant revision within months.

Think about which element specifically addresses the problem of a policy becoming outdated as AI technology changes. Which of the five was designed for exactly that scenario?

2. A school issues an AI policy with this statement of purpose: "This policy ensures that AI is used fairly." According to Element 1, what is missing from this statement?

Correct. Element 1 requires a purpose statement that answers "what is this policy protecting?" — naming the specific educational value at stake. "Fairness" is too abstract to anchor specific rules to a clear reason.

Review Element 1. The key question a purpose statement must answer is: what specific thing is this policy trying to protect? "Fairly" doesn't name the thing being protected — it describes a desired quality of the policy itself.

3. A teacher designs a policy rule: "If you're unsure whether an AI use is permitted, don't use it." According to the lesson, which of the five elements does this rule fail to provide?

Correct. "When in doubt, don't" removes the path of seeking clarification. Element 4 says that ambiguity should lead to conversation, not avoidance — because avoiding AI entirely when unsure may itself disadvantage students compared to those who are simply more willing to risk a wrong call.

Think about what this rule does when a student genuinely doesn't know what's allowed. It closes off asking. Which of the five elements is specifically about keeping that clarification pathway open?

4. Two teachers are discussing AI policy. Teacher A says: "I'll detect violations after submission." Teacher B says: "I'll redesign the assignment so that AI can't do the key thinking for students." Which of the three core "choices" does this debate represent?

Correct. Teacher A is regulating the tool after the fact (restriction + detection). Teacher B is embedding the authentication of learning into the task design itself (redesign). This is exactly the Restrict vs. Redesign choice the lesson describes.

The question is about how each teacher plans to handle the possibility of AI-assisted work. One reacts after submission; the other builds around it. Which of the three policy choices does that distinction match?

5. A student uses AI to help organize their essay because they have dyslexia and struggle with sequencing ideas — a documented difficulty. The school's AI policy prohibits AI use in essay planning. According to the lesson's final ethical question, what does this scenario reveal about single-policy approaches?

Correct. The lesson surfaces this as a genuine ethical tension without a clean answer. Uniform enforcement of a single rule treats different situations identically — which isn't always fair, even when it's consistent.

The lesson's final ethical question doesn't have a clean answer — and that's the point. Think about what this scenario reveals about the limitation of any policy that applies the same rule to every student regardless of context. What is a "fair" outcome here?

Lesson 4 · Lab

The Policy Designer

You're building a complete AI policy for a real school scenario. Not a wish list — a working document.

Your Role: Policy Designer

You're designing an AI policy for Ridgeline Middle School — 600 students, grades 6–8, mix of urban and suburban families, one-to-one laptop program, teachers with varying comfort levels around technology. The school board wants a policy ready for the start of next semester. You have the five elements. The AI here is your skeptical co-designer — it will stress-test everything you propose.

Your goal is to produce a policy framework covering at least three of the five elements. You don't have to solve every problem — you have to make honest choices and be able to defend them.

Start with Element 1: write your purpose statement for Ridgeline Middle School's AI policy. What specifically is this policy trying to protect — and why does that matter for these particular students?

Policy Co-Designer AI

Lab 4

Alright, Ridgeline Middle School. 600 kids, grades 6–8, one-to-one laptops. Before I react to your purpose statement — quick challenge: who is this policy for? Because "students shall not..." language protects the school. "This policy exists to ensure students develop..." language protects the students. Those are different policies with different consequences. Keep that in mind when you write your purpose statement. Go ahead.

Module 4 · Final Assessment

Module Test — AI-Safe Classroom Policy

15 questions across all four lessons. Score 80% or higher to pass. Apply reasoning — not just recall.

1. New York City reversed its ChatGPT ban in May 2023. The reversal was described by Chancellor Banks as a "knee-jerk reaction" — referring to the original ban. Which of these best captures why the ban failed on its own terms?

Correct. The ban's mechanism — a network filter — couldn't control access through devices or connections outside school control.

Think about what a network filter actually blocks. Could it stop a student using their own phone on cellular data?

2. A school announces: "Students caught using AI will face academic consequences." Which statement correctly distinguishes this from an actual policy?

Correct. Rules state consequences. Policies explain reasoning and define the behavior they're regulating. This announcement does neither for the critical question of what "using AI" means.

A policy and a rule are different things. Which one explains the reasoning behind the prohibition and gives people criteria to know where the line is?

3. In 2023, AI detection tools were documented to flag non-native English speakers at disproportionately high rates. What does this reveal about using AI detection as a primary enforcement mechanism?

Correct. A 4% false positive rate sounds small but produces systematic harm at scale — and the burden falls disproportionately on non-native speakers who write clearly and structurally.

Think about what a "systematic bias" means at scale. Even a small percentage of false positives becomes many real students harmed when applied across hundreds of submissions — and the harm isn't random.

4. MIT's 2023 policy guidance shifted the central question from "did you use AI?" to "does your work demonstrate mastery of the skills this assignment was designed to develop?" Why is this reframe significant for classroom policy design?

Correct. Regulating process (invisible, hard to verify) vs. measuring outcome (observable, connected to learning) is the core design difference that makes MIT's approach more robust.

Think about what can and can't be observed. The process of using AI is invisible. But whether a student can demonstrate understanding in a timed, in-person, or oral format is directly observable. Which is easier to build a fair policy around?

5. The Boston Globe's March 2023 internal AI guidance allowed AI for "research summarization" but not "drafting." What type of policy failure does this represent?

Correct. Research and drafting are not two distinct phases in real writing — they happen simultaneously and recursively. Trying to draw a policy line between them creates a gray zone that nobody can navigate consistently.

Think about what a "gray zone policy" is — one that draws a line where the activity on both sides looks the same from the outside. Does the research/drafting distinction have a clear dividing line in real writing practice?

6. The Associated Press chose to focus its AI guidance on the "published bylined text" rather than the journalist's research process. According to the pattern of what makes policies work, why did this approach have a structural advantage?

Correct. Observable enforcement points (Element 3) are critical to durable policy — and the published text is observable in a way that the research process is not.

Compare what can be checked: the research process (invisible, internal) vs. the published text (visible, external). Which gives you an actual enforcement point?

7. Corvallis High School in Montana reported reduced honor code violations after students co-created classroom AI norms. What mechanism best explains this outcome?

Correct. The distinction is between compliance (following a rule because of consequences) and internalization (following a norm because you understand and accept its purpose). The latter is more durable.

The lesson's explanation focuses on how norms get internalized, not on access or surveillance. What's the difference between following a rule because you'll be caught vs. following it because you participated in creating it?

8. Harvard Law School's spring 2023 AI policy effort produced a draft framework but no final policy after a full semester. Meanwhile, a rural Montana high school with 350 students produced a working set of norms in two weeks. What does this contrast reveal?

Correct. The lesson's explicit conclusion: "What predicted effectiveness was specificity of context." Closeness to the situation, not resources or prestige, produced the working result.

The lesson draws a specific conclusion from this contrast. It's not about size or prestige — it's about a specific quality that predicted which approach worked. What did the Montana teachers have that Harvard's committee didn't?

9. Auckland Grammar School's March 2024 policy included a student "right to ask" — any student could ask their teacher before an assignment whether a specific AI use was permitted, and the teacher was required to give a written response. Which of the five policy elements does this most directly implement?

Correct. Element 4 is specifically about providing a formal pathway for clarifying ambiguity — which is exactly what the "right to ask" provision creates.

What does the "right to ask" provision accomplish? It creates a structured way to handle situations the written policy doesn't clearly cover. Which of the five elements is about exactly that — handling ambiguity through conversation rather than assumption?

10. A school writes a comprehensive 12-page AI policy with detailed guidance for every subject — but publishes it in September 2024 with no review date. According to the five elements, what failure is built into this policy from the start?

Correct. Element 5 exists precisely because AI capabilities change every few months. A policy with no review date is a policy that will become a historical artifact, describing a technology that no longer exists in the form the policy imagined.

Think about why Element 5 — a built-in review cycle — is on the list at all. What happens to a policy about a fast-changing technology if it has no mechanism for updating itself?

11. The lesson argues that traditional plagiarism policies may not cleanly apply to AI-generated text. Which specific structural difference makes AI output different from traditional plagiarism?

Correct. The structural mismatch — plagiarism requires an identifiable source; AI has none — means the existing legal and policy concept may not fit the new situation, requiring new language.

Think about the definition of plagiarism: copying from an identifiable source. What is AI's "source"? The answer to that question reveals the structural problem.

12. A student who uses AI to overcome a documented processing disorder is using the tool very differently from a student who uses it to avoid effort. What challenge does this create for uniform policy enforcement?

Correct. Uniform enforcement of identical rules is procedurally consistent — but procedural consistency and actual fairness are not the same thing when the situations are meaningfully different.

The lesson surfaces this as an unresolved tension. Think about the difference between consistent enforcement (same rule for everyone) and fair outcomes (accounting for different situations). Can a single rule achieve both simultaneously?

13. University of Michigan's three-tier system (AI-prohibited / AI-disclosed / AI-integrated) requires faculty to declare which tier applies at the start of each assignment. Which failure mode from earlier lessons does this declaration requirement most directly address?

Correct. The declaration requirement is a pre-assignment clarification mechanism — it solves the gray-zone problem by making the category explicit before any ambiguity can arise, not after a violation is suspected.

Think about what gray-zone policies fail to do: they leave people uncertain about where the line is, especially in advance. What does the Michigan tier declaration do to that uncertainty?

14. A school designs new essay assessments that require: (1) an in-class handwritten brainstorm, (2) a first draft submitted in class, and (3) an oral five-minute explanation of the essay's argument. Which policy approach does this design represent?

Correct. This is the "Redesign the Task" approach: instead of trying to detect AI use after the fact, the assessment design requires demonstrations that AI cannot produce on a student's behalf.

The assessment requires things done in class, in real time, that the student must personally demonstrate. Which of the three core policy choices does that represent — regulating the tool, detecting after the fact, or building authenticity into the task itself?

15. An administrator says: "Our AI policy is working — we haven't had any reported violations." A student who has read this module evaluates that claim. Which response best demonstrates the reasoning developed across all four lessons?

Correct. Absence of reports is not evidence of compliance — it may be evidence of unclear definitions, low trust in the reporting process, or a policy that people have simply stopped taking seriously. A policy's success must be measured against its stated purpose, not its violation count.

Think about what "no reported violations" actually measures. Does it measure whether students are complying — or does it measure something else, like whether students know what the policy requires, or whether they trust the reporting process?