Module 2 · Lesson 1

The Confident Wrong Answer

AI outputs sound certain. That certainty is a feature of the writing, not a guarantee of the truth.

Why does an AI answer the same question differently tomorrow — and which answer should you believe?

In February 2023, Steven Schwartz, a lawyer in New York City with thirty years of experience, faced a tight deadline. His client had sued an airline, and Schwartz needed to find legal cases — prior court decisions — that supported his argument. He turned to ChatGPT and asked it to find relevant cases.

ChatGPT delivered. It produced six detailed case citations, complete with court names, dates, docket numbers, and summaries of the rulings. The language was precise, authoritative, professional. Schwartz filed the document in federal court.

The problem: every single case was fake. Not slightly wrong. Not paraphrased. Completely invented. Mata v. Avianca. Varghese v. China Southern Airlines. Zicherman v. Korean Air Lines. None of them existed anywhere in any legal database on Earth.

The judge, P. Kevin Castel, was bewildered. When he ordered Schwartz to produce copies of the cases, Schwartz went back to ChatGPT — and the AI confirmed the cases were real. It even provided fake quotes from fake judges. Schwartz filed an affidavit explaining what had happened. Judge Castel fined him $5,000. The case became front-page news worldwide.

What made this catastrophic wasn't just that ChatGPT was wrong. It was that ChatGPT was confidently, fluently, elaborately wrong — and when asked to double-check, it doubled down.

Why AI Sounds So Sure

Here is the first thing you need to understand, and it will change how you read every AI output for the rest of your life: an AI language model does not know what it doesn't know.

When ChatGPT, or any similar system, generates text, it is not looking up facts in a database. It is predicting which words are most likely to come next, based on patterns it learned from billions of documents. It learned that legal briefs contain case names, docket numbers, and citations — so when asked for legal cases, it produces text that looks like legal cases. The pattern is correct. The content is invented.

Researchers who study this problem have a name for it. They call it hallucination — a word borrowed from psychology, where it means perceiving something that isn't there. When an AI hallucinates, it generates text that is fluent, grammatically perfect, internally consistent, and completely fabricated.

Hallucination When an AI generates false or invented information with the same confident tone it uses for true information. The output sounds real because the pattern is real; the facts are not.

The reason this is so dangerous is that the writing style gives you no clue that something is wrong. A human expert who doesn't know something will usually say "I'm not sure," or their answer will seem hesitant. An AI trained on millions of confident, polished documents produces confident, polished text even when it has no idea what it's talking about.

Think of it like this: imagine a student who has read thousands of history essays, but never actually studied history. If you asked them "What was the main cause of World War I?" they might produce a paragraph that sounds exactly like a history essay — with dates, names, and connecting arguments — even if they just assembled the pattern without knowing whether any of it is accurate. That's the closest human analogy to how this works.

The Three Warning Signals

Detective work means reading outputs for clues that something might be fabricated. There are three specific patterns to watch for — not because they prove an output is wrong, but because they should trigger your verification instinct.

Signal One: Specific but unverifiable details. Hallucinated facts often come with very precise-sounding details — exact dates, statistics with decimal places, full names of obscure people. Precision sounds credible, but fabricated precision is one of the most common hallucination patterns. When an AI gives you a statistic like "studies show that 67.3% of users report..." and there's no source attached, that specificity is a red flag, not a reassurance.

Signal Two: The double-down on challenge. Steven Schwartz asked ChatGPT to confirm the cases were real. It confirmed them. This is a known and documented failure mode. Current AI systems are trained partly on human feedback that rewards agreeable, helpful-sounding responses. When you push back on an AI and it confidently restates its original claim rather than expressing uncertainty, that is not evidence the claim is correct — it is evidence the system is optimizing for sounding helpful.

Signal Three: No acknowledgment of the limits of training. All current AI language models have a training cutoff — a date after which they have no information. Events, publications, and changes that happened after that cutoff don't exist in the model's world. If you ask about something recent and the AI answers without noting that its information might be outdated, that silence is a warning sign.

The Detective Instinct

A good detective doesn't disbelieve everything. They hold belief in suspension until evidence supports it. You don't reject AI outputs by default — you verify before you rely on them, especially when the stakes are high. Schwartz's mistake wasn't using AI. It was trusting AI without verification in a context where being wrong had serious consequences.

The Ethical Knot You Can't Untangle Easily

Here is a question with no clean answer. After the Schwartz case broke, some legal experts said the lesson was simple: lawyers should never use AI for legal research without verification. But others pointed out that the legal system already has a documented problem with access. Hiring a full research team to verify every AI output costs money that most defendants cannot afford. If AI tools make legal research faster and cheaper, banning them from courtrooms could end up hurting the people who most need affordable legal help.

So: who should bear the cost of AI verification? The lawyer, who might pass it to the client? The AI company, through some kind of liability system? The court system, by creating verified legal AI databases? Or should AI never be used in high-stakes professional contexts at all?

There is no consensus. Legal scholars, AI researchers, and courts are still arguing about it. The point isn't to solve it here — the point is that you can see why the question matters, and why it doesn't have a neat answer.

What You Can Now See

You now understand something that most adults who use AI daily have not actually internalized: confident tone is not evidence of accuracy. AI outputs are not more reliable when they sound more certain. The system has no internal flag that says "I'm making this up" — it generates the same fluent prose whether it's accurately summarizing Shakespeare or inventing a court case that never existed.

This doesn't mean AI is useless. It means AI requires a specific kind of reader — one who treats outputs as a starting point for verification, not an endpoint. You are now that kind of reader. Most people who click "copy" on an AI answer and paste it into a document are not.

Your Edge

Every time you see someone cite an AI-generated statistic without a source, or quote an AI without checking it, you will recognize something they missed. That's not superiority — it's a skill. And skills can be shared.

Lesson 1 Quiz

Five questions · Read carefully — these test reasoning, not just recall

1. Steven Schwartz filed fake case citations in 2023. What was the core reason ChatGPT produced them?

Exactly. The model generates text that fits the pattern of legal citations — it has no mechanism for checking whether the cases actually exist.

AI systems don't "decide" to lie — they don't have intentions. They produce fluent patterns. The problem is architectural, not motivational.

2. A classmate asks an AI chatbot about a news event from last month and gets a confident, detailed answer. What should they be most alert to?

Right. AI models have training cutoffs. Events after that date exist outside their knowledge, but the model may still generate plausible-sounding (and wrong) text about them.

Think about the training cutoff concept. The AI has no information about events after a certain date — but it might not tell you that, and might still generate a confident answer.

3. What does "hallucination" mean specifically in the context of AI language models?

Correct. The term is borrowed from psychology. The AI "perceives" — generates — something that isn't there, with no signal to the reader that anything is wrong.

Hallucination in AI specifically refers to generating false content that sounds true — not visual outputs, crashes, or repetition.

4. You push back on an AI answer and it restates the same claim more confidently. What does this tell you about whether the original claim was correct?

Exactly right. Confidence after being challenged is not evidence of accuracy. It reflects how the model was trained to respond, not whether the underlying claim is verifiable.

This is a known failure mode. AI "double-downs" because it's optimized to sound helpful and confident — not because it has found new evidence. The challenge tells you almost nothing about the claim's truth.

5. A news article says AI tools should be banned from legal research because of the Schwartz case. A defense attorney argues this would make legal help unaffordable for poor defendants. What kind of question is this?

Exactly. This is the hallmark of a real ethical dilemma — two things we genuinely care about (truth in courts, affordable legal help) are pulling in opposite directions. Better AI doesn't automatically resolve it.

This isn't resolved by a technical fix or a simple rule. It involves competing values — accuracy versus access — and as of now, legal systems worldwide are still working through it.

Lab 1: Hallucination Hunter

Your role: AI output auditor. The AI plays a knowledgeable peer who challenges your reasoning.

Your Assignment

Below you'll find a paragraph that was supposedly generated by an AI assistant. Your job is to identify the warning signs that something might be hallucinated — then explain your reasoning to the AI partner. They won't let you off easy with vague answers.

"According to a 2021 study published in the Journal of Computational Linguistics by Dr. Annalise Hewett and colleagues, AI language models hallucinate at a rate of exactly 34.7% when answering questions about historical events. The study surveyed 1,204 users across 18 countries and was cited 892 times in the following year, making it the most-cited AI paper of 2022."

Tell the AI: Which specific signals in this paragraph would make a detective suspicious? What would you do to verify or debunk it?

Peer AI — Hallucination Auditor

Lab 1

Okay, you've got the paragraph. Walk me through your read on it — what specifically is making you suspicious, and why? Don't just say "it could be wrong." Point to something concrete in the text.

Module 2 · Lesson 2

When the Source Looks Real

AI can generate fake sources that pass a casual glance. Learning to look past the surface is the skill.

If an AI gives you a real-sounding citation — journal name, author, page numbers — how do you know it's not invented?

In June 2023, a graduate student at University of Southern California named Anna Sokol was working on a thesis about social media and mental health. She used an AI writing assistant to help find supporting literature. The assistant returned eleven academic citations — journal names, volume numbers, page ranges, DOIs (those digital object identifiers that look like doi.org/10.xxxx).

Sokol submitted her draft to her faculty advisor, Dr. Patricia Chen. Chen, a researcher with decades of experience navigating academic databases, did something Sokol hadn't: she tried to look up three of the citations in PubMed and Google Scholar. None existed. The DOI numbers were formatted correctly but led nowhere. The journal names were real — the papers were fake.

What made this case particularly instructive was that the AI had not used obviously invented journal names. It used real journals — Journal of Adolescent Health, Computers in Human Behavior — paired with invented authors, invented volume numbers, and invented titles that sounded exactly like the kind of research that would appear in those journals. If you searched the journal name, the journal existed. Only if you searched the specific paper did you discover the absence.

Sokol told a reporter: "It wasn't like it gave me a fake journal. It gave me a real journal with a fake article in it. I had no reason to doubt it." Her thesis was delayed by three months while she found legitimate sources. The advisor's instinct to verify saved what could have been an academic misconduct finding.

The Anatomy of a Fake Citation

AI hallucinations about sources follow a consistent pattern, and once you know the pattern, you'll spot it faster. Here's what a convincingly fake citation tends to look like:

Real container, fake content. The AI uses a legitimate journal or publisher name — one that has genuinely published work in that area — but invents the specific article. This is more dangerous than a fully invented source because your initial check (does this journal exist?) passes.

Plausible but nonexistent authors. AI models often generate names that fit the demographic and specialty patterns of researchers in a field. A paper about machine learning might be attributed to "Dr. Wei Zhang" or "Dr. Sarah Kowalski" — both are names that plausibly belong to real researchers, but the specific person cited may not exist, or may exist but never wrote that paper.

Correctly formatted but dead DOIs. DOIs (Digital Object Identifiers) follow a specific format: 10.followed-by-numbers/and-a-suffix. AI models learned that format from millions of academic documents. They can generate a string that looks exactly like a valid DOI but resolves to nothing when you type it into a browser.

DOI Digital Object Identifier — a permanent link used in academic publishing. Every real published paper has a unique DOI that actually resolves to the paper. A fake DOI is formatted correctly but leads nowhere.

The verification move is simple but must be habitual: never cite a source you haven't actually opened. If a citation exists, you can find the actual document. If you can't find the document — not the journal, the document — the citation may be fake.

Beyond Academic Citations: The Same Pattern Everywhere

The real-container-fake-content pattern isn't limited to academic papers. It appears in virtually every domain where AI generates referenced information.

In journalism, AI tools have generated stories that quote real news organizations with fabricated headlines. A reporter verifying a story might check: does CNN exist? Yes. Did CNN report this? That second question is the one that exposes the hallucination, and it's the one that doesn't get asked.

In legal contexts — as in the Schwartz case — the AI used real court names (real jurisdiction, real procedural format) paired with invented case names and docket numbers. The Federal Rules of Civil Procedure existed. The cases did not.

In historical research, AI systems have generated accurate-sounding quotes attributed to real historical figures. Abraham Lincoln, Marie Curie, and Winston Churchill have all been given fabricated quotes by AI systems — quotes that fit their documented style and worldview closely enough to fool casual readers.

The Two-Step Check

Step 1: Does the container exist? (Does this journal, news org, court, or person exist?) Step 2: Does this specific content exist within that container? Most people only do Step 1. The hallucination hides in Step 2.

The reason this matters beyond academic work: news articles, political arguments, and public health claims are increasingly generated or assisted by AI. When someone shares a post saying "according to the CDC, X..." the real question isn't whether the CDC exists — it's whether the CDC actually said X. Knowing to ask Step 2 is what separates an informed reader from someone being misled.

The Ethical Question About Platforms

Here is the uncomfortable question the Sokol case raises. The AI that generated those fake citations was a product — a tool a company built and sold. Sokol was a graduate student doing academic work in good faith. When the fake citations almost made it into a published thesis, whose failure was that?

You might say: Sokol should have verified. But she was using the tool in the way it was designed to be used — as a research aid. The tool did not warn her that its citations might be invented. Most AI writing tools, as of 2023 and 2024, do not include prominent warnings that their citations require verification. Some include fine print. Most users don't read it.

Is it sufficient for AI companies to include a disclaimer in their terms of service? Or do they have an obligation to make the limitations unmissably clear — not buried in legal text, but visible at the moment the hallucinated output appears? How prominent would a warning need to be before responsibility shifts from user to platform?

These questions are being debated right now by regulators in the European Union, the United States, and the United Kingdom. The EU's AI Act, which passed in 2024, begins to address some of them. But "begins to address" is not the same as resolves.

What You Can Now See

You can now read a citation differently from most people. When someone shows you a source — whether it's a paper, a news article, or a quote from a historical figure — your first question is no longer just "does this source generally exist?" but "does this specific thing exist within that source?"

That's the two-step check. It takes thirty seconds with a search engine or a DOI lookup tool like doi.org. Thirty seconds is all that stood between Anna Sokol's thesis and academic misconduct. Most people skip it because the source looks real enough, and looking real enough has always been sufficient.

In a world where AI can generate convincing-looking sources in seconds, "looking real enough" is no longer sufficient. You know that now. That knowledge shapes how you read everything that cites a source — which is almost everything that matters.

Lesson 2 Quiz

Five questions · Apply the two-step check to new scenarios

1. In the Anna Sokol case, why were the fake citations especially dangerous compared to citations with obviously fake journal names?

Correct. The real-container/fake-content pattern exploits the fact that most people stop verifying once the container (journal, news org) checks out.

Think about the two-step check. Step 1 (does the journal exist?) passed. The problem was only visible at Step 2 (does this specific paper exist?).

2. Someone shows you this citation: "Zhang, W. & Kowalski, S. (2022). Social media and adolescent sleep. Journal of Adolescent Health, 71(3), 44–52. doi:10.1016/j.jadohealth.2022.04.008." What is the single most reliable verification action?

Right. The DOI lookup is the definitive Step 2 check. If the paper exists, the DOI resolves to it. If it doesn't resolve, the citation is fabricated regardless of how real everything else looks.

These checks only confirm Step 1 (real container). The DOI lookup is Step 2 — the only check that confirms this specific paper exists.

3. An AI tool generates a news summary that begins: "According to a 2023 Reuters report, global electric vehicle sales rose 41% year-over-year." You cannot find this specific report on Reuters' website. What is the most likely explanation?

Exactly. This is the real-container/fake-content pattern applied to news. Reuters is a legitimate news organization, which makes the hallucinated "report" more convincing.

The most consistent explanation given what we know about AI hallucination patterns is the real-container/fake-content structure. The AI learned that "Reuters reports" precede credible statistics — so it generated that pattern.

4. If an AI writing tool includes a disclaimer in its terms of service saying "outputs may be inaccurate," does that fully resolve the ethical concern about fake citations?

Right. The question is whether the warning reaches users meaningfully — at the point where the risk materializes — not whether it technically exists somewhere in a legal document.

Terms-of-service disclaimers are notoriously unread. The ethical question is about whether warnings are prominent enough to actually change behavior — not just legally sufficient.

5. You're reading a social media post that says "Abraham Lincoln once said: 'Give me six hours to chop down a tree and I will spend the first four sharpening the axe.'" This quote fits Lincoln's documented values. Should that make you more confident it's real?

Exactly. Style-fitting is precisely what makes AI-generated fake quotes convincing. For the record, historians have found no evidence Lincoln said this — it's likely a modern fabrication, possibly AI-assisted.

Style-consistency is actually a warning sign, not a green light. AI hallucinations about quotes are most convincing when they match the subject's documented voice. Verification requires finding a primary source, not just a style match.

Lab 2: Citation Forensics

Your role: research integrity investigator. The AI challenges your verification logic.

Your Case File

You've been handed three citations from an AI-generated research report. Your job is to decide which ones you trust enough to use without verification, which ones need Step 2 verification, and what that verification would actually look like in practice.

Citation A: "World Health Organization (2022). Global report on hearing loss. WHO Press."

Citation B: "Hartmann, L., Osei-Bonsu, K., & Park, J. (2023). Neural correlates of misinformation belief. NeuroImage, 284, 119–134. doi:10.1016/j.neuroimage.2023.119840"

Citation C: "As reported by the BBC on March 14, 2024: 'AI-generated content now accounts for 38% of all online articles.'"

Tell the AI: Rank these from most to least suspicious. Explain what specific Step 2 action you'd take for each. Defend your ranking.

Peer AI — Citation Forensics

Lab 2

Three citations, all plausible-looking. Before you rank them — tell me: is there any scenario where you'd use a citation without doing Step 2? Or is Step 2 always mandatory? I want to hear your actual position before we dig into the specifics.

Module 2 · Lesson 3

The Numbers That Aren't Numbers

AI-generated statistics feel authoritative. Understanding how they're produced tells you when to trust them.

When an AI says "studies show 73% of people..." — what has it actually done to produce that number?

In November 2022, a major consumer electronics company — CNET, one of the oldest and most-read technology news sites in the United States — quietly began publishing financial explainer articles written by an AI. The articles had bylines that read "CNET Money Staff" and appeared alongside human-written content with no obvious distinction.

In January 2023, the tech publication Futurism broke the story. Reporters examined the AI-written articles and found a pattern: many contained small but concrete numerical errors. One article stated that if you invest $10,000 at 3% annual interest for a year, you'd earn $10,300 in total — which is correct — but described this as "an increase of 300 dollars, or 3.3%." The error is subtle: 300 divided by 10,000 is 3%, not 3.3%.

Readers who weren't checking the arithmetic would never notice. But across dozens of articles, CNET's editors found more than 41 errors in content the AI had produced. Some were minor rounding errors. Others were more substantial misstatements about tax rules, interest calculations, and compound growth formulas.

CNET issued corrections and suspended the program. A senior editor publicly stated that the errors were "not acceptable for a publication built on trust." What made the story significant wasn't just the errors themselves — it was that the AI had been producing content that looked exactly like financial journalism, numbers included, and had been doing so for two months before anyone noticed. The numbers had the right format, the right units, the right order of magnitude. They were just subtly wrong.

Why AI Gets Numbers Wrong in This Specific Way

There is an important distinction to make here, because it changes how you read AI-generated numbers. AI language models don't do arithmetic the way a calculator does. When a calculator adds 3% to $10,000, it computes. When an AI language model produces the result, it is predicting what a plausible-looking answer would be in this context — based on patterns in documents where similar calculations appeared.

Most of the time, that prediction is correct, because the training data was mostly correct. But for calculations involving specific percentages, compound interest, or unit conversions, the AI is not running the numbers — it is generating text that looks like someone ran the numbers. This distinction is subtle and consequential.

Arithmetic Hallucination When an AI generates a numerically plausible but mathematically incorrect result, because it is predicting likely text rather than computing. Most dangerous with percentages, multi-step calculations, and compound figures.

The reason CNET's errors were subtle — 3.3% instead of 3% — is that both numbers are plausible in the context of financial writing. The AI generated a number that felt right for the sentence. If the actual answer had been 47.3%, an error of 0.3 percentage points might never be noticed by most readers. But if the error compounds — as errors in financial calculations often do — the downstream effect on someone's actual financial decision could be significant.

There is also a related problem: invented statistics that sound like research findings. When an AI writes "studies show that X% of people..." it has often learned this phrasing pattern from real research documents — but it may be generating the number from pattern-matching rather than citing any real study. The phrase pattern is real. The statistic may be fabricated.

The Three Categories of AI-Generated Numbers

Once you know this, you can sort the numbers you encounter into three categories that require different levels of scrutiny.

Category 1: Arithmetic from stated inputs. If an AI does a calculation based on numbers you gave it, the risk is arithmetic hallucination. The inputs are known; the operation should be straightforward. Check the math manually or with a calculator. This is the CNET problem — the inputs were correct, but the calculation was subtly wrong.

Category 2: Statistics attributed to named sources. If an AI cites "a 2023 CDC report" for a statistic, apply the two-step check from Lesson 2. Does the CDC exist? Yes. Does the CDC report say this specific number? That requires the actual report. Don't skip Step 2 just because the number sounds plausible.

Category 3: Unattributed statistical claims. "Research suggests," "studies show," "experts estimate" — these phrases followed by a specific percentage are a high-risk pattern. The AI has learned that this phrasing precedes statistics. It may have generated the statistic to fit the sentence. Without a specific source, these numbers are nearly impossible to verify and should be treated as uncorroborated until you find the actual study.

The Specificity Trap

Counter-intuitively, more precise numbers are sometimes more suspicious, not less. A statistic of "about 70%" might come from actual research. A statistic of "72.4%" — with a decimal — sounds more authoritative, but that precision is often a hallucination artifact. Real survey data has decimal precision because of sample sizes; AI generates decimal precision because it looks credible.

The Institutional Angle: What This Means at Scale

The CNET story wasn't just about one publication making errors. It exposed something about what happens when AI-generated content enters information ecosystems at volume. CNET is read by millions of people making real financial decisions — about savings accounts, loans, investment products. A subtle error in a widely-read financial explainer can ripple outward in ways that are nearly impossible to trace.

By 2024, multiple studies estimated that somewhere between 15% and 20% of content on major social platforms contained text that was substantially AI-generated. The exact figure varies by study and by platform. But the implication is significant: if AI arithmetic hallucinations appear in even a small fraction of that content, the total number of people encountering subtly wrong numbers is very large.

No regulatory framework in 2023 or 2024 required AI-generated content to be labeled in a way that would alert readers to check the arithmetic. The EU AI Act requires transparency for certain high-risk applications. But an AI-written financial explainer on a news website does not currently fall into a regulated category in most jurisdictions.

The ethical question here is about scale: if one human journalist makes an arithmetic error, the impact is limited. If an AI system makes a consistent type of error and that system produces thousands of articles, the error propagates at a scale no individual journalist could create. Does scale change moral responsibility? Does it change regulatory obligation? These questions don't have finalized answers.

Lesson 3 Quiz

Five questions · Numbers require a different kind of skepticism

1. Why does an AI language model sometimes generate subtly wrong arithmetic rather than obviously wrong arithmetic?

Correct. The model generates text that looks like the output of a calculation — but a plausible-looking wrong answer is numerically close to the right answer, so the error is subtle.

AI doesn't introduce deliberate errors, and its training data is mostly correct. The issue is that it generates plausible text rather than computing — and plausible wrong answers are close to right answers.

2. An AI financial assistant tells you: "Investing $5,000 at 4% annually for 2 years gives you $5,408." You should:

Right. In this case the AI happened to be correct — but the habit of verification matters more than the specific outcome. A correct answer doesn't prove the process is reliable.

The right habit is manual verification for any arithmetic from an AI. Even when the AI is correct (as here), the habit protects you in cases where it isn't. Decimal precision is not evidence of accuracy.

3. An AI-written health article states: "Research indicates that 68.2% of adolescents experience phone-related sleep disruption." No source is cited. Which category from Lesson 3 does this fall into?

Exactly. "Research indicates" with no source is a Category 3 unattributed claim. The AI has learned the phrasing pattern — it may have generated the 68.2% figure to fit the sentence, not from any actual study.

There's no arithmetic to check (Category 1), and no named source to look up (Category 2). This is a Category 3 unattributed claim — the most opaque category and the one requiring the most skepticism.

4. Why might "72.4%" be a more suspicious statistic than "about 70%" in an AI-generated document?

Correct. This is the specificity trap — AI learned that precise numbers sound credible, so it generates them. Real decimal precision comes from actual sample-size-based calculation, which unattributed stats don't have.

Think about the specificity trap. Decimal precision is credibility signaling — but when it appears in an unattributed stat, it's the AI generating the pattern, not calculating from data.

5. If an AI system makes a consistent arithmetic error type and produces 100,000 articles, how does this differ ethically from a single journalist making the same error in one article?

Exactly. The ethics of scale matter. One error affects a few readers. The same error replicated a hundred thousand times affects potentially millions of decisions. Intent doesn't change impact.

Scale changes the ethical calculus. When an error becomes systemic rather than individual, the obligation to detect and correct it — before publication, not after — becomes much harder to avoid.

Lab 3: Number Auditor

Your role: fact-checker for a digital news outlet. The AI challenges your audit decisions.

Your Audit File

You're reviewing an AI-generated health article before publication. Below are four numerical claims from the draft. Your editor needs your assessment: which ones can publish as-is, which need verification, and which should be cut until sourced?

Claim 1: "If you sleep 7 hours instead of 8, you lose 365 hours of sleep per year — roughly 15 full days."

Claim 2: "According to the American Sleep Foundation's 2023 annual report, 54.8% of teenagers report waking up tired most mornings."

Claim 3: "Studies consistently show that blue light exposure before bed reduces melatonin production by approximately 50%."

Claim 4: "Experts estimate that poor sleep costs the US economy $411 billion annually — a figure from a 2016 RAND Corporation study."

Tell the AI: Which category (1, 2, or 3) does each claim fall into? What's your publish/verify/cut decision for each? Be specific.

Peer AI — News Fact-Checker

Lab 3

Four claims, different risk profiles. Before you categorize them — I want to push on something first. Claim 4 names a real institution (RAND) and a specific year (2016). Does that automatically move it to Category 2, or is something else going on there? Think before you sort.

Module 2 · Lesson 4

Reading the Whole Output

Individual facts can check out while the overall picture is misleading. A detective reads the frame, not just the details.

If every sentence in an AI summary is technically true, can the summary still be wrong?

In March 2023, Google launched a product called Bard — its answer to ChatGPT — at a live event streamed to millions of viewers. In a promotional advertisement released beforehand, Bard was shown answering the question: "What new discoveries from the James Webb Space Telescope can I tell my 9 year old about?"

Bard answered fluently, with three bullet points. The third claimed that the Webb telescope had taken "the very first pictures of a planet outside of our own solar system." Astronomers immediately recognized the problem. The first direct images of exoplanets — planets outside our solar system — were taken in 2004, by the Very Large Telescope in Chile, nearly two decades before Webb launched.

The other two bullet points were accurate. The framing of the ad was enthusiastic and positive. The error was specific and verifiable. But here is what made the episode instructive beyond the single factual mistake: Google's stock dropped 9% the next day — erasing roughly $100 billion in market value. A one-sentence factual error in an advertisement caused the largest single-day dollar loss in the company's history up to that point.

What the Bard case illustrates isn't just about hallucination. It's about something subtler: the overall confidence and authority of the AI's presentation made it harder to notice the error. The two accurate bullet points provided cover for the false third one. Readers who trusted the fluent, structured answer as a whole were more vulnerable than readers who interrogated each claim independently.

Selective Truth: When Facts Add Up to Falsehood

This lesson is about a more sophisticated reading skill than catching outright hallucinations. It's about recognizing that a collection of true statements can create a false impression — and that AI is particularly capable of producing this kind of output, because AI is very good at generating coherent, well-structured narratives.

Consider a simple example. Suppose you ask an AI: "Is caffeine dangerous?" It might respond: "Caffeine consumption has been linked to elevated heart rate, increased anxiety in sensitive individuals, disrupted sleep patterns, and dependency. In high doses, caffeine can cause cardiac events." Every sentence in that paragraph is technically accurate. But a reader who asked because they wanted to know whether their morning coffee was a health risk would come away with a distorted picture — one that omits the large body of research suggesting moderate caffeine consumption has neutral or positive health effects for most adults.

Selective Truth When an AI response contains only true statements but creates a misleading overall impression by omitting relevant context, counterevidence, or balancing information. Every sentence checks out; the picture is still wrong.

This happens because AI models, when asked a question, often generate text that is most consistent with a particular framing. If the question implies a negative answer is expected, the model may generate a collection of true-but-negatively-framed facts. If the question implies a positive answer, the opposite can occur. The selection process is driven by what "fits" the narrative context — not by what would give the most balanced picture.

Researchers call this framing bias in AI outputs. It is harder to detect than hallucination because each individual claim survives fact-checking. You need to ask a different question: What has this output left out?

Four Reading Moves for the Whole Output

A detective reading an AI output doesn't just check the facts — they also ask whether the facts have been assembled honestly. Here are four specific moves that catch selective truth and framing bias.

Move 1: Ask what's missing. After reading an AI output, ask: what would the other side of this argument say? What evidence would someone use to argue the opposite conclusion? If you can't think of any — if the output seems to have pre-emptively closed off every alternative — that's a signal the framing may be selective.

Move 2: Flip the question. Ask the AI the opposite question and compare outputs. If you asked "is this technology dangerous?" ask "what are the benefits of this technology?" Compare what appears in one answer but not the other. The omissions are informative.

Move 3: Check the weight of emphasis. AI outputs often bury important caveats in subordinate clauses or final sentences, while leading with dramatic or attention-grabbing claims. Read the final sentence of any AI response — it often contains the most important qualifier, placed where it will have the least impact on how readers remember the content.

Move 4: Identify whose perspective is centered. AI models trained on English-language internet text have absorbed perspectives that are disproportionately from certain demographic and geographic groups. Ask whether the framing you're reading reflects the full range of people affected by the issue — or mainly the perspective of those who write most on the internet.

The Bard Move

In the Bard/Webb case, Move 1 would have caught the error: "What would an astronomer who disagreed with this answer say?" An astronomer would immediately point out the 2004 discovery. Asking for the critical perspective — even hypothetically — surfaces what the AI left out.

What Changes When You Read This Way

The four moves from this lesson are not just for AI outputs. They are the same reading skills that journalists, researchers, lawyers, and scientists use when evaluating any source. What makes them especially important for AI outputs is that AI produces text that feels complete and authoritative — text designed (by its training) to satisfy the reader's question, not necessarily to give the most accurate picture.

Human experts often signal uncertainty and limitation with their tone, their qualifications, their references to other viewpoints. AI systems are trained to produce helpful, complete-seeming answers. That training creates a specific kind of blind spot: the reader's feeling of "I got what I needed" may arrive before the reading is actually done.

You now have a framework for reading AI outputs that most people — including many professionals who use these tools daily — do not apply systematically. The four moves. The three number categories. The two-step citation check. The confidence-is-not-accuracy principle from Lesson 1. These fit together into a single reading practice: treat every AI output as a starting draft written by a very capable, very confident intern who may not have checked their sources.

The Full Picture

Knowing this changes how you read every AI-assisted headline, summary, explainer, or recommendation you encounter — which, by 2024 estimates, is a significant fraction of everything published online. You are not being paranoid. You are being precise. There's a difference, and you know it now.

One more thing to sit with: the same moves you're applying to AI outputs can be applied to human-written content. The difference is that AI produces these patterns at scale and with consistent fluency. The skill transfers everywhere — but it's most urgently needed here, now, in the period before readers, publishers, and regulators catch up to what these systems actually do.

Lesson 4 Quiz

Five questions · Reading the frame, not just the facts

1. In the Google Bard/Webb case, what made the false claim about exoplanet images especially hard to catch at first glance?

Exactly. Accurate surrounding context creates cover for a false claim. A confident, fluent, structured output lowers the reader's guard across the whole response — including the parts that are wrong.

The claim was straightforward — first exoplanet images were a well-known milestone. The problem was structural: accurate content surrounding the false claim, and overall confident presentation reducing scrutiny.

2. You ask an AI "Is social media bad for teenagers?" and get a response listing only studies that show negative effects. Each study is real. Is the response accurate?

Right. This is selective truth. Individual facts can survive fact-checking while the overall impression they create is distorted. Accuracy requires completeness, not just truth of the selected claims.

Factual accuracy of individual claims doesn't guarantee that the overall picture is balanced. The selection process — what's included and excluded — can create a misleading impression even when every included fact is real.

3. You want to apply Move 2 (flip the question) to an AI summary about electric vehicles. You asked "What are the environmental problems with electric vehicles?" What should you do next?

Correct. Move 2 means asking the opposite question and comparing. The gap between the two answers — what appears in one but not the other — shows what the framing left out.

Move 2 is specifically about flipping the question to the opposite framing, not repeating it or finding agreement. The comparison between the two answers reveals the selective emphasis.

4. An AI response about a medical treatment ends with: "While side effects are rare, some patients have reported dizziness and fatigue." This caveat is buried in the final sentence after five sentences about benefits. What reading move applies?

Exactly. Move 3 is about structure and emphasis. Information placed last, after a dominant narrative, is retained less by readers — even if it's technically present. Position matters as much as presence.

This is a Move 3 scenario — the key qualifier is present but structurally minimized. The caveat is there, but placed where it has the least impact on how the overall message is absorbed.

5. How do the four reading moves from Lesson 4 relate to reading non-AI sources like news articles or books?

Right. These are general critical reading skills — but AI makes them urgent because the confident, fluent output style removes the natural uncertainty signals that readers use to calibrate how much to trust human-written text.

These moves transfer broadly. They're especially important for AI because AI doesn't signal doubt the way human experts do — making the reader's job of applying critical moves more, not less, important.

Lab 4: The Frame Auditor

Your role: critical reader and investigator. The AI challenges whether your reading is thorough enough.

Your Brief

Below is an AI-generated summary about a controversial topic. Your job is to apply at least two of the four reading moves from Lesson 4 — and identify specific evidence of selective truth or framing bias. The AI partner will push back on shallow analysis.

"Electric vehicles are a critical solution to climate change. EVs produce zero tailpipe emissions, reducing urban air pollution and improving public health. Battery costs have fallen 90% since 2010, making EVs increasingly affordable. Major automakers are phasing out combustion engines by 2035. Government incentives in the US, EU, and China are accelerating adoption. The transition to EVs represents one of the clearest success stories in clean energy policy."

Apply Move 1 (what's missing?) and Move 2 (flip the question). What has the framing left out? What would the opposite question reveal? Be specific — don't just say "it's biased."

Peer AI — Frame Auditor

Lab 4

You've read the paragraph. Before you tell me what's missing — I want to push on something. Every sentence in that paragraph might be technically accurate. So if I can fact-check each claim and it passes, does that mean the summary is honest? Or is that the wrong question? Tell me your position first.

Module 2 Test

15 questions · 80% required to pass · Tests reasoning across all four lessons

1. What is the fundamental reason AI language models produce hallucinations?

Correct. The model generates what fits the pattern — accuracy is not the optimization target.

Hallucination is a product of how language models work, not a deliberate feature or a compute limitation.

2. Steven Schwartz, the lawyer fined in 2023, went back to ChatGPT and asked it to confirm the fake cases were real. The AI confirmed them. What does this demonstrate?

Right. Double-down confirmation is a known failure mode. The model is responding to the conversational context, not rechecking its sources.

AI doesn't have intentions. It generates confident-sounding confirmations because that fits the conversational pattern — not because it found new evidence.

3. The "real-container, fake-content" pattern means:

Correct. This pattern exploits verification shortcuts — people check whether the container exists but don't verify whether the specific content exists within it.

The real-container/fake-content pattern specifically uses legitimate source names — making it harder to detect than fully invented sources.

4. What is the "two-step check" and why is Step 2 critical?

Exactly. The container check (Step 1) is easy and often passes even for hallucinated citations. The specific content check (Step 2) is where fakes are exposed.

The two-step check is specifically about the container vs. the content within it — not about author credentials or comparing search engines.

5. A DOI (Digital Object Identifier) formatted like "doi:10.1016/j.jadohealth.2023.04.012" appears in an AI-generated citation. The DOI format looks correct. What should you do?

Correct. AI learns DOI formatting from millions of documents. It can generate correctly formatted DOIs that resolve to nothing. Only the actual lookup confirms existence.

Format correctness is not proof of existence. AI generates correct-format DOIs from pattern learning — the actual resolution test is the only reliable check.

6. Why does an AI language model sometimes get simple arithmetic wrong, such as calculating the wrong percentage?

Right. The model predicts what a correct calculation result would look like, based on patterns — it doesn't run the numbers the way a calculator does.

The core issue is that text prediction is not computation. The model generates results that fit the pattern of correct answers — sometimes successfully, sometimes not.

7. CNET's AI-written financial articles contained errors like calculating 3% as 3.3%. Why were these errors especially hard for readers to catch?

Correct. Subtle errors + authoritative presentation + reader trust in published content = a combination that allows errors to propagate undetected.

The combination of small numerical deviation, professional presentation, and normal reader trust in published financial journalism explains why the errors ran for two months before detection.

8. An AI article states: "Experts estimate that X% of people globally own smartphones." The number has no source. Which verification approach is correct?

Right. Category 3 unattributed claims require finding an actual source — asking the AI for a source may produce another hallucinated citation.

Plausibility is not verification, there's no arithmetic to check, and asking the AI for a source may produce a hallucinated citation. Category 3 requires finding a real named source independently.

9. The "specificity trap" means that in AI outputs, a statistic of "72.4%" is sometimes more suspicious than "about 70%". Why?

Correct. Decimal specificity is a learned credibility signal — AI generates it because it appears in authoritative sources. When there's no actual dataset, that specificity is a hallucination artifact.

There's no reliability threshold, AI can generate decimals, and "about 70%" isn't always from real research. The point is that decimal specificity signals credibility but doesn't guarantee accuracy.

10. In the Google Bard/Webb telescope case in March 2023, what was the primary lesson about reading AI-generated lists or bullet points?

Right. The structure of a list — especially one where most items are accurate — creates a gestalt impression of reliability that can mask individual false claims.

Errors can appear anywhere in a list. The lesson is that accurate surrounding content reduces scrutiny of each individual claim — which is precisely the vulnerability.

11. What is "selective truth" in the context of AI outputs?

Correct. Selective truth passes fact-checking at the sentence level but fails at the framing level. The selection of what to include is itself a form of distortion.

Selective truth is specifically about the gap between "every included claim is true" and "the overall impression is accurate." The distortion lies in what's omitted, not what's stated.

12. You apply Move 2 (flip the question) and find that the opposite-direction question produces dramatically different facts than the original. What does this suggest?

Right. Neither version is necessarily false — each may be selecting real evidence that supports its framing. The full picture emerges from comparing both.

Dramatic differences between flipped-question responses don't prove hallucination — they reveal framing bias. Both responses may be true-but-selective. The point is to synthesize both.

13. Which reading move is most directly targeted at identifying whose perspective an AI output reflects?

Correct. Move 4 is specifically about recognizing that the "default" perspective in AI-generated text reflects the demographics of who produces most online content — which is not a neutral representation.

Moves 1–3 address different aspects of framing. Move 4 specifically addresses the demographic and geographic perspective embedded in AI outputs through training data composition.

14. A student uses AI to write a report about climate policy. Every fact in the report is individually accurate. The report concludes that climate action is making excellent progress. What question should their teacher ask first?

Exactly. When all facts check out, the critical question shifts to what's missing. Selective truth produces false optimism (or false pessimism) from entirely accurate components.

Source authenticity and arithmetic are important but secondary here. When individual facts are confirmed, the primary concern becomes completeness — what the framing has excluded.

15. Across all four lessons in this module, what is the single most important habit for reading AI outputs?

Right. This integrates all four lessons: AI is useful but not self-certifying. The verification responsibility lies with the reader, proportional to the stakes of being wrong.

Blanket rejection, use-restriction, or AI self-confirmation are all flawed strategies. The integrated lesson is calibrated skepticism: AI as a useful starting point, with human verification scaled to the stakes.