Lesson 1 · Module 2

Personality, Tone, and Voice

Why the way a chatbot speaks matters as much as what it says.

How did Duolingo's chatbot personality change user retention — and what did Microsoft's Clippy get catastrophically wrong?

When Duolingo introduced its AI-powered character companions in 2023, the company reported that users who engaged with the personality-driven chat features were significantly more likely to complete their daily streaks. The characters — each with distinct names, quirks, and conversational styles — weren't just decoration. They were the product.

Compare this to Microsoft's infamous Clippy, retired in 2003 after nine years of user frustration. Clippy had personality — too much of the wrong kind. It interrupted constantly, offered unsolicited advice, and felt intrusive rather than helpful. It is one of the most studied failures in conversational interface history.

What "Voice" Means in Chatbot Design

A chatbot's voice is the consistent set of linguistic and tonal choices it makes across every interaction. It answers the question: if this chatbot were a person at a party, who would it be? Voice encompasses vocabulary choice, sentence length, use of humor, level of formality, and even punctuation habits.

The critical distinction designers often miss is the difference between brand voice (who the company is) and chatbot personality (how the bot expresses that identity in real-time conversation). A luxury hotel brand may have a formal brand voice, but its chatbot still needs to decide: does it say "Certainly" or "Of course" or "Absolutely"? Each carries a different emotional weight.

Dimension 01

Formality Register

The spectrum from casual ("Hey, got it!") to formal ("Your request has been acknowledged."). Should match the user's context and expectation.

Dimension 02

Warmth Level

How emotionally expressive the bot is. High warmth bots use affirmations, empathy phrases, and first names. Low warmth bots prioritize efficiency.

Dimension 03

Expertise Persona

Is the bot a peer, a guide, or an authority? Each shapes how it handles uncertainty, corrections, and complex questions.

Dimension 04

Humor Tolerance

Whether and how the bot uses levity. Humor reduces frustration in low-stakes moments but can feel dismissive during complaints.

The Persona Specification Document

Professional chatbot design teams produce a persona specification document before writing a single response. This document defines: the bot's name and backstory, its communication principles, a vocabulary list of preferred and prohibited words, sample exchanges for 10–15 common scenarios, and a guide for handling edge cases like user frustration or profanity.

In 2021, Bank of America published details on the development of Erica, its AI assistant. Erica's persona team ran multiple A/B tests on greeting phrasing alone, finding that "Hi, I'm Erica. How can I help?" outperformed "Hello! What would you like to do today?" measurably on perceived trustworthiness. The difference was subtle but consistent across millions of sessions.

Design Failure Pattern

The most common voice mistake is inconsistency. When a bot switches from casual to stiff mid-conversation — often because different teams wrote different flows — users report feeling the bot is "broken" or "not real." Consistency of voice is a proxy for reliability of function in users' minds.

Anthropomorphism: Opportunity and Risk

When we give bots names, use first-person pronouns, and express emotions, we invite anthropomorphism — users treating the bot as a social agent. Research from Stanford's CASA (Computers Are Social Actors) studies, first published in 1994 by Clifford Nass and Byron Reeves, demonstrated that humans apply social rules to computers even when they know they're machines.

This is a double-edged design tool. High anthropomorphism can increase engagement and satisfaction — users feel heard. But it also raises expectations. When the bot fails at something a human would handle easily, the disappointment is proportionally greater. Design for the failure case first. A bot that's honest about its limits disappoints less than one that oversells itself.

Key Principle

The best chatbot voices are written from a character brief, not from a list of topics the bot can handle. Start with who the bot is, then derive what it says — not the reverse.

Key Terms

Voice:The consistent personality and linguistic style a chatbot maintains across all interactions.

Persona Specification:A design document defining a chatbot's name, personality traits, vocabulary rules, and sample dialogues.

Anthropomorphism:The tendency of users to attribute human qualities — emotions, intentions, social norms — to non-human agents like chatbots.

Formality Register:The level of linguistic formality chosen for a bot's responses, ranging from casual to highly formal.

Lesson 1 Quiz

Personality, Tone, and Voice — 4 questions

What was the primary documented failure of Microsoft's Clippy as a conversational interface?

Correct. Clippy's failure was behavioral and tonal — too much unsolicited interruption — not a technical one. It's a canonical example of personality design gone wrong.

Not quite. Clippy's failure was primarily one of inappropriate behavior and tone: it interrupted users constantly and felt presumptuous, not that it lacked technical capability.

In chatbot design, what is the difference between "brand voice" and "chatbot personality"?

Correct. The brand voice is the strategic foundation; the chatbot personality is the real-time expression of it in dialogue. A luxury brand voice might manifest in precise word choices the bot makes in each exchange.

Not quite. They are related but distinct: brand voice is the company-level identity, while chatbot personality is how that translates into specific conversational choices moment by moment.

According to Nass and Reeves' CASA research, what do users tend to do when interacting with bots that have human-like voices?

Correct. The CASA (Computers Are Social Actors) framework, established by Clifford Nass and Byron Reeves in 1994, showed that humans are hardwired to treat social cues as real social interactions regardless of the source.

Not quite. CASA research found the opposite: users apply human social norms to computers even when consciously aware they are machines. This is why anthropomorphism is both powerful and risky in design.

A persona specification document primarily serves which design purpose?

Correct. A persona spec is a design artifact — written before deployment — that ensures every team member writing bot responses follows the same character brief, producing a consistent voice.

Not quite. A persona specification document is a design and writing guide focused on personality consistency: who the bot is, how it speaks, and what words it prefers or avoids.

Lab 1 — Voice Design Workshop

Practice building chatbot personality with AI feedback

Your Task: Design a Chatbot Voice

You are designing the persona for a new chatbot. Your AI lab partner will help you test personality choices, identify inconsistencies, and sharpen your voice specification. Aim for at least 3 exchanges to complete the lab.

Try: "I'm designing a chatbot for a mental health support platform. Help me define its voice." — or describe your own scenario and ask for critique of your persona choices.

Voice Design Lab

AI Partner

Welcome to the Voice Design Lab. I'm here to help you build and stress-test chatbot personality specifications. Tell me about the chatbot you're designing — its industry, users, and goals — and we'll work through voice dimensions together.

Lesson 2 · Module 2

Conversation Flow Design

Mapping the paths users take — and the ones they don't.

How did KLM Royal Dutch Airlines design a chatbot that handled 16,000 conversations per week — and what made its flow architecture work?

In 2017, KLM Royal Dutch Airlines deployed BlueBot (BB) on Facebook Messenger. Within a year, it was handling over 16,000 conversations per week, sending 1.7 million messages monthly across 15 languages. The secret wasn't AI sophistication — it was deliberate flow architecture.

KLM's team mapped every conversation path a traveler might take from the moment they landed on Messenger: booking inquiries, seat upgrades, check-in reminders, rebooking after delays. Each path had a clear happy path (the ideal completion) and defined fallback exits — graceful handoffs to human agents when the bot reached its limits.

The Anatomy of a Conversation Flow

Every chatbot conversation is a structured graph of states. The happy path is the shortest route from user intent to satisfied resolution. But designing only the happy path is why most chatbots fail — users rarely follow it.

Professional flow design requires mapping at minimum: the happy path, at least three interruption scenarios per path, error and fallback states, re-entry points when conversations stall, and handoff protocols to humans.

Intent Detection

The bot identifies what the user wants. This can be keyword-based, NLU-powered, or guided via buttons. Good design makes the user's intent clear before the bot commits to a path.

Slot Filling

The bot collects the specific information it needs to fulfill the intent. For a flight booking: departure city, destination, date, passenger count. Each "slot" is a piece of required data.

Confirmation

Before executing irreversible actions, well-designed bots confirm with the user. "You'd like to rebook to the 4:15 PM flight — shall I confirm?" This reduces costly errors.

Fulfillment

The bot executes the task — queries a database, places an order, sends a confirmation email. This is the integration layer connecting conversation to action.

Closure & Next Action

The bot confirms completion and offers logical next steps. Good closure reduces follow-up contacts by anticipating what the user needs next.

Interruptions and Digressions

The most critical — and most neglected — aspect of flow design is handling digressions: moments when users deviate from the expected path. This happens constantly. A user booking a flight may suddenly ask "What's your baggage policy?" mid-flow. A well-designed bot handles this without losing context.

Google's Dialogflow introduced a formal concept called digression handling in its design guidelines in 2018: the bot follows the tangent, then offers to return to the original task. This "follow and return" pattern mirrors how skilled human customer service agents handle interruptions.

Common Design Error

Designing flows that only work when users answer exactly as expected. Real users ask compound questions, change their minds mid-sentence, and provide information out of sequence. Flows must be tolerant of disorder, not brittle to it.

The Escalation Decision

Every flow needs explicit escalation criteria: conditions under which the bot hands off to a human agent. These are not failures — they are features. KLM's BlueBot escalated automatically when it detected sentiment indicators of high frustration, when a rebooking involved multiple flights and special needs, or when the user explicitly requested a human.

The Forrester Research report on chatbot CX from 2020 found that the single biggest predictor of chatbot satisfaction scores was how gracefully the bot escalated — not how often it resolved without escalation. A bot that tries too hard to avoid escalation destroys satisfaction. A bot that escalates smoothly, with full context transfer to the agent, delights users.

Design Maxim

Draw the failure paths before you draw the success path. The flows you design for "when things go wrong" determine whether users trust and return to your bot.

Key Terms

Happy Path:The ideal, uninterrupted route from user intent to task completion — the minimum viable flow every bot must have.

Slot Filling:The process of collecting all required data points (slots) before the bot can fulfill a user's intent.

Digression:A user's departure from the expected conversational path, requiring the bot to handle a tangent before returning to the main task.

Escalation:The protocol for handing a conversation from a bot to a human agent, ideally with full context transfer.

Lesson 2 Quiz

Conversation Flow Design — 4 questions

What did KLM's BlueBot use as a primary success mechanism, more than AI sophistication?

Correct. KLM's success came from careful mapping of all possible conversational paths, including defined exits to human agents when needed — not from leading-edge AI technology.

Not quite. KLM's BlueBot succeeded primarily because of carefully designed conversation flows with clear happy paths and graceful escalations, not because of advanced AI capabilities.

In conversational flow design, "slot filling" refers to which process?

Correct. Slot filling is the structured collection of required information — like departure city, date, and passenger count for a flight booking — before the bot can act.

Not quite. Slot filling is the process of gathering all required data inputs (slots) from the user before the bot can successfully complete the requested task.

According to Forrester Research's 2020 chatbot study, what was the single biggest predictor of high chatbot satisfaction scores?

Correct. Graceful escalation — with smooth handoffs and full context transfer — predicted satisfaction more strongly than automation rates, which is counterintuitive but well-documented.

Not quite. Forrester found that the quality of escalation to human agents — not resolution rates — was the top driver of satisfaction. Bots that escalate poorly destroy trust even if they resolve many cases autonomously.

What is a "digression" in conversation flow design?

Correct. Digressions occur when users go off-script — asking an unrelated question mid-flow — and good design handles these gracefully with a "follow and return" pattern before resuming the original task.

Not quite. A digression is when the user deviates from the main conversational path (like asking about baggage policy while booking a flight). Google's Dialogflow formalized digression handling as a key design pattern.

Lab 2 — Flow Architecture Critic

Map and stress-test your conversation flows with AI guidance

Your Task: Design and Stress-Test a Conversation Flow

Describe a chatbot task flow you want to design — or submit a draft flow you have — and your AI partner will critique it, identify missing fallback paths, and suggest improvements. Aim for at least 3 exchanges.

Try: "I'm designing a flow for a restaurant reservation chatbot. It asks for date, time, party size, and name. What am I missing?" — or describe your scenario and ask for a full flow critique.

Flow Architecture Lab

AI Partner

Welcome to the Flow Architecture Lab. Share a chatbot task you want to design a conversation flow for — tell me the use case, the steps you have in mind, and I'll help you identify gaps, edge cases, and escalation triggers.

Lesson 3 · Module 2

Error Handling and Graceful Failures

What happens when the bot doesn't understand — and why it matters more than success.

How did Amazon's Alexa team discover that users judge an assistant more harshly for bad error messages than for being wrong — and what design system did they build in response?

Amazon's Alexa UX team published findings from a large-scale user study in 2018 showing that users rated Alexa significantly lower after receiving confusing error messages than after Alexa simply admitted it didn't know something. The phrasing "I'm not sure I understand" consistently outperformed "I'm sorry, I don't know how to help with that" on user satisfaction metrics — despite conveying similar limitations.

This discovery drove a systematic rewrite of Alexa's error response library. The team created a tiered error response system: clarification prompts for ambiguous input, graceful admissions for unknown requests, and redirect responses when the user was close but not quite right. The result was a measurable improvement in retention among users who had previously abandoned Alexa after encountering errors.

The Error Taxonomy

Not all chatbot errors are the same. Conflating them leads to single-strategy responses that fail across the board. Professional chatbot design distinguishes at minimum four error types:

Error Type	Description	Best Response Strategy
No Match	The bot cannot identify any intent in the user's message	Ask a clarifying question — don't apologize, don't repeat the same prompt
Low Confidence	The bot has a guess but is uncertain which intent applies	Confirm before acting: "It sounds like you want X — is that right?"
Scope Limit	The bot understands the request but it's outside its capability	Acknowledge, explain briefly, and redirect or escalate
Repeated Failure	Three or more consecutive no-match responses in a session	Mandatory escalation trigger — do not let the loop continue

The "Three Strikes" Rule

Industry practice — codified by firms including Nuance Communications, which has handled voice bot design for hundreds of enterprise clients — establishes that three consecutive failures to understand should always trigger an escalation or a significant conversational reset. Allowing a bot to fail indefinitely is the fastest path to user abandonment and negative brand association.

The three-strikes rule requires bots to track failure counts per conversational segment, not globally per session. A user who had a clean booking experience but then hits three errors during payment needs escalation at the payment step, even if overall session health looks fine.

Language Design Pattern

The phrasing of error messages follows a three-part structure: (1) Acknowledge that there was a problem without blaming the user. (2) State what you can do, not what you can't. (3) Offer a concrete next step. "I didn't catch that — could you tell me your booking number? Or I can connect you with an agent right away."

Avoiding Error Message Anti-Patterns

Several error message patterns reliably damage user trust and satisfaction scores:

The Robot Response: "ERROR: Intent not recognized. Please rephrase your query." This exposes technical internals and signals incompetence. Never use system-level language in user-facing errors.

The Apology Loop: "I'm sorry, I didn't understand. Could you rephrase?" repeated verbatim three times. The repetition signals that the bot is not adapting — it is broken. Each clarification attempt must use different language and offer a different kind of help.

The Capability Lie: "I can help with almost anything!" followed immediately by "I'm sorry, I can't help with that." This trust violation, documented in Nielsen Norman Group UX research on chatbots published in 2019, causes users to rate the entire bot as untrustworthy, not just that response.

Failure as a Data Collection Opportunity

Every error is a signal about where user expectations and bot capabilities diverge. Leading chatbot platforms — including Google CCAI and IBM Watson Assistant — have built analytics dashboards specifically around unmatched intent logs. These logs reveal what users are trying to do that the bot cannot handle, which becomes the product roadmap for the next bot version.

Lyft's customer service chatbot team published a retrospective in 2021 describing how quarterly reviews of unmatched intent logs drove their highest-impact bot improvements: features users wanted but designers hadn't anticipated. The failures told them more than the successes did.

Design Principle

Design your error states with the same care as your success states. Users form lasting impressions of a bot most strongly during moments of failure — not during routine success.

Key Terms

No Match:An error state where the bot cannot identify any intent in the user's input; requires a clarifying question, not an apology.

Three Strikes Rule:The industry practice of triggering mandatory escalation or reset after three consecutive failures to understand within a conversational segment.

Unmatched Intent Log:A record of user inputs the bot could not handle; used as a product roadmap tool to identify capability gaps.

Apology Loop:An anti-pattern where the bot repeats the same error message verbatim, signaling that it is not adapting and damaging user trust.

Lesson 3 Quiz

Error Handling and Graceful Failures — 4 questions

Amazon's Alexa UX research found that users rated the bot lower after which specific failure?

Correct. Amazon found that confusing error phrasing damaged satisfaction more than honest admissions of ignorance. "I'm not sure I understand" significantly outperformed longer apologetic error messages.

Not quite. Amazon's research showed that poorly phrased error messages — not incorrect answers — were the bigger satisfaction driver. Clear, human error responses outperformed technically "correct" ones that sounded robotic.

What is the "Three Strikes Rule" in chatbot error handling?

Correct. The Three Strikes Rule, codified by firms like Nuance Communications, prevents endless failure loops by mandating escalation or a significant conversational reset after three consecutive no-match events.

Not quite. The Three Strikes Rule means that after three consecutive failures to understand in a given segment, the bot must escalate to a human or significantly reset — never allowing the failure loop to continue indefinitely.

Which of these is an example of the "Capability Lie" anti-pattern in error design?

Correct. The Capability Lie — overpromising capabilities and then immediately failing to deliver — was identified by Nielsen Norman Group as a major trust violation that causes users to distrust the entire bot, not just that response.

Not quite. The Capability Lie is the pattern of broadly claiming capability ("I can help with almost anything!") and then immediately admitting inability. This trust violation damages users' ratings of the whole bot system.

How did Lyft's chatbot team use unmatched intent logs, according to their 2021 retrospective?

Correct. Lyft's team found that reviewing unmatched intents quarterly revealed their highest-impact improvement opportunities — the things users wanted that the bot couldn't do yet, which became the next development cycle's priority.

Not quite. Lyft used their unmatched intent logs as a product development roadmap — those failed conversations revealed what users wanted but the bot couldn't handle, informing the next version's feature set.

Lab 3 — Error Message Rewriter

Turn robotic error messages into graceful, trust-building responses

Your Task: Rewrite Bad Error Messages

Submit a chatbot error message you've encountered (or made up), and your AI partner will critique it and help you rewrite it using the three-part acknowledge-state-offer structure. Aim for at least 3 exchanges.

Try: "Please critique this error message: 'ERROR: Intent not recognized. Please rephrase your query and try again.'" — or submit your own error messages for rewriting practice.

Error Message Rewriter

AI Partner

Welcome to the Error Message Rewriter lab. Share a chatbot error message — real or hypothetical — and I'll analyze it against the anti-patterns (Robot Response, Apology Loop, Capability Lie) and help you rewrite it using the acknowledge-state-offer framework.

Lesson 4 · Module 2

Ethical Design and Trust Transparency

When bots must disclose what they are — and what happens when they don't.

Why did California pass the BOT Disclosure Act in 2019, and what does Google's AI Principles document say about chatbot identity disclosure?

In September 2019, California's Bolstering Online Transparency (BOT) Disclosure Act took effect. It made California the first U.S. state to legally require bots to disclose they are not human when interacting with users on online platforms to influence purchasing decisions or votes. Violations carried civil penalties.

The law was driven in part by documented cases of political bots on social media masquerading as human users during the 2016 and 2018 election cycles. But its implications extended immediately into commercial chatbot design: any bot attempting to persuade must identify itself as such. This created a direct design requirement — bots needed disclosure mechanisms built into their opening exchanges.

The Disclosure Requirement in Practice

Disclosure is more nuanced than simply saying "I'm a bot" at the start of every conversation. Research from the MIT Media Lab, published in 2020, found that upfront disclosure of AI nature actually increased trust in chatbots for task-completion scenarios (booking, support, information retrieval) but decreased engagement in scenarios designed for emotional support.

This creates a design tension. The ethical requirement is clear — disclose. The UX question is how to disclose in ways that don't undermine the interaction before it begins. The emerging practice in 2023–2024 is the contextual disclosure model: disclose at first contact, clearly but concisely, then proceed naturally. "Hi, I'm Aria — an AI assistant for TechCorp. How can I help?" satisfies both the legal and UX requirements.

Principle 01

Proactive Disclosure

Identify the bot's AI nature at first contact, before any task begins. Don't wait for the user to ask.

Principle 02

Honest Capability Framing

State what the bot can help with. Never overclaim. Vague promises ("I can help with anything!") violate both ethics and UX best practice.

Principle 03

Human Access Path

Always provide a clear path to a human agent. Users must never feel trapped in a bot interaction they cannot exit.

Principle 04

Data Transparency

When the bot collects personal data, disclose it. "I'll need your booking reference — this is stored securely and used only to locate your reservation."

Google's AI Principles and Chatbot Design

Google's AI Principles, published in 2018 and updated since, include explicit guidance on identity transparency: AI systems should not be designed to deceive users into thinking they are interacting with a human. This directly shaped the design requirements for Google Assistant and the Duplex project.

Google Duplex — the AI phone-calling assistant demonstrated in 2018 and capable of booking restaurant reservations — faced immediate backlash when it became clear the system did not identify itself as AI to the humans it called. Google responded by adding a mandatory disclosure: Duplex now identifies itself as an automated caller at the start of every call. The episode is a case study in the gap between technically impressive AI and ethically acceptable AI.

Design Ethical Failure — Google Duplex, 2018

When Google demonstrated Duplex making a restaurant booking without disclosing it was AI, the public and regulatory reaction was immediate and negative — even though the technology was technically remarkable. The lesson: capability and ethics are separate design axes. You must optimize for both.

Manipulation vs. Persuasion in Chatbot Design

The line between legitimate persuasion and manipulation is a live ethical frontier in chatbot design. Legitimate persuasion includes: recommending relevant products based on stated needs, reminding users of benefits they've already agreed to, and guiding users toward decisions that serve their stated goals.

Manipulation includes: creating false urgency ("Only 2 left — buy now!"), using social proof fabricated by bots, exploiting emotional states to drive purchases, or using dark patterns like making "no" much harder to find than "yes." The FTC published guidelines in 2023 specifically addressing AI-powered marketing bots and deceptive design, signaling increasing regulatory attention to these practices.

The design standard: if a human salesperson using the same technique would be considered unethical, the bot version is also unethical. AI does not create an ethics exemption.

Building User Trust Through Transparency

Paradoxically, transparency about limitations — what a bot cannot do — consistently increases overall user trust. Edelman's Trust Barometer data from 2022 found that users who were told upfront what an AI system couldn't do rated that system more trustworthy than systems that made no capability claims but then failed.

Practical transparency mechanisms include: scope statements at conversation start ("I can help with orders, returns, and tracking — for billing issues, I'll connect you with our team"), explicit data use notices, session summaries that confirm what was agreed, and clear receipts when actions are taken.

Core Principle

Ethical chatbot design is not a compliance checkbox — it is the foundation of a sustainable product. Bots that deceive generate short-term gains and long-term brand damage. Bots that are honest about what they are and what they can do build the kind of trust that drives retention.

Key Terms

BOT Disclosure Act:California's 2019 law requiring bots to disclose their non-human nature when attempting to influence purchasing decisions or votes.

Contextual Disclosure:The practice of disclosing AI identity at first contact, clearly but concisely, in a way that doesn't undermine the subsequent interaction.

Dark Patterns:Interface design choices that manipulate users into actions they didn't intend — such as making "no" harder to find than "yes."

Human Access Path:A clearly available route for users to reach a human agent at any point in a bot interaction.

Lesson 4 Quiz

Ethical Design and Trust Transparency — 4 questions

What specific behavior did California's BOT Disclosure Act of 2019 require of chatbots?

Correct. California's BOT Disclosure Act specifically targets bots used to influence purchasing or political decisions, requiring disclosure of AI nature. It was directly motivated by political bot activity in the 2016 and 2018 elections.

Not quite. The California BOT Disclosure Act requires that bots reveal their non-human nature specifically in contexts where they are attempting to influence purchasing decisions or political opinions — not a broad blanket requirement on all bot interactions.

What did MIT Media Lab's 2020 research find about upfront AI disclosure and trust?

Correct. MIT's research revealed nuanced, context-dependent effects: knowing you're talking to AI is fine — even trust-building — for booking a flight, but it can reduce openness in emotionally sensitive conversations. Disclosure design must consider context.

Not quite. MIT found that effects varied significantly by context: disclosure boosted trust for task-completion (bookings, support) but reduced engagement in emotional support contexts — revealing that how and when to disclose requires contextual judgment, not a one-size-fits-all rule.

What was the ethical failure in Google Duplex's initial 2018 demonstration?

Correct. Duplex's demo showed it making highly convincing calls without identifying itself as AI, causing immediate backlash from the public and ethicists. Google subsequently added mandatory AI disclosure to the system.

Not quite. Duplex's ethical problem was identity deception: the restaurant staff it called believed they were speaking with a human. This violated Google's own AI Principles about deception, and Google added mandatory disclosure as a result.

According to Edelman's 2022 Trust Barometer data, what counterintuitive finding emerged about chatbot transparency and trust?

Correct. Proactively disclosing limitations — "I can help with X but not Y" — builds more trust than staying silent and then failing. Honesty about limits paradoxically makes the whole system seem more reliable and trustworthy.

Not quite. Edelman's data showed that bots that proactively stated their limitations were rated more trustworthy than those that didn't — even when both failed at the same rate. Transparency about what a bot cannot do builds trust in what it can.

Lab 4 — Ethics Audit

Evaluate chatbot designs against transparency and ethical principles

Your Task: Run an Ethics Audit on a Chatbot Design

Describe a chatbot design — real or hypothetical — and your AI partner will audit it against the four ethical principles: proactive disclosure, honest capability framing, human access path, and data transparency. Aim for at least 3 exchanges.

Try: "Audit this design: a chatbot that says 'I can help with almost anything!' has no option to reach a human, and collects email addresses without explaining why." — or bring your own design for evaluation.

Ethics Audit Lab

AI Partner

Welcome to the Ethics Audit Lab. Share a chatbot design — an opening message, a flow description, or a set of design choices — and I'll audit it against the four ethical transparency principles: proactive disclosure, honest capability framing, human access path, and data transparency. What would you like to audit?

Module 2 Test

Chatbot Design Principles — 15 questions · Pass at 80%

1. Which of the following best describes a chatbot's "voice" in design terminology?

Correct.

Voice refers to linguistic and tonal consistency across interactions — who the bot sounds like as a character.

2. The Stanford CASA research framework (Nass & Reeves, 1994) established which principle relevant to chatbot design?

Correct.

CASA showed that humans are hardwired to treat computers with social cues as social agents — regardless of knowing it's a machine.

3. A persona specification document is produced at which stage of chatbot development?

Correct.

The persona spec is written before any responses, so all writers work from the same character brief from the start.

4. In conversation flow design, the "happy path" refers to what?

Correct.

The happy path is the minimum viable flow — the shortest route to completion — but must be accompanied by fallback paths for interruptions.

5. KLM's BlueBot processed over 16,000 conversations per week. What was its primary design success factor?

Correct.

KLM's success came from careful flow design, not AI sophistication. Mapped paths and graceful handoffs to agents were the foundation.

6. What is "slot filling" in chatbot flow design?

Correct.

Slot filling is the structured data collection phase — gathering departure city, dates, etc. — before the bot can act on the user's request.

7. According to Forrester Research (2020), the single biggest predictor of high chatbot satisfaction was:

Correct.

Forrester found graceful escalation — not automation rates — was the top satisfaction predictor. Smooth handoffs with context transfer delight users more than bots that stubbornly avoid escalation.

8. Google's Dialogflow introduced the concept of "digression handling" to address which user behavior?

Correct.

Digression handling is the "follow and return" pattern: the bot follows the user's tangent, resolves it, then offers to resume the original task.

9. Amazon's Alexa UX research found that which error message style outperformed others on satisfaction?

Correct.

Amazon found that honest, simple phrases like "I'm not sure I understand" with a follow-up clarifying question consistently outperformed longer apologetic error messages.

10. The "Apology Loop" anti-pattern in chatbot error design refers to:

Correct.

The Apology Loop is the repetition of identical error messages — each clarification attempt must use different language and offer a different kind of help to avoid this pattern.

11. Lyft's chatbot team used unmatched intent logs primarily as:

Correct.

Lyft's quarterly reviews of unmatched intent logs revealed their highest-impact improvement opportunities — what users wanted that the bot couldn't yet do.

12. California's BOT Disclosure Act was motivated in part by which documented problem?

Correct.

The BOT Disclosure Act was directly motivated by documented political bot activity during the 2016 and 2018 election cycles — bots influencing opinion without revealing they were automated.

13. What design change did Google make to Duplex after the 2018 backlash?

Correct.

Google added mandatory upfront AI identification to Duplex — every call now begins with a disclosure that it is an automated system — in response to the disclosure backlash.

14. The "Three Strikes Rule" in error handling, as codified by Nuance Communications, specifies that:

Correct.

The Three Strikes Rule mandates escalation after three consecutive no-match events — never allowing the bot to loop in failure indefinitely.

15. The "contextual disclosure model" for chatbot identity transparency involves:

Correct. Contextual disclosure — "Hi, I'm Aria, an AI assistant for TechCorp. How can I help?" — satisfies both legal requirements and UX needs by being upfront without making disclosure the centerpiece of the interaction.

The contextual disclosure model means disclosing at first contact, clearly but briefly — the disclosure is present and honest without being so prominent it derails the conversation before it begins.