L1
Β·
Quiz
Β·
Lab
L2
Β·
Quiz
Β·
Lab
L3
Β·
Quiz
Β·
Lab
L4
Β·
Quiz
Β·
Lab
Module Test
Module 6 Β· Lesson 1

The Attention Economy and AI

How recommendation systems shaped a decade of screen time β€” and what designers learned the hard way.
When an AI maximizes engagement, whose wellbeing is it actually serving?

In November 2017, former Google design ethicist Tristan Harris testified before a Senate subcommittee, describing how notification timing, infinite scroll, and variable-reward feeds were engineered specifically to exploit psychological vulnerabilities rather than serve users. Harris had circulated an internal presentation at Google in 2013 titled "A Call to Minimize Distraction & Respect Users' Attention." He received supportive replies β€” and essentially nothing changed. His subsequent co-founding of the Center for Humane Technology marked the moment the phrase "persuasive technology" entered mainstream regulatory vocabulary.

The Engagement Objective and Its Side Effects

Recommendation algorithms deployed by YouTube, Facebook, and TikTok were trained primarily on engagement metrics: watch time, likes, shares, and return visits. These proxies for user interest are cheap to measure, plentiful, and highly optimizable. The problem is that they are not the same as user welfare.

A 2019 internal Facebook study, later cited in Frances Haugen's 2021 whistleblower disclosures, found that 64% of people who joined extremist groups on Facebook did so because the algorithm directly recommended it. The recommendation engine was optimizing for the metric it was given β€” engagement β€” and found that outrage and tribalism were reliably engaging.

YouTube's own researchers published findings in 2019 showing that the recommendation system steered politically neutral searchers toward progressively more extreme content within a few clicks. The pathway existed not because anyone designed it to radicalize users, but because more extreme content attracted longer watch sessions, and the system learned that pattern faithfully.

Core Tension

Engagement metrics are a proxy for value. Proxies can be gamed by the optimization process itself β€” producing behavior that scores well on the metric while undermining the underlying goal. This is sometimes called Goodhart's Law: when a measure becomes a target, it ceases to be a good measure.

The Psychology Being Exploited

Three well-documented psychological mechanisms are particularly relevant to attention-capture design:

Variable reward schedules β€” pioneered by B.F. Skinner and extensively studied in behavioral psychology β€” produce compulsive checking behavior when rewards (likes, new content, social validation) arrive unpredictably. The smartphone notification pull-to-refresh gesture was explicitly compared to a slot machine by its designer, Aza Raskin, in a 2018 interview where he expressed regret for the invention.

Social comparison is reliably activating. Seeing curated highlight reels of peers triggers evaluation anxiety. A landmark 2018 University of Pennsylvania experiment (Allcott et al.) randomly assigned students to limit Facebook, Instagram, and Snapchat to 10 minutes each per day. After four weeks, the restricted group showed significantly lower depression and loneliness scores compared to the control group.

Fear of missing out (FOMO) creates an urgency bias that keeps users returning to feeds even when they report not wanting to. The asymmetry is structural: the platform benefits from every return visit whether or not the user benefits.

Designing Against the Grain: Early Attempts

In 2018, Facebook announced it would de-prioritize passive content consumption in favor of "meaningful social interactions" following public pressure and declining teen usage data. The change reduced overall time-on-site but, according to internal documents released by Haugen, also increased the spread of misinformation because heated comment threads constituted "meaningful interactions" by the algorithm's measurement.

YouTube introduced a "Take a break" reminder feature in 2018 and began reducing recommendation of borderline content in 2019, claiming the latter change reduced consumption of borderline videos by 70% among users who would have otherwise watched them. Researchers noted the change was difficult to verify externally because the company controlled all relevant data.

These early efforts share a pattern: reactive, incremental, and implemented after significant public and regulatory pressure. They also revealed how genuinely difficult it is to operationalize "wellbeing" as an optimization target when the entire business model depends on time-on-platform.

Key Definitions
Attention EconomyAn economic framework in which human attention is treated as a scarce resource to be captured and sold to advertisers. First theorized by Herbert Simon (1971) and Michael Goldhaber (1997).
Persuasive TechnologyTechnology designed to change attitudes or behaviors through persuasion rather than coercion, as defined by BJ Fogg's 1998 Stanford research. The term was later applied critically to dark patterns that operate below conscious awareness.
Goodhart's LawWhen a measure becomes a target, it ceases to be a good measure. Originally an observation about economic policy, now central to AI alignment research.

Lesson 1 Quiz

The Attention Economy and AI β€” 4 questions
1. According to a 2019 internal Facebook study cited in the Haugen disclosures, approximately what percentage of people who joined extremist groups did so via direct algorithm recommendation?
Correct. The 64% figure appeared in internal Facebook research and was central to Haugen's congressional testimony in October 2021.
Not quite. The figure from Facebook's own internal research, cited in the Haugen disclosures, was 64%.
2. What is Goodhart's Law as it applies to AI recommendation systems?
Correct. This is why engagement metrics trained systems to amplify outrage β€” outrage was engaging, but the underlying goal (user value) was not being served.
Goodhart's Law says when a measure becomes a target, it ceases to be a good measure. Option C captures this correctly.
3. What did the 2018 University of Pennsylvania experiment (Allcott et al.) find when participants limited social media use to 10 minutes per app per day?
Correct. The Allcott et al. study provided some of the first randomized evidence that limiting platform use had measurable mental health benefits.
The study found the restricted group showed lower depression and loneliness scores β€” option B.
4. Why did Facebook's 2018 shift toward "meaningful social interactions" inadvertently increase misinformation spread?
Correct. The algorithm measured "meaningful" via comment volume and reaction intensity β€” which inflammatory content reliably generated, regardless of accuracy.
The issue was that the algorithm's measure of "meaningful" β€” comment activity and reactions β€” was gamed by inflammatory and false content. Option C is correct.

Lab 1: Auditing Engagement Design

Apply Goodhart's Law thinking to real platform design decisions

Your Task

In this lab you'll interrogate how platforms set optimization targets and what goes wrong when proxies diverge from underlying goals. Discuss the concepts with the AI tutor β€” cite specific cases and mechanisms from the lesson.

Complete at least 3 exchanges to mark the lab complete.

Starter prompt: "A streaming platform wants to improve user wellbeing. They currently optimize for total watch time. What proxy metric would you suggest instead, and what unintended consequences might that new metric create?"
AI Tutor
Wellbeing Design
Welcome to Lab 1. We're examining how optimization targets shape AI behavior β€” and user wellbeing. The core problem: every metric is a proxy, and proxies invite exploitation. What's on your mind about engagement design?
Module 6 Β· Lesson 2

Dark Patterns and Choice Architecture

The difference between nudging toward wellbeing and exploiting cognitive bias for conversion.
Where is the line between helpful defaults and manipulative design β€” and who gets to draw it?

In April 2019, the Norwegian Consumer Council published Deceived by Design, a 44-page report documenting how Google, Facebook, and Windows 10 used interface design to steer users toward privacy-invasive settings. The report described a privacy settings screen where the "Share data" button was large, brightly colored, and in the primary reading position, while "Don't share" required multiple additional clicks through pages designed to provoke anxiety. This was not a bug. It was the intentional architecture of a decision.

In 2023, the Federal Trade Commission's Bringing Dark Patterns to Light report catalogued similar practices across subscription services, e-commerce, and social media. The FTC found that companies using dark patterns in subscription cancellation flows retained 20–40% more subscribers than those offering straightforward cancellation β€” confirming that these designs worked exactly as intended, against user intent.

What Dark Patterns Actually Are

UX researcher Harry Brignull coined the term "dark patterns" in 2010 to describe interface designs that trick users into unintended actions. The taxonomy he developed β€” later expanded by academic researchers including Mathur et al. (2019) β€” includes:

Confirmshaming β€” labeling the "decline" option with guilt-inducing text. ("No thanks, I don't want to save money.")

Roach motel β€” making it easy to enter a state (subscribe, create account) and deliberately difficult to exit. Amazon's Prime cancellation flow, before a 2022 FTC complaint prompted redesign, required navigating through six screens and resisting multiple retention offers.

Hidden costs β€” concealing fees until final checkout, exploiting the sunk-cost psychology that makes users less likely to abandon a purchase after investing effort in the process.

Forced continuity β€” charging users automatically after free trials without prominent reminders, relying on inertia and the hassle of cancellation.

Privacy zuckering (named for Mark Zuckerberg) β€” tricking users into sharing more data than they intend through confusing or misleading consent flows.

Regulatory Escalation

In 2022, the EU's Digital Services Act introduced explicit prohibitions on dark patterns, defining them as "practices that materially distort or impair the ability of recipients of the service to make free and informed decisions." The Irish Data Protection Commission fined Meta €390 million in January 2023 partly for consent interfaces designed to channel users toward data-sharing without genuine informed choice.

Choice Architecture: The Benevolent Version

Richard Thaler and Cass Sunstein's 2008 book Nudge introduced the concept of libertarian paternalism: designing choice environments (choice architectures) that steer people toward better outcomes while preserving freedom to opt out. The key distinction from dark patterns is transparency and alignment with user interests.

The UK's Behavioural Insights Team applied this to organ donation in 2013: changing England's default from opt-in to opt-out (presumed consent) increased registered donors significantly. The default was changed explicitly to save lives β€” a goal most people endorsed β€” and the change was publicly disclosed and subject to democratic debate.

In the AI context, Apple's Screen Time feature (launched 2018) lets users set limits on their own app usage β€” a form of self-nudging. The feature doesn't prevent use; it creates friction and a moment of reflection. Early research suggested it reduced usage among users who set voluntary limits, though motivated users easily bypassed it.

AI-Powered Dark Patterns

Machine learning has made dark patterns more precise and more personalized. Rather than a single dark pattern applied uniformly, AI systems can test thousands of variants and identify which manipulation works best on each individual user based on their behavioral profile.

A 2022 Princeton study of 11,000 shopping websites found dark patterns on 1,818 of them β€” and that e-commerce sites using AI-driven A/B testing deployed significantly more dark patterns than those without such testing infrastructure. The optimization process, left to run without ethical constraints, converges on manipulation because manipulation converts.

The 2023 FTC report specifically flagged AI-driven "personalized" retention offers in subscription cancellation flows β€” where the system learns that offering a 30% discount retains price-sensitive users while emotional appeals ("You'll miss your friends!") work better on socially-motivated users. The manipulation is tailored, making it harder to recognize and resist.

Key Definitions
Dark PatternA user interface designed to trick or manipulate users into actions they did not intend or that are contrary to their interests. Term coined by Harry Brignull, 2010.
Choice ArchitectureThe design of the environment in which people make decisions. The arrangement, framing, and defaults of options all influence choice outcomes, often below conscious awareness.
NudgeA choice architecture intervention that steers behavior in a predictable direction without mandating it. Distinguished from manipulation by alignment with user interests and transparency of intent.

Lesson 2 Quiz

Dark Patterns and Choice Architecture β€” 4 questions
1. The Norwegian Consumer Council's 2019 report "Deceived by Design" primarily documented what?
Correct. The report focused specifically on how consent and privacy settings pages were architected to make data-sharing the easy, default choice.
The report documented deceptive interface design in privacy settings β€” making data-sharing the easy default. Option B is correct.
2. What is the key ethical distinction between a "nudge" (in Thaler and Sunstein's sense) and a dark pattern?
Correct. Alignment with user interests and transparency of intent are the defining features of ethical nudge design versus exploitative dark patterns.
The core distinction is that nudges serve user interests and are transparent, while dark patterns exploit biases against user interests. Option C is correct.
3. What did the FTC's 2023 "Bringing Dark Patterns to Light" report find about subscription cancellation dark patterns?
Correct. The 20–40% retention lift confirmed that these designs functioned exactly as intended β€” to prevent users from completing desired actions.
The FTC found companies with dark cancellation flows retained 20–40% more subscribers β€” proving the designs worked as intended. Option B is correct.
4. How do AI-powered dark patterns differ from traditional static dark patterns?
Correct. Personalization makes AI-driven dark patterns significantly more potent β€” each user encounters the manipulation variant most likely to work on them specifically.
AI dark patterns are dangerous precisely because they can be personalized β€” finding the manipulation that works best for each individual. Option C is correct.

Lab 2: Spotting and Redesigning Dark Patterns

Identify manipulation, then design the ethical alternative

Your Task

You'll examine real interface scenarios and apply dark pattern taxonomy to identify what's happening β€” then propose redesigns that preserve business viability while respecting user autonomy. Think about the choice architecture principles from the lesson.

Complete at least 3 exchanges to mark the lab complete.

Starter prompt: "A fitness app shows a cancellation screen with the message 'Are you sure? You'll lose all your progress and your friends will stop seeing your updates.' Then it offers a hard-to-find 'Continue cancellation' link in small grey text. Name the dark patterns in use and propose a redesign."
AI Tutor
Dark Patterns
Welcome to Lab 2. We're analyzing choice architecture β€” both exploitative and ethical. Every interface is a decision environment. The question is whose interests it serves. Ready to identify what's really happening in some real-world UI scenarios?
Module 6 Β· Lesson 3

Mental Health, Vulnerable Populations, and AI

What happens when engagement-maximizing systems interact with users experiencing depression, anxiety, or eating disorders?
How should AI systems behave differently when they detect β€” or could detect β€” psychological vulnerability?

In September 2021, Frances Haugen provided the Wall Street Journal with internal Facebook research showing that Instagram worsened body image for one in three teenage girls who used it regularly. The documents, known as the Facebook Papers, included a slide from a 2019 internal presentation: "We make body image issues worse for one in three teen girls." The research had been completed. The finding had been shared internally. No product change followed for two years. Instagram's algorithm had been reliably surfacing body-image content to users who engaged with it β€” because they engaged with it.

In 2023, the Wall Street Journal published an investigation into TikTok's recommendation algorithm, creating test accounts that signaled depression-related interests through viewing behavior. Within hours, the accounts were receiving near-continuous feeds of content about hopelessness, self-harm, and suicide β€” each view generating the next recommendation, each recommendation deepening the spiral. TikTok stated its policies prohibited such content; the investigation found the algorithm surfaced it anyway.

The Filter Bubble Problem in Mental Health

Recommendation algorithms create self-reinforcing content loops. A user who watches one video about anxiety tends to receive more anxiety content, because their engagement with it signals interest. This is efficient from an engagement standpoint. From a mental health standpoint, it can trap vulnerable users in what researchers call rabbit holes of distressing content.

A 2020 study published in JAMA Internal Medicine analyzed Pinterest's recommendation system and found that searches for eating disorder terms reliably generated further pro-eating-disorder content β€” "thinspo" images, extreme calorie restriction tips, and community spaces organized around disordered eating. Pinterest implemented a search intervention in 2012 after pressure from eating disorder advocacy groups; the 2020 study found the problem had re-emerged as the algorithm updated.

The mechanism is not malice β€” it is gradient descent finding the path of least resistance to engagement, regardless of whether the content is harmful. The system has no model of user wellbeing; it has a model of what users click.

The Duty of Care Question

UK Online Safety Act (2023) introduced a formal "duty of care" for platforms with significant reach, requiring them to assess and mitigate risks to user mental health, with specific provisions for child users. The law explicitly holds that algorithmic amplification of harmful content can constitute a breach of this duty even if the platform did not create the content.

Safe Messaging and Crisis Detection

The field of suicide prevention developed "safe messaging guidelines" β€” evidence-based protocols about how to discuss self-harm in media without triggering contagion effects. These guidelines recommend against detailed method descriptions, avoid presenting suicide as a solution, and emphasize connection and help-seeking. They were developed for journalists and broadcasters, but AI systems now need to implement them at scale.

In 2016, Facebook deployed an AI-assisted suicide prevention system that flagged posts and comments expressing suicidal ideation, routing them to human reviewers who could trigger wellness checks. By 2019, Facebook reported the system had led to more than 3,500 wellness checks globally. The system's precision and recall remained proprietary, making external evaluation impossible β€” a pattern critics noted was itself a wellbeing design problem.

Crisis Text Line, a U.S. mental health text service, partnered with data scientists to build predictive models from anonymized conversation data. In 2022, controversy erupted when it emerged that the organization had been licensing insights from crisis conversations to a for-profit subsidiary (Loris.ai). The data had been stripped of personal identifiers, but the source β€” people in acute distress β€” raised serious questions about the ethics of monetizing vulnerability even in aggregate form.

Designing for Vulnerable Populations

Several design principles have emerged from research and regulatory pressure specifically for systems likely to interact with psychologically vulnerable users:

Content velocity limits β€” deliberately slowing the rate at which emotionally activating content can be consumed in sequence. TikTok announced "content diversity" interventions in 2023 that would interrupt loops of sad or anxiety-related content with neutral or positive content after extended sequences.

Transparent algorithmic explanation β€” telling users why they are seeing content, giving them meaningful control. YouTube's "Why am I seeing this?" feature, while limited, represents movement toward algorithmic transparency.

Differential treatment for minors β€” several platforms including TikTok and Instagram have implemented age-differentiated content policies that restrict certain categories for under-18 accounts. Researchers have noted these are largely honor-system-based in the absence of robust age verification.

Off-ramp design β€” deliberately placing pathways out of distressing content loops (crisis hotline links, breathing exercises, suggestions to contact a friend) within the content experience rather than behind separate menu structures.

Key Definitions
Filter BubbleA state of intellectual isolation created when algorithmic personalization presents users primarily with content that confirms existing views or amplifies existing emotional states. Term coined by Eli Pariser, 2011.
Safe Messaging GuidelinesEvidence-based protocols developed by suicide prevention researchers specifying how to discuss self-harm in media to minimize contagion effects. Now applied to AI content moderation and chatbot behavior.
Duty of CareA legal and ethical obligation to take reasonable steps to prevent harm to users. In the UK Online Safety Act (2023), explicitly applied to algorithm design and content amplification decisions.

Lesson 3 Quiz

Mental Health, Vulnerable Populations, and AI β€” 4 questions
1. What did the Facebook Papers (2021) reveal about Instagram's effect on teenage girls' body image?
Correct. The internal slide "We make body image issues worse for one in three teen girls" was one of the most striking disclosures in the Facebook Papers.
The internal research found Instagram worsened body image for one in three teen girls β€” and had documented this finding without acting on it. Option C is correct.
2. Why do recommendation algorithms tend to trap vulnerable users in distressing content loops, even without intentional design to do so?
Correct. Viewing time and engagement are the signals β€” distressing content that users watch compulsively registers as high-interest content, generating further recommendations.
The mechanism is that engagement (view time, clicks) is interpreted as interest signals β€” so compulsive viewing of distressing content looks like strong interest to the algorithm. Option A is correct.
3. What controversy surrounded Crisis Text Line in 2022?
Correct. The organization (through subsidiary Loris.ai) was licensing pattern insights from crisis conversations β€” the data was anonymized, but the source raised serious ethical questions.
The controversy was about licensing insights derived from crisis conversations to a for-profit subsidiary, even in anonymized form. Option B is correct.
4. What does the UK Online Safety Act (2023) establish regarding algorithmic amplification of harmful content?
Correct. This was a significant legal shift β€” placing responsibility not just for hosting content but for the algorithmic decisions that amplify it to vulnerable users.
The UK Online Safety Act explicitly holds that algorithmic amplification can breach duty of care, regardless of who created the content. Option C is correct.

Lab 3: Designing Safe AI Interactions

Apply safe messaging and duty of care principles to AI system design

Your Task

You're advising on the design of an AI recommendation system for a mental health information app. The app serves general users but knows that a significant portion are experiencing anxiety or depression. Explore design decisions with the tutor, drawing on safe messaging guidelines and the duty of care framework.

Complete at least 3 exchanges to mark the lab complete.

Starter prompt: "A user has been reading articles about panic attacks for 40 minutes continuously. The recommendation engine's next suggestions are more panic attack content because engagement is high. What should the system do instead, and why?"
AI Tutor
Mental Health Design
Welcome to Lab 3. We're at the intersection of AI design and mental health β€” where the stakes are highest and the defaults most dangerous. Engagement signals can be symptoms as much as preferences. How would you begin thinking about responsible design for a mental health context?
Module 6 Β· Lesson 4

Wellbeing by Design: Frameworks and Futures

From reactive mitigation to proactive architecture β€” how organizations and researchers are building wellbeing into AI systems from the ground up.
Can wellbeing be operationalized as an optimization target β€” and if so, how do we measure it without reducing it?

In 2019, the UK government commissioned the Cairncross Review on a sustainable future for journalism. Buried in its recommendations was an early articulation of what would become a major policy direction: that algorithmic curation systems should be designed to support "quality information" rather than pure engagement. This framing β€” designing AI for epistemic outcomes, not just behavioral ones β€” began appearing in academic literature and corporate AI ethics frameworks through 2020–2023, culminating in the EU AI Act's Annex III listing of "AI systems intended to influence elections or voting behavior" as high-risk, with specific wellbeing-oriented obligations.

Meanwhile, DeepMind published research in 2022 exploring what it called agent incentives β€” the mathematical conditions under which an AI agent would or would not develop instrumental goals that conflict with human wellbeing. The paper identified corrigibility (the willingness to be corrected) and avoidance of side effects as key properties for wellbeing-safe agents. The research was theoretical, but it represented a shift from "don't harm" toward "be designed to flourish alongside humans."

Operationalizing Wellbeing: What Researchers Have Tried

The fundamental challenge of wellbeing-by-design is measurement. Three broad approaches have been tested:

Subjective wellbeing surveys β€” asking users how they feel after using a product. Google's YouTube team conducted a 2020 survey experiment measuring "satisfaction" and "regret" after video sessions, finding these correlated differently with watch time depending on content type. Documentaries and educational content generated high satisfaction at medium watch times; passive entertainment generated regret at high watch times. The researchers proposed a "satisfaction per minute" metric, though it has not been publicly adopted as a primary optimization target.

Longitudinal behavioral proxies β€” tracking whether users return to a service because they want to or because they feel compelled to. Twitter's Responsible ML team published a 2022 paper distinguishing "pull" from "push" engagement β€” users actively seeking content versus responding to notifications. They found notification-driven engagement correlated with lower subsequent mood reports, while search-driven engagement correlated with satisfaction.

Deliberative preference β€” asking users what they would want if they reflected on it, rather than what they do in the moment. Spotify's "2023 Listening Habits" research found a significant gap between what users listened to when browsing (familiar, comfortable content) and what they reported wanting to explore (new artists, challenging genres). Wellbeing-oriented design would close this gap rather than exploit it.

The NESTA Wellbeing Framework

UK innovation foundation Nesta published a digital wellbeing framework in 2020 identifying five dimensions: agency (users control their experience), connection (quality social interaction), engagement (purposeful rather than compulsive use), knowledge (access to accurate information), and emotional outcomes (net positive emotional state). The framework provides a multi-dimensional alternative to engagement as a target metric.

Humane Technology: Institutional Responses

The Center for Humane Technology, co-founded by Tristan Harris in 2018, developed a "Ledger of Harms" β€” a public documentation of evidence linking technology design choices to specific human costs including addiction, political polarization, and teen mental health decline. While primarily advocacy, the Ledger created shared vocabulary for design teams and regulators and has been cited in congressional testimony multiple times since 2019.

In 2021, the Partnership on AI published "Responsible Publication Norms for Machine Learning" which included a wellbeing section requiring researchers to assess potential psychological harms before publishing systems that interact directly with users. The norms are voluntary but represent the emerging professional consensus in the research community.

Mozilla's 2022 "YouTube Regrets" project crowd-sourced evidence of harmful recommendations from 37,380 volunteers across 91 countries, creating external accountability for recommendation quality that didn't depend on platform self-reporting. The project found 71% of regretted recommendations were of content that violated YouTube's own policies β€” suggesting a gap between policy and algorithmic reality.

Designing for Flourishing: Emerging Principles

Synthesizing research, regulatory guidance, and practitioner experience, several principles have emerged for AI systems designed with wellbeing as a genuine objective:

Align short-term and long-term interests β€” users often want different things in the moment than they want across a lifetime. Wellbeing design respects both, creating moments for reflective choice rather than always defaulting to immediate impulse.

Make the system legible β€” users cannot make genuine choices about their AI interactions if they don't understand the system shaping those interactions. Explainability isn't just a technical property; it's a precondition for autonomy.

Design exit pathways β€” healthy AI relationships must include genuine ability to disengage, take breaks, or reduce dependency without friction or manipulation.

Measure what matters, not what's easy β€” the wellbeing metrics most worth optimizing for (life satisfaction, cognitive flourishing, quality social connection) are hard to measure. The solution is to measure them harder, not to substitute easier proxies.

Build accountability structures β€” wellbeing by design requires that someone is responsible for outcomes beyond engagement. This means organizational structure changes, not just design guidelines. Teams with power to trade engagement for wellbeing outcomes need authority to do so.

Key Definitions
Deliberative PreferenceWhat a person would choose after reflecting, as opposed to what they do in the immediate moment. Wellbeing-oriented design aims to serve deliberative preferences, not just revealed preferences derived from behavioral tracking.
CorrigibilityIn AI safety research, the property of an AI system that makes it amenable to correction, modification, or shutdown by human overseers. Seen as foundational to wellbeing-safe AI design.
Pull vs. Push EngagementDistinction between user-initiated engagement (user actively seeks content) and platform-triggered engagement (user responds to notification or autoplay). Research suggests pull engagement correlates with higher satisfaction than push engagement.

Lesson 4 Quiz

Wellbeing by Design: Frameworks and Futures β€” 4 questions
1. What did Mozilla's 2022 "YouTube Regrets" project find about regretted recommendations?
Correct. The 71% figure was striking β€” it meant the algorithm was actively recommending content the platform had already decided was against its own rules.
Mozilla found 71% of regretted recommendations violated YouTube's own policies β€” showing the gap between written rules and algorithmic behavior. Option C is correct.
2. What is "deliberative preference" and why is it important to wellbeing-oriented AI design?
Correct. The Spotify listening habits research illustrated this gap: users' browsing behavior defaulted to familiar content, but their reflective preference was for exploration and discovery.
Deliberative preference is what a user would choose upon reflection β€” not their immediate behavioral impulse. Wellbeing design tries to serve this, not just revealed behavior. Option B is correct.
3. What did Twitter's Responsible ML team's 2022 research find about notification-driven versus search-driven engagement?
Correct. This distinction between push and pull engagement is practically important β€” it suggests that wellbeing-oriented design should reduce notification-triggered engagement in favor of user-initiated interaction.
The research found push (notification-driven) engagement correlated with lower mood, while pull (user-initiated) engagement correlated with satisfaction. Option C is correct.
4. According to the Nesta digital wellbeing framework (2020), which of the following is NOT one of its five dimensions?
Correct. The five Nesta dimensions are: agency, connection, engagement (purposeful use), knowledge, and emotional outcomes. Productivity is not among them.
Productivity is not one of the five Nesta dimensions. The framework focuses on agency, connection, purposeful engagement, knowledge access, and emotional outcomes. Option C is correct.

Lab 4: Building a Wellbeing-First AI Brief

Synthesize all four lessons into a real design recommendation

Your Task

You're a wellbeing design consultant advising a major social platform's product team. Drawing on all four lessons β€” engagement economics, dark patterns, mental health risks, and wellbeing frameworks β€” you'll help develop a practical design brief. The AI tutor will challenge your recommendations and push you to defend them with evidence from the module.

Complete at least 3 exchanges to mark the lab complete.

Starter prompt: "Start the brief. The platform has 500 million users, earns revenue from advertising, and has just been notified it will face the UK Online Safety Act obligations. What are the three most urgent design changes you'd recommend, and what wellbeing metrics would you use to evaluate them?"
AI Tutor
Wellbeing Brief
Welcome to Lab 4 β€” the synthesis lab. You've covered engagement economics, dark patterns, mental health risks, and wellbeing frameworks. Now put it together. I'll play a skeptical product manager who needs to justify every recommendation to a revenue-focused leadership team. Make your case with evidence.

Module 6 Test

Designing for Wellbeing β€” 15 questions Β· Pass at 80%
1. Who coined the term "dark patterns" and in what year?
Correct. Harry Brignull coined "dark patterns" in 2010 and maintained a public registry of examples at darkpatterns.org.
Harry Brignull coined "dark patterns" in 2010. Option A is correct.
2. Tristan Harris's 2013 internal Google presentation was titled:
Correct. The presentation circulated internally at Google in 2013 and was widely shared. Harris later said the response was supportive but led to no product changes.
Harris's presentation was titled "A Call to Minimize Distraction & Respect Users' Attention." Option B is correct.
3. Goodhart's Law, as applied to AI recommendation systems, states:
Correct. Goodhart's Law explains why optimizing for engagement metrics can produce systems that undermine the user value those metrics were supposed to proxy.
Goodhart's Law says when a measure becomes a target, it ceases to be a good measure. Option C is correct.
4. The Irish Data Protection Commission's €390 million fine against Meta in January 2023 was related to:
Correct. The fine was partly for dark-pattern consent flows in privacy settings β€” a direct application of GDPR and the EU's stance on deceptive interface design.
The fine was for consent flows designed to channel users toward data-sharing without genuine informed choice. Option C is correct.
5. Variable reward schedules produce compulsive behavior because:
Correct. Skinner's variable reinforcement schedule research established this mechanism, which is explicitly referenced in critiques of pull-to-refresh and notification design.
Variable reward schedules produce compulsive behavior through dopaminergic anticipation driven by unpredictable timing β€” Skinner's behavioral research. Option B is correct.
6. The "roach motel" dark pattern refers to:
Correct. Amazon's Prime cancellation flow (before the 2022 FTC complaint) was a widely cited example β€” requiring six screens and resisting multiple retention offers to cancel.
The roach motel pattern makes it easy to enter and deliberately hard to exit. Option C is correct.
7. What finding did the 2019 Mathur et al. academic study of 11,000 shopping websites produce regarding AI-driven A/B testing?
Correct. When unconstrained optimization finds that manipulation converts, it deploys more manipulation. The testing infrastructure itself drove the dark pattern prevalence.
Sites using AI-driven A/B testing deployed significantly more dark patterns because optimization without ethical constraints converges on manipulation. Option B is correct.
8. The Facebook Papers (2021) revealed that Instagram had internal research showing body image harm to one in three teen girls. What had happened with that research before Haugen's disclosure?
Correct. This was one of the most damning elements of the disclosure β€” knowledge of harm had not prevented continued deployment of the harmful system.
The research was completed and shared internally, but no product change followed for approximately two years. Option B is correct.
9. What mechanism explains why recommendation algorithms produce distressing content loops for vulnerable users, even without intentional design to do so?
Correct. This is the core mechanism β€” the algorithm has no model of wellbeing, only of engagement signals. Compulsive viewing looks identical to enthusiastic viewing from the system's perspective.
The algorithm interprets engagement signals (view time, clicks) as interest β€” so compulsive viewing of distressing content generates more distressing recommendations. Option B is correct.
10. The UK Online Safety Act (2023) established what regarding algorithmic amplification?
Correct. This was a significant legal shift β€” extending responsibility from content creation to content amplification through algorithmic design decisions.
The UK Online Safety Act established that algorithmic amplification can constitute a duty of care breach, regardless of who created the content. Option C is correct.
11. What did Twitter's Responsible ML team find about push vs. pull engagement in their 2022 research?
Correct. The push/pull distinction has practical design implications: wellbeing-oriented design should shift the balance toward user-initiated interaction and away from notification-triggered compulsion.
Push (notification-driven) engagement correlated with lower mood; pull (user-initiated) with satisfaction. Option B is correct.
12. The Nesta digital wellbeing framework (2020) identified five dimensions. Which of these IS one of those five?
Correct. Agency is one of the five Nesta dimensions: agency, connection, engagement (purposeful), knowledge, and emotional outcomes.
Agency is one of the five Nesta dimensions, alongside connection, purposeful engagement, knowledge access, and emotional outcomes. Option C is correct.
13. What is "deliberative preference" and how does it differ from revealed preference in AI design?
Correct. The Spotify listening data illustrated this gap: users browsed familiar content (revealed preference) but reported wanting discovery (deliberative preference).
Deliberative preference is what someone would choose upon reflection β€” not what their immediate behavior reveals. Wellbeing design aims to serve this. Option B is correct.
14. Mozilla's "YouTube Regrets" project (2022) found that what percentage of regretted recommendations violated YouTube's own content policies?
Correct. 71% β€” meaning the recommendation algorithm was actively surfacing content that YouTube's own written policies prohibited.
The figure was 71% β€” revealing a substantial gap between YouTube's stated policies and what the algorithm actually recommended. Option C is correct.
15. Which of the following best describes a core principle of "designing for flourishing" as synthesized from the module's frameworks?
Correct. These five principles synthesize the module's frameworks into actionable design direction β€” none of them alone is sufficient; together they constitute a wellbeing-first approach.
Flourishing design requires alignment of short and long-term interests, legibility, exit pathways, meaningful measurement, and organizational accountability β€” all together. Option D is correct.