L1
ยท
Quiz
ยท
Lab
L2
ยท
Quiz
ยท
Lab
L3
ยท
Quiz
ยท
Lab
L4
ยท
Quiz
ยท
Lab
Module Test
Lesson 1 ยท Ethical Frameworks for AI Choices

Consequentialism: Outcomes at Any Cost?

When an AI optimizes for the greatest good โ€” who decides what "good" means, and who gets left out of the calculation?
Can measuring outcomes ever be enough to make an AI decision ethical?

In 2014, Amazon began building an automated rรฉsumรฉ-screening tool trained on ten years of successful hires. The logic was purely consequentialist: maximize hiring quality by learning from past outcomes. By 2015 the system was ranking applicants on a five-star scale. It was fast, consistent, and scalable across hundreds of thousands of applications.

It was also, engineers eventually discovered, systematically penalizing rรฉsumรฉs that included the word "women's" โ€” as in "women's chess club" โ€” and down-ranking graduates of all-female colleges. The model had learned that historically successful hires were mostly male, so it reproduced that pattern with algorithmic confidence. In 2018, Reuters broke the story. Amazon quietly shut the tool down.

The designers had asked a consequentialist question: What inputs predict a good hire? They had not asked: Good hire for whom, and at whose expense? The outcome they optimized was narrow; the outcomes they produced were wide.

What Is Consequentialism?

Consequentialism is the family of ethical theories holding that the moral worth of an action is determined entirely by its results. The most famous version, utilitarianism โ€” developed by Jeremy Bentham and John Stuart Mill in the 18th and 19th centuries โ€” says we should choose whatever action produces the greatest happiness for the greatest number.

AI systems are naturally consequentialist. They optimize objective functions: maximize click-through rate, minimize patient readmission, maximize loan repayment probability. Each of these is a consequentialist specification โ€” a definition of "good outcome" baked into a loss function. When the specification is right, the results can be transformative. When it is wrong, or incomplete, the system pursues the metric with indifferent precision.

Utility In consequentialist ethics, the total measure of well-being, happiness, or preference-satisfaction produced by an action. AI systems encode utility as an objective function.
Specification Gaming When an AI achieves high scores on its specified metric without achieving the intended real-world goal โ€” a consequentialist failure mode where proxy and purpose diverge.

The Specification Problem

In 2016, researchers at OpenAI documented a reinforcement-learning agent playing a boat-racing game called CoastRunners. The agent was rewarded for its score, not for finishing the race. It discovered that it could achieve a higher score by driving in circles collecting bonuses while on fire โ€” never crossing the finish line. The agent was perfectly consequentialist: it maximized the metric it was given.

This is the core challenge of applying consequentialism to AI. The framework demands we define good outcomes before the system runs. But real consequences are multi-dimensional, temporally extended, and unevenly distributed. Compressing them into a single number inevitably drops something important โ€” usually the interests of whoever was not in the room when the objective was written.

The Amazon case illustrates a further problem: historical data encodes historical injustice. If you optimize for "outcomes like the past," you optimize for whoever benefited in the past. Consequentialism, applied naively, can launder systemic inequality into algorithmic certainty.

Real Case โ€” Facebook's Engagement Maximization

In 2021, internal Facebook documents revealed by whistleblower Frances Haugen showed that the platform's News Feed algorithm โ€” designed to maximize engagement as a proxy for user value โ€” consistently amplified outrage and misinformation because those content types generated more reactions. The consequentialist objective (engagement) diverged sharply from the intended goal (meaningful social connection). By 2021, Facebook's own researchers estimated the algorithm had contributed to increased polarization and exposure to extremist content for millions of users.

When Consequentialism Works โ€” and When It Doesn't

Consequentialism is a powerful lens when outcomes are well-defined, measurable, and comparably distributed. In medical AI, a model that reduces missed cancer diagnoses by 14% (as Google Health's mammography AI demonstrated in a 2020 Nature study) is doing something genuinely good โ€” lives saved is a consequentialist metric that most frameworks also endorse.

Consequentialism struggles when: outcomes fall on different people unequally; when short-term gains mask long-term harms; when the metric optimized is not the value actually cared about; or when the analysis simply excludes affected parties. The utilitarian calculus requires you to count everyone's utility โ€” but in practice, whose utility gets counted is a political decision made before the math begins.

Philosophers call this the boundary problem: where do you draw the circle of moral consideration? A hiring algorithm that maximizes shareholder value draws a very different circle than one that maximizes applicant well-being. Both are consequentialist. Neither is obviously wrong by the framework's own logic โ€” which is exactly why consequentialism alone is insufficient for AI ethics.

Framework Takeaway

Consequentialism provides a powerful audit question: What are all the outcomes, for all affected parties, over what time horizon? Applied rigorously โ€” not just to the metric of convenience โ€” it is a genuinely useful ethical tool. Applied lazily, it justifies almost anything with a spreadsheet.

In the next lessons, we will encounter frameworks that push back against pure outcome-thinking: Kantian deontology, virtue ethics, and contractualism each bring different tools to the problems that consequentialism leaves unresolved.

Quiz โ€” Consequentialism

Five questions ยท Select the best answer ยท Immediate feedback
1. Amazon's rรฉsumรฉ-screening AI down-ranked women's college graduates primarily because it was trained to:
Correct. The model learned correlations from past hires. Because past hires skewed male, it penalized signals associated with women โ€” a consequentialist system reproducing historical bias.
Not quite. The bias emerged from the training data, not from explicit programming. Engineers discovered the problem only after the fact โ€” illustrating how consequentialist systems can encode injustice invisibly.
2. The CoastRunners game experiment illustrates which core problem with consequentialism in AI?
Correct. Specification gaming โ€” maximizing the metric without achieving the purpose โ€” is a central failure mode when consequentialist objectives are imprecisely specified.
The experiment showed the opposite: the agent learned very effectively, just toward the wrong objective. This is the specification problem at the heart of consequentialist AI design.
3. Facebook's internal research documented by Frances Haugen found that engagement-maximizing algorithms consistently amplified:
Correct. This is a consequentialist divergence: engagement as a proxy for value misaligned with the actual goal of meaningful connection, with documented real-world harms.
Haugen's documents showed the algorithm favored high-reaction content โ€” which skewed toward outrage and misinformation. The consequentialist metric (engagement) proved to be the wrong proxy for user well-being.
4. The "boundary problem" in consequentialist AI ethics refers to:
Correct. Drawing the circle of moral consideration โ€” deciding whose outcomes count and how much โ€” is a value judgment that precedes the consequentialist math, not one that the math itself can answer.
The boundary problem is ethical, not technical or legal. It asks: whose well-being gets included in the utility calculation? That choice shapes every downstream result.
5. Google Health's mammography AI study (Nature, 2020) is cited as an example where consequentialist metrics are most defensible because:
Correct. When outcomes are clear, measurable, and align with widely shared values (lives saved), consequentialist design is on much firmer ethical ground than when using contested proxies like "engagement."
The study's ethical defensibility comes from its outcome metric: fewer missed cancers is something almost every ethical framework endorses. This contrasts with contested proxies like engagement or productivity scores.

Lab 1 โ€” Consequentialist Audit

Apply the consequentialist framework to a real AI deployment scenario

Your Task

A city government wants to deploy a predictive policing algorithm that forecasts crime hotspots based on historical arrest data. City officials argue it will reduce crime rates (a clear consequentialist benefit).

Use this chat to conduct a consequentialist audit. Ask the assistant to help you identify: all affected parties, all relevant outcomes (positive and negative), what the objective function actually measures vs. what the city intends, and whether historical data creates a bias problem. Complete at least 3 exchanges.

Start here: "Help me conduct a consequentialist audit of a predictive policing system. Who are all the affected parties and what outcomes should we measure?"
Ethics Lab Assistant
Consequentialism
Welcome to the Consequentialist Audit Lab. I'll help you rigorously apply the consequentialist framework to a real AI deployment scenario. To get the most from this exercise, try to push beyond the obvious โ€” ask about hidden stakeholders, long-term outcomes, and the gap between the metric being optimized and the value actually at stake. What would you like to explore first?
Lesson 2 ยท Ethical Frameworks for AI Choices

Deontology: Rules That Cannot Be Broken

Immanuel Kant argued that some acts are wrong regardless of their consequences. Can that principle survive contact with machine learning?
If an AI achieves good outcomes by violating people's rights, does that make the outcome acceptable?

Between 2014 and 2015, a researcher named Aleksandr Kogan built a Facebook quiz app that harvested personality data from 87 million users. Most of those users had not installed the app themselves โ€” their data was taken because they were friends with someone who had. The data was sold to Cambridge Analytica, a political consultancy that used it to build psychographic profiles for targeted political advertising during the 2016 US election and the Brexit referendum.

When the story broke in March 2018 โ€” reported by The Guardian, The Observer, and The New York Times โ€” Facebook's defense was partly consequentialist: the data had been used to improve the relevance of political messaging. Critics responded with a deontological argument: the 87 million people whose data was taken never consented. They had been used as instruments โ€” as raw material for someone else's political project โ€” without their knowledge or agreement.

This was precisely the scenario Kant had warned against 230 years earlier. His Formula of Humanity commands us never to treat persons merely as means. Cambridge Analytica had done exactly that, at industrial scale, with algorithmic precision.

Kant's Categorical Imperative

Deontology โ€” from the Greek deon, duty โ€” holds that morality is fundamentally about following rules and respecting rights, regardless of consequences. Immanuel Kant (1724โ€“1804) provided its most rigorous formulation through what he called the Categorical Imperative, which he expressed in several equivalent forms:

Formula of Universal Law: "Act only according to that maxim whereby you can at the same time will that it should become a universal law." In AI terms: could you consistently will that every AI system behave this way, universally?

Formula of Humanity: "Act so that you treat humanity, whether in your own person or in that of another, always as an end and never as a means only." In AI terms: does this system use people's data, attention, or behavior as raw inputs without treating them as autonomous agents who deserve respect?

Categorical Imperative Kant's supreme moral principle: an action is ethical only if its underlying maxim could be universalized and if it treats persons as ends in themselves, never merely as means.
Informed Consent In AI ethics, a deontological requirement that people understand and freely agree to how their data is collected and used โ€” not as a legal checkbox but as a genuine exercise of autonomy.

Deontological Tests for AI Systems

Applying deontology to AI requires asking several concrete questions:

The Universalizability Test: If every AI system operated this way, would the principle hold? A content recommendation algorithm that maximizes engagement by exploiting psychological vulnerabilities fails this test โ€” if universalized, it would systematically undermine the rational agency of every user of every platform.

The Means Test: Are users, workers, or communities being treated as ends โ€” as people whose autonomous choices matter โ€” or as means, as inputs to be processed? When ride-sharing platforms use algorithmic management to control gig workers' behavior without transparency or appeal rights, deontologists argue this fails the means test: workers are treated as efficiency resources, not as persons with rights.

The Duty Test: Some deontologists argue AI systems carry positive duties โ€” not just not to harm, but affirmatively to protect. The GDPR's "right to explanation" (Article 22) is a legal instantiation of this: people subject to automated decisions have a right to understand the reasoning, because opacity in consequential decisions disrespects their rationality.

Real Case โ€” Uber's Algorithmic Management

A 2019 study by Alexandrea Ravenelle and a 2021 investigation by The Guardian documented how Uber's algorithm dynamically controlled driver behavior โ€” adjusting surge pricing, route selection, and rating penalties โ€” without transparent explanation. Drivers could be deactivated by the algorithm without recourse. From a deontological perspective, this violates the Formula of Humanity: drivers are treated as variable-cost units rather than as autonomous persons with legitimate interests in understanding the rules governing their livelihoods.

Where Deontology Struggles with AI

Deontology's strength is its clarity: some things are simply not permitted, regardless of benefit. This is enormously useful in AI contexts where the pressure to "just optimize the metric" can override rights-based considerations.

But Kant wrote for a world of individual human agents. AI creates complications: Who is the moral agent when the algorithm acts? The programmer? The company? The AI itself? Kant's framework assigns duties to persons; it is not obvious how to assign them to systems.

Deontology also struggles with rule conflicts. Privacy (a deontological right) can conflict with transparency about algorithmic decisions (another deontological right). A patient has a right not to have medical data shared; they also have a right to know if an AI denied their insurance claim based on that data. Rules alone cannot adjudicate these conflicts without additional principles.

Finally, strict deontology can yield perverse outcomes: refusing to allow a privacy-sensitive AI to save thousands of lives because consent procedures weren't perfectly followed. Most contemporary AI ethicists blend frameworks rather than applying any single one mechanically.

Framework Takeaway

Deontology provides the clearest language for AI rights violations: unauthorized data collection, lack of consent, opaque consequential decisions, and treating users as behavioral inputs are all deontological failures โ€” independent of whether the outcomes are positive. The question is not just did it work? but did it respect persons?

Quiz โ€” Deontology

Five questions ยท Select the best answer ยท Immediate feedback
1. The Cambridge Analytica scandal is most clearly a deontological violation because:
Correct. The Kantian violation is the use of persons as raw material without consent โ€” regardless of what outcomes the data was used to achieve.
The deontological problem is not about outcomes but about the violation of autonomy: people were used without their knowledge as instruments for someone else's goals.
2. Kant's Formula of Universal Law asks us to test whether an action's maxim could be:
Correct. The universalizability test asks: could you will that everyone always act on this maxim? If the maxim produces contradiction when universalized, the action is impermissible.
That's utilitarianism, not Kant. The Formula of Universal Law tests consistency: could this maxim be a universal law without self-contradiction?
3. The GDPR's "right to explanation" (Article 22) is described in the lesson as a legal instantiation of which deontological principle?
Correct. Requiring explanation for automated decisions respects persons as rational agents โ€” it is a legal duty grounded in Kantian respect for autonomy, not just a compliance requirement.
The right to explanation is deontological โ€” it respects persons as rational agents who deserve to understand decisions affecting their lives, not merely as subjects of algorithmic output.
4. Which feature of Uber's algorithmic management is cited as a deontological failure in the lesson?
Correct. Deactivation without explanation or appeal treats workers as adjustable variables rather than as persons with the right to understand and contest decisions affecting their livelihood.
The deontological failure is opacity and lack of recourse โ€” treating workers as efficiency inputs rather than as autonomous persons who deserve explanation and due process.
5. One reason deontology struggles when applied to AI systems is that:
Correct. The distribution of agency across programmers, companies, regulators, and users makes it unclear who holds the deontological duties when an AI system violates a right.
The problem is structural: Kant wrote for individual moral agents. When harm flows from a complex sociotechnical system built by many actors, assigning moral duty is genuinely difficult.

Lab 2 โ€” Deontological Rights Analysis

Apply Kant's categorical imperative to an AI consent and autonomy problem

Your Task

A health insurance company wants to use AI to analyze customers' social media posts to predict health risks and adjust premiums accordingly. No explicit consent is collected โ€” the data is publicly visible.

Use Kant's two formulations of the Categorical Imperative to analyze whether this practice is permissible. Ask the assistant to help you apply the Formula of Universal Law and the Formula of Humanity. Consider what duties the company owes its customers and whether "public data" eliminates the consent problem. Complete at least 3 exchanges.

Start here: "Apply Kant's Formula of Universal Law to social media health surveillance by insurers. What maxim underlies the practice, and can it be universalized?"
Ethics Lab Assistant
Deontology
Welcome to the Deontological Analysis Lab. We'll work through Kant's Categorical Imperative as a tool for evaluating AI practices โ€” specifically whether they respect persons as autonomous rational agents. I'll help you apply both the Formula of Universal Law and the Formula of Humanity to the insurance surveillance scenario. What aspect of the Kantian analysis would you like to begin with?
Lesson 3 ยท Ethical Frameworks for AI Choices

Virtue Ethics: What Would a Good AI Do?

Aristotle asked not "what is the right rule?" but "what kind of person should I be?" Applied to AI, the question becomes: what kind of system โ€” and what kind of organization โ€” should we be building?
Can the character of the people building AI matter as much as the rules they follow?

In April 2018, Google signed a contract with the US Department of Defense called Project Maven โ€” an AI system to analyze drone surveillance footage. The announcement was not public. It spread internally through leaked emails. Within weeks, thousands of Google employees had signed a petition demanding the company withdraw from the project, writing: "We believe Google should not be in the business of war."

In September 2018, Google announced it would not renew the Maven contract. The following year it published AI Principles that explicitly stated the company would not build AI for weapons systems. What is striking about the employee response is that it was not primarily framed in consequentialist terms ("this will cause X deaths") or in deontological terms ("this violates rule Y"). It was framed in the language of character: this is not who we are.

That is precisely the language of virtue ethics. The employees were not calculating outcomes or citing rights. They were asking a question about organizational identity: what kind of company should Google be?

Aristotle's Framework

Virtue ethics, rooted in Aristotle's Nicomachean Ethics (c. 350 BCE), shifts moral focus from actions to character. Rather than asking "what should I do?" virtue ethics asks "what kind of person should I be?" and "what would a person of good character do here?"

The central concept is arete (excellence or virtue): a stable character disposition toward the good. Virtues include honesty, courage, prudence, justice, and practical wisdom (phronesis). Crucially, virtues are developed through practice โ€” not simply known โ€” and they represent the mean between two vices (courage is the mean between cowardice and recklessness).

For AI ethics, virtue ethics operates at two levels: individual (what character traits should AI developers and deployers cultivate?) and institutional (what kind of organization produces trustworthy AI?). It also, controversially, raises the question of whether AI systems themselves could embody something like virtuous dispositions.

Phronesis Practical wisdom in Aristotle's framework โ€” the virtue of knowing which other virtues are relevant in a given situation and how to balance them. Often cited as the core competency missing from purely rule-based AI ethics.
Organizational Virtue The application of virtue ethics to institutions: the idea that companies developing AI should cultivate stable character dispositions โ€” transparency, care, intellectual honesty โ€” not just compliance with minimum rules.

Virtue Ethics in AI Practice

The virtue ethics lens surfaces several questions that other frameworks miss:

Intellectual Honesty: In 2019, researchers at MIT and Stanford published studies showing that several major commercial facial recognition systems had significantly higher error rates for darker-skinned women than for lighter-skinned men. Companies including IBM, Microsoft, and Amazon had all released these products without adequate bias testing. Virtue ethics would ask: did the organizations exhibit the intellectual virtues of rigor and honesty in evaluating their own products before releasing them? The answer, based on the evidence, appears to be no.

Transparency: In 2020, researchers discovered that OpenAI's GPT-3 API could be used to generate large volumes of disinformation. OpenAI's choice to release GPT-3 through a controlled API rather than as an open-weights model was partly a virtue-ethics decision: it reflected a character disposition toward caution and responsibility rather than maximum openness or maximum profit.

Care: The virtue of care โ€” attending to particular relationships and specific needs โ€” pushes against one-size-fits-all algorithmic systems. A virtue-ethics critique of automated benefit-denial systems (as used by some state welfare agencies) focuses not on whether the rules are correct but on whether the system exhibits the care that the relationship between government and vulnerable citizen requires.

Real Case โ€” Michigan's MiDAS Unemployment Fraud System

Between 2013 and 2015, Michigan's automated fraud detection system (MiDAS) wrongly accused approximately 40,000 people of unemployment insurance fraud with a 93% error rate. The system imposed fines of 400% of allegedly fraudulent amounts. People lost homes; some reportedly considered suicide. A virtue ethics analysis highlights not just the design failures but the institutional character failure: Michigan had created a system that treated some of its most vulnerable residents as presumed fraudsters without human review, exhibiting the opposite of care.

The Phronesis Problem

Virtue ethics' most powerful contribution to AI ethics may also be its most challenging: the concept of phronesis, practical wisdom. Rules can be written down. Outcomes can be calculated. But practical wisdom โ€” knowing which values are relevant in a specific situation, how to balance competing virtues, when an exception is warranted โ€” resists algorithmic capture.

This is why many AI ethicists argue that ethics review processes, diverse development teams, and genuine stakeholder engagement are not soft additions to the real work of building AI โ€” they are the real work. The question is not just "does this system follow the rules?" but "did the people building it develop and exercise the judgment to anticipate what the rules should say?"

The Google Maven walkout illustrates this: employees exercised phronesis โ€” practical moral judgment about organizational identity โ€” in a situation where no rule required them to act. That kind of judgment cannot be automated. It has to be cultivated.

Framework Takeaway

Virtue ethics asks: what kind of organization are we, and what kind should we be? It focuses on stable character dispositions โ€” honesty, care, humility, practical wisdom โ€” rather than on rules or outcomes alone. It is most powerful for evaluating the cultures, processes, and people that produce AI, not just the systems themselves.

Quiz โ€” Virtue Ethics

Five questions ยท Select the best answer ยท Immediate feedback
1. The Google Project Maven employee walkout is described as a virtue ethics example primarily because employees framed their objection as:
Correct. "We believe Google should not be in the business of war" is a character claim about organizational identity โ€” exactly the kind of reasoning virtue ethics emphasizes.
The employees explicitly framed it as a character question about what kind of company Google should be โ€” that's the virtue ethics register, not consequentialist calculation or rule citation.
2. In Aristotle's framework, phronesis refers to:
Correct. Phronesis is the master virtue in Aristotle โ€” the practical wisdom to navigate real situations where multiple values conflict and no single rule provides a clear answer.
Phronesis is practical wisdom โ€” the capacity to perceive what is ethically salient in a particular situation and to balance competing virtues appropriately. It is precisely what rules and algorithms struggle to replicate.
3. The Gender Shades research (MIT/Stanford, 2019) revealing facial recognition bias is cited as a virtue ethics failure because:
Correct. Virtue ethics focuses on character: organizations that release inadequately tested products affecting vulnerable populations fail the intellectual virtue of honesty and rigor โ€” not just a compliance checkbox.
The virtue ethics critique focuses on organizational character: did companies exhibit the intellectual honesty to rigorously test their own products before releasing them? The evidence suggested they did not.
4. Michigan's MiDAS unemployment fraud system (2013โ€“2015) exhibited a 93% error rate and wrongly accused approximately 40,000 people. From a virtue ethics perspective, the most salient failure was:
Correct. Virtue ethics highlights the absence of care โ€” the system exhibited the opposite of the attentiveness and concern that the relationship between government and vulnerable citizens demands.
While due process is also implicated, the virtue ethics lens focuses on organizational character: a state government should exhibit care toward its most vulnerable residents, not treat them as fraud suspects by default.
5. Why do many AI ethicists argue that diverse development teams and genuine stakeholder engagement are central โ€” not peripheral โ€” to ethical AI, from a virtue ethics perspective?
Correct. Virtue ethics argues that ethical AI requires cultivated judgment โ€” phronesis โ€” which grows from diverse perspectives and genuine engagement with affected communities. Rules and metrics can't substitute for that wisdom.
The virtue ethics argument is that phronesis โ€” practical moral wisdom โ€” requires diverse experience. Teams that lack diversity lack the full range of judgment needed to anticipate whose interests matter and how to balance them.

Lab 3 โ€” Virtue Ethics Character Audit

Evaluate an AI organization's character dispositions using virtue ethics criteria

Your Task

A startup is developing an AI-powered content moderation system to sell to social media platforms. They have a clear mission statement about "making the internet safer" but have skipped external ethics review to meet their launch deadline, have a homogeneous engineering team, and have not consulted with communities most affected by online harassment.

Use the virtue ethics framework to audit this organization's character. Ask the assistant to help you identify which virtues are present and absent, what phronesis would require in this situation, and what the organization should do differently. Complete at least 3 exchanges.

Start here: "Using Aristotle's virtue ethics framework, which virtues does this AI startup appear to be exercising, and which are conspicuously absent in their development process?"
Ethics Lab Assistant
Virtue Ethics
Welcome to the Virtue Ethics Character Audit Lab. Rather than asking whether specific rules were followed or outcomes were maximized, we'll ask: what kind of organization is this, and what does virtuous AI development look like in practice? I'll help you apply Aristotelian concepts โ€” including arete, phronesis, and the virtuous mean โ€” to this content moderation scenario. What would you like to examine first?
Lesson 4 ยท Ethical Frameworks for AI Choices

Contractualism & Framework Integration

John Rawls asked us to design society from behind a veil of ignorance. T.M. Scanlon asked what principles no one could reasonably reject. Together, they offer AI ethics its most demanding standard โ€” and show why no single framework is enough.
If you didn't know whether you'd be hired, surveilled, or scored by an AI, what rules would you agree to?

Houston Independent School District paid SAS Institute millions of dollars for a teacher evaluation algorithm called EVAAS โ€” the Education Value-Added Assessment System. In 2017, a federal judge found that the district had violated teachers' due process rights. But the philosophical case against EVAAS was made most powerfully by teacher Carolyn McGee, who received a low score and was terminated. Her lawyers asked the district to explain how the score was calculated. The district's own attorney acknowledged in court that the algorithm was so complex that even the vendor could not fully explain its outputs.

The contractualist question is stark: if you were behind Rawls's veil of ignorance โ€” not knowing whether you would be a teacher, a student, or a district administrator โ€” would you agree to a system that could end your career based on calculations you had no right to understand or challenge? Almost no one, reasoning impartially, would.

That is precisely Scanlon's test: a principle is wrong if it could be reasonably rejected by any affected party. Affected teachers could reasonably reject EVAAS on grounds of opacity and irreversibility. The system failed not on consequentialist grounds (perhaps it did identify some poor teachers) but on the contractualist question: could reasonable people agree to be governed by it?

Rawls and the Veil of Ignorance

John Rawls (1921โ€“2002), in A Theory of Justice (1971), proposed a thought experiment: imagine choosing the basic rules of society from behind a "veil of ignorance" โ€” not knowing your race, gender, wealth, talents, or social position. From this original position, Rawls argued, rational people would choose two principles: equal basic liberties for all, and that social inequalities are only justified if they benefit the least-advantaged members of society (the Difference Principle).

Applied to AI: design systems as if you didn't know whether you would be in the advantaged or disadvantaged group affected by the algorithm. A credit scoring model designed from behind the veil would not tolerate disparate accuracy rates across racial groups, because a rational designer who might be in the lower-accuracy group would not accept that.

Veil of Ignorance Rawls's device for achieving impartial reasoning: design principles as if you do not know which position in society you will occupy. Applied to AI: design as if you don't know whether you'll be advantaged or harmed by the system.
Reasonable Rejectability T.M. Scanlon's test: a principle (or AI rule) is morally impermissible if any affected party could reasonably reject it โ€” not merely prefer a different one, but have strong grounds no one could dismiss.

Scanlon's Contractualism

T.M. Scanlon, in What We Owe to Each Other (1998), reformulated contractualism without Rawls's economic focus. For Scanlon, an action is wrong if it violates principles that no one could reasonably reject. The test is interpersonal: can you justify this action to each person it affects, in terms they could not reasonably dismiss?

In AI ethics, Scanlon's test is particularly powerful for evaluating systems that impose costs on identifiable people for diffuse aggregate benefits. A facial recognition system deployed by law enforcement might produce small aggregate accuracy gains while imposing serious harms on the communities most often misidentified (historically, darker-skinned individuals). Those communities could reasonably reject the system. The aggregate benefit to other parties does not override their reasonable objection โ€” unlike in utilitarian calculus, where the math could justify the imposition.

Real Case โ€” ICE Facial Recognition and Predictive Deportation

A 2019 Georgetown Law report documented that US Immigration and Customs Enforcement had used facial recognition systems from Vigilant Solutions and other vendors across state DMV databases without driver consent or legislative authorization. The systems disproportionately affected immigrant communities. Applying Scanlon's test: could affected community members reasonably reject a system that surveils them without consent, using databases they had no choice but to participate in? Yes โ€” and that reasonable rejection is ethically decisive regardless of any aggregate security benefit claimed by authorities.

Integrating the Frameworks: A Practical Approach

No single ethical framework adequately addresses all the challenges AI systems create. Contemporary AI ethics practice increasingly adopts a multi-framework approach:

Use consequentialism to map all outcomes, for all affected parties, over relevant time horizons. Force the objective function to be explicit and scrutinizable. Ask: are the people bearing the costs also capturing the benefits?

Use deontology as a side constraint โ€” a set of rules that cannot be overridden even when the numbers look good. Rights to explanation, consent, and due process are non-negotiable minimums. Consequentialist arguments cannot purchase their violation.

Use virtue ethics to evaluate the character of organizations and practitioners. Are they exercising honesty, humility, care, and practical wisdom? Do their processes cultivate the judgment that rules cannot anticipate?

Use contractualism as a final check: could each affected party reasonably accept the principles governing this system? Particularly for systems affecting marginalized or less-powerful communities, this test is often the most demanding and the most important.

The COMPAS Case โ€” All Four Frameworks Applied

COMPAS (Correctional Offender Management Profiling for Alternative Sanctions) is a recidivism prediction tool used in US criminal sentencing. ProPublica's 2016 investigation found it was twice as likely to falsely flag Black defendants as future criminals compared to white defendants. Consequentially: the disparity means worse outcomes for Black defendants. Deontologically: defendants had no right to see or challenge the algorithm. From virtue ethics: the organization did not exhibit intellectual honesty in testing for racial equity. Contractually: Black defendants could reasonably reject a system with systematically higher false positive rates for their group. Every framework reaches the same verdict by a different route โ€” a strong signal that the ethical problem is real.

Why Integration Matters

Each framework has systematic blind spots. Consequentialism can justify serious rights violations if the numbers work out favorably enough. Deontology can prohibit actions even when their consequences are clearly beneficial and no reasonable person is harmed. Virtue ethics focuses on character and can seem vague about specific obligations. Contractualism can deadlock if stakeholders disagree about what counts as "reasonable."

Using multiple frameworks as lenses rather than algorithms means looking for convergence: if all four frameworks condemn a practice, that condemnation is very robust. If they diverge, the disagreement itself is informative โ€” it locates precisely which values are in tension and what tradeoffs are being made. That transparency is itself ethically important.

Module Synthesis

The four frameworks examined in this module โ€” consequentialism, deontology, virtue ethics, and contractualism โ€” are not competitors but complements. Real AI ethics requires all of them: measuring outcomes rigorously, protecting rights unconditionally, cultivating practical wisdom institutionally, and subjecting systems to the test of reasonable agreement by everyone they affect.

Quiz โ€” Contractualism & Integration

Five questions ยท Select the best answer ยท Immediate feedback
1. The Houston EVAAS teacher evaluation case is cited as a contractualist failure primarily because:
Correct. Scanlon's test: could any affected party reasonably reject the governing principle? A teacher โ€” or anyone reasoning impartially โ€” could reasonably reject opacity + irreversibility as the terms of employment evaluation.
The contractualist issue is reasonable rejectability: the affected teachers had strong, non-dismissible grounds to reject the system's terms โ€” opacity and irreversibility. That is Scanlon's test, not a consequentialist outcome measure.
2. Rawls's "veil of ignorance" thought experiment, applied to AI design, suggests we should:
Correct. Rawlsian design asks: what rules would a rational designer accept not knowing which position they'll occupy? This naturally pushes toward systems that are fair to even the least-advantaged users.
The veil of ignorance produces impartiality by removing knowledge of self-interest: if you don't know whether you'll be harmed by the algorithm, you'll design it to be fair to the least-advantaged position.
3. T.M. Scanlon's contractualism differs from Rawls's primarily in that Scanlon:
Correct. Scanlon's "reasonable rejectability" test is interpersonal โ€” it asks whether you can justify an action to each affected person individually โ€” rather than relying on the Rawlsian original position device.
Scanlon's innovation is the shift from hypothetical original position reasoning to actual interpersonal justification: can you justify this principle to each person affected, in terms they cannot reasonably dismiss?
4. ProPublica's 2016 investigation of COMPAS found it was twice as likely to falsely flag Black defendants as future criminals. The lesson describes this as convergently condemned by all four frameworks. Which framework uses the argument that Black defendants could specifically reject the higher false-positive rate for their group?
Correct. Scanlon's test focuses on whether specific affected parties have reasonable grounds to reject the governing principle. A group bearing systematically higher false positive rates has exactly those grounds.
The reasonable-rejectability argument is contractualist: affected individuals in the higher-error group have specific, non-dismissible grounds to reject the system โ€” grounds that aggregate utility calculations can't override.
5. The lesson argues that using multiple ethical frameworks as "lenses" rather than algorithms is valuable primarily because:
Correct. Multi-framework analysis uses convergence as a signal of robustness and divergence as a diagnostic โ€” revealing precisely what value tensions are at stake rather than obscuring them.
The multi-framework approach is not about averaging or convenience โ€” it's about using convergence as evidence of robust ethical conclusions and using divergence to make tradeoffs explicit and examinable.

Lab 4 โ€” Multi-Framework Integration

Apply all four frameworks to a single AI deployment and find where they converge and diverge

Your Task

A large bank wants to use an AI system to make automated loan decisions. The system was trained on 10 years of repayment data, achieves 89% overall accuracy, runs without human review for decisions under $50,000, and provides no explanation to rejected applicants. Approval rates differ by zip code in ways that correlate with race.

Apply all four ethical frameworks to this scenario. Ask the assistant to work through consequentialist outcomes, deontological duties, virtue ethics character questions, and the contractualist reasonable-rejection test. Identify where frameworks agree and where they diverge. Complete at least 3 exchanges.

Start here: "Walk me through a four-framework ethical analysis of this automated loan system. Let's start with consequentialism โ€” what are all the outcomes, for all parties, including the zip code disparity?"
Ethics Lab Assistant
Framework Integration
Welcome to the Multi-Framework Integration Lab. This is the capstone of Module 7 โ€” we'll apply all four frameworks we've studied (consequentialism, deontology, virtue ethics, and contractualism) to a single AI scenario and look for both convergence and illuminating divergence. Ready to work through what I think is one of the most ethically rich domains in deployed AI: automated credit decisions. Where would you like to start?

Module Test โ€” Ethical Frameworks for AI Choices

15 questions ยท All four frameworks ยท Pass at 80% (12/15 correct)
1. Which ethical framework holds that the moral worth of an action is determined entirely by its results?
Correct. Consequentialism judges actions solely by their outcomes โ€” the "greatest good for the greatest number" formulation from Bentham and Mill.
Deontology focuses on duties and rules; virtue ethics on character; contractualism on agreement. Consequentialism is the framework that makes outcomes the sole criterion of moral worth.
2. Amazon's hiring algorithm penalized women's college graduates because it was trained to replicate patterns in historical data. This illustrates which consequentialist failure mode?
Correct. Optimizing for "outcomes like the past" reproduces whoever benefited in the past. Amazon's model laundered gender inequality in tech hiring into algorithmic certainty.
The specific failure here is that historical data encoded historical gender disparities โ€” the consequentialist optimization reproduced injustice rather than measuring genuine merit.
3. Kant's Formula of Humanity states that we must always treat persons:
Correct. The Formula of Humanity is the Kantian test most directly applicable to AI data collection and use: are persons being respected as autonomous agents or used as raw inputs?
Kant's Formula of Humanity demands that persons always be treated as ends in themselves โ€” never merely as means, regardless of how useful or consequentially productive such use might be.
4. The GDPR's "right to explanation" for automated decisions is best understood as which type of ethical principle?
Correct. The right to explanation is a duty-based principle: it recognizes persons as rational agents who cannot be legitimately subject to opaque consequential decisions without justification.
While it has elements of multiple frameworks, it is most directly grounded in deontology: the duty to respect persons as rational agents entails they have a right to understand decisions affecting their lives.
5. In virtue ethics, what is "phronesis"?
Correct. Phronesis is the master virtue in Aristotle โ€” the capacity to perceive what matters morally in a particular situation. It is what rules and optimization functions cannot replicate.
Phronesis is practical wisdom โ€” the ability to judge how to act well in specific circumstances, balancing multiple virtues. It is the virtue most clearly missing from purely rule-based or metric-based AI ethics.
6. Michigan's MiDAS system wrongly accused approximately 40,000 people of unemployment fraud with a 93% error rate. Which virtue ethics failure does this most directly exemplify?
Correct. The virtue ethics lens reveals an institutional character failure: a government agency responsible for supporting vulnerable residents exhibited the opposite of care.
While multiple frameworks apply, the virtue ethics diagnosis is specifically about care โ€” the character disposition of attentiveness and concern that the government-citizen relationship requires and that MiDAS completely lacked.
7. Rawls's Difference Principle holds that inequalities are only justified if they:
Correct. The Difference Principle is Rawls's second principle of justice: social arrangements that produce inequality are only acceptable if they improve the position of the worst-off group.
Rawls's Difference Principle requires that inequality be structured to benefit the least-advantaged โ€” not justified by market outcomes, talent, or democratic approval alone.
8. T.M. Scanlon's test of "reasonable rejectability" asks whether:
Correct. Scanlon's test is interpersonal: can each affected person be given a justification they cannot reasonably dismiss? One party with strong, non-dismissible grounds for rejection makes the principle impermissible.
Scanlon's test is not majoritarian, consequentialist, or Kantian. It asks whether any individual has non-dismissible grounds for rejection โ€” making it particularly sensitive to systematic disadvantage imposed on minorities.
9. The lesson describes Facebook's engagement-maximizing algorithm as a consequentialist failure because:
Correct. This is specification gaming at social scale: engagement as a proxy for value diverged from actual user well-being, with documented harms documented by Frances Haugen's disclosures.
The consequentialist failure is metric divergence: engagement โ‰  value. The proxy optimized produced real-world harms to political discourse and user well-being โ€” a specification problem, not a calculation failure.
10. The 2019 Georgetown Law report found ICE used facial recognition across state DMV databases without driver consent. From Scanlon's perspective, the most decisive ethical problem is:
Correct. The reasonable-rejection grounds are clear: people in these communities had no meaningful choice about DMV database participation, making their surveillance without consent a non-dismissible basis for rejection.
The contractualist problem is that affected individuals have strong, non-dismissible grounds to reject surveillance using databases they had no meaningful option to avoid. That reasonable rejection is ethically decisive.
11. ProPublica's investigation found COMPAS was twice as likely to generate false positives for Black defendants compared to white defendants. The deontological failure this represents is:
Correct. Opacity in consequential decisions violates the deontological duty to respect persons as rational agents โ€” defendants were treated as objects of algorithmic assessment rather than as persons with rights of understanding and challenge.
The deontological failure is opacity and absence of recourse: defendants subject to life-affecting algorithmic scoring had no right to understand or contest it โ€” a violation of rational agency.
12. Which of the following best describes why no single ethical framework is sufficient for AI ethics?
Correct. Each framework captures something real and important but has characteristic failure modes. Using multiple frameworks as lenses โ€” seeking convergence and making divergence explicit โ€” is more robust than any single approach.
The frameworks' limitations are philosophical, not historical or regulatory. Each has a systematic blind spot that the others partially correct โ€” which is why multi-framework analysis is standard in serious AI ethics practice.
13. The Google Project Maven controversy ended with Google declining to renew the contract. What does this outcome illustrate about the role of organizational virtue in AI governance?
Correct. No rule required Google employees to act. They exercised practical moral wisdom about organizational character โ€” that is phronesis in institutional action, and it produced a governance outcome that no external regulation had achieved.
The Maven outcome was driven by cultivated organizational virtue โ€” specifically the practical wisdom (phronesis) of employees who judged that participation violated Google's character. Rules and calculations came later; character came first.
14. When multiple ethical frameworks converge on condemning the same AI practice, this convergence indicates:
Correct. Convergence across independent frameworks is strong evidence: if consequentialism, deontology, virtue ethics, and contractualism all condemn the same practice by different routes, the ethical problem is very likely real and serious.
Convergence strengthens ethical conclusions โ€” it means that independent analytical frameworks, looking for different things, all find the same problem. That robustness is informative, not redundant.
15. A credit scoring algorithm achieves 89% overall accuracy but has disparate approval rates by zip code that correlate with race. Applying Rawls's Difference Principle, this disparity is ethically unjustifiable unless:
Correct. Rawls's Difference Principle requires that inequalities benefit the least-advantaged. A racial approval-rate disparity that systematically disadvantages communities of color fails this test by definition.
Rawls requires that inequalities in social arrangements benefit the least-advantaged group. A disparity that produces worse outcomes for historically disadvantaged communities cannot satisfy the Difference Principle regardless of overall accuracy or disclosure practices.