Metaethics, normative ethics, and applied ethics in AI contexts.
Philosopher Peter Singer's 1972 paper "Famine, Affluence and Morality" argued that if we can prevent something bad from happening without sacrificing anything of comparable moral importance, we are morally obligated to do so. Singer's consequentialism has been enormously influential — and enormously contested. Deontologists argue that duties to refrain from harm are stronger than duties to actively help; virtue ethicists argue Singer ignores the importance of character and community. For AI ethics, the question isn't just which framework is right — it's which frameworks are most useful for reasoning about specific AI situations, and whether different situations call for different frameworks.
T.M. Scanlon's contractualism: an action is wrong if principles allowing it could not be justified to everyone it affects. This framework has particular relevance for AI ethics — it asks designers to consider whether their design choices could be justified to every affected party:
A useful design heuristic: for each consequential design choice, ask "could I justify this to the person most harmed by it?" If not, that's a signal worth taking seriously.
5 questions — free, untracked, retake anytime.
distinguishes metaethics from normative ethics?
makes Scanlon's contractualism particularly relevant to AI ethics?
is the 'justification test' as a design heuristic for AI?
distinguishes Singer's consequentialism from deontological responses to it?
is 'which framework is most useful' a better question than 'which framework is right' for AI ethics practice?
Develop a multi-framework ethical analysis process for AI decisions.
Develop a multi-framework approach to AI ethics.
Harm ontology, stakeholder theory, and the political philosophy of impact assessment.
When Clearview AI scraped billions of photographs from social media and built a facial recognition database sold to law enforcement, the company argued it was simply indexing publicly available information — a first amendment protection. Privacy advocates argued it had created an unprecedented surveillance capability by aggregating individually innocuous public photos into a comprehensive identification system. The EU banned it under GDPR. The US legal status remained contested. The harm wasn't in any individual photo — it was in the aggregation. Harm from AI systems often emerges from combination and scale rather than from any individual data point or decision.
A distinctive feature of AI-related harm is its emergent character — harm arising from combination and scale that wasn't present in any individual component:
Who counts as a stakeholder in AI impact assessment?
Standard impact assessments focus on foreseeable direct harms. AI's emergent, aggregative, and systemic harms require expanding scope — temporally, geographically, and in terms of affected populations — beyond what standard assessment practices capture.
5 questions — free, untracked, retake anytime.
is 'aggregation harm' in the context of AI?
makes systemic AI harm difficult to attribute to specific decisions?
counts as a stakeholder in AI impact assessment beyond direct users?
is the 'scope problem' in AI impact assessment?
was the core legal and ethical dispute about Clearview AI?
Design a comprehensive AI impact assessment framework.
Design a comprehensive AI impact assessment framework.
The philosophy of fairness, anti-discrimination law, and structural equality in AI contexts.
Political philosopher John Rawls proposed the "veil of ignorance" thought experiment: imagine designing the rules of society without knowing your place in it — your class, race, gender, abilities. Behind this veil, Rawls argued, rational self-interest would produce fair principles, because you might end up in the worst-off position. Applied to AI: if you didn't know whether you'd be a beneficiary or subject of an AI system, what design constraints would you demand? This thought experiment reveals something important: fairness in AI design often requires actively designing against the interests of those with most power to shape the design.
Rawls's difference principle: inequalities are acceptable only if they benefit the least-advantaged members of society. Applied to AI: AI systems that produce unequal outcomes are justifiable only if those outcomes benefit the worst-off group. This is a much stronger fairness requirement than most current AI systems meet:
Legal anti-discrimination frameworks add to philosophical requirements:
Anti-discrimination law addresses individual decisions. AI bias operates systemically and distributionally — across thousands of decisions simultaneously. Legal frameworks designed for individual acts of discrimination don't fit this pattern well.
5 questions — free, untracked, retake anytime.
does Rawls's veil of ignorance thought experiment apply to AI design?
is Rawls's difference principle, and how does it apply to AI?
is 'disparate impact' as a legal anti-discrimination doctrine, and why is it relevant to AI?
don't anti-discrimination legal frameworks fit AI bias well?
does the veil of ignorance reveal that fairness in AI design often requires 'designing against designer interests'?
Apply Rawlsian fairness principles to AI system design.
Apply Rawlsian fairness principles to AI system design.
Explainability, interpretability, and the ethics of AI communication.
In 2018, the EU's GDPR created a "right to explanation" for automated decisions. Within months, legal scholars were debating what this right actually required. Did it require post-hoc explanations (explaining a decision after it was made)? Ante-hoc interpretability (using models that are inherently interpretable)? Both? Neither, if explanations could mislead? Researchers found that some post-hoc explanation tools (like LIME and SHAP) could be made to produce any desired explanation regardless of the actual model behavior — raising questions about whether 'explanation' requirements created a false sense of transparency rather than genuine accountability.
Honesty for AI systems involves more than just the truth of individual outputs:
If explanation tools can be made to produce any desired explanation regardless of model behavior, explanation requirements create compliance theater rather than genuine transparency. What matters is accountability — whether the explanation actually enables challenge and correction — not just whether an explanation was provided.
5 questions — free, untracked, retake anytime.
is the 'fidelity problem' with post-hoc explanation tools?
distinguishes interpretability from explainability in AI systems?
might GDPR's right to explanation create compliance theater rather than genuine accountability?
is the difference between 'sincere assertion' and 'performative assertion' for AI honesty?
makes genuine AI transparency about accountability rather than just explanation?
Design genuine transparency requirements for high-stakes AI.
Analyze the fidelity problem and design genuine transparency requirements.
Philosophical theories of autonomy, manipulation, and AI's threats to self-determination.
Philosopher Harry Frankfurt distinguished first-order desires (what you want) from second-order desires (what you want to want). Genuine autonomy, for Frankfurt, requires acting in accordance with your second-order desires — being the author of your motivational structure. Persuasive AI systems exploit this gap: they satisfy first-order desires (you want to check, you want validation) while working against second-order preferences (you want to want to spend time differently, you want to make independent judgments). This isn't just a practical problem of screen time — it's a philosophical attack on the preconditions of autonomous agency.
AI systems that shape what information users encounter, how they think about issues, and what views feel natural have significant epistemic autonomy implications. If billions of people interact with systems that subtly shape their beliefs in commercially or politically motivated directions, individual epistemic autonomy — the capacity to form beliefs through one's own reasoning — is threatened at scale.
An individual trying to influence your beliefs is a normal social interaction. A system influencing the beliefs of billions simultaneously, in commercially motivated directions, is a threat to the epistemic conditions for democratic self-governance.
5 questions — free, untracked, retake anytime.
does Frankfurt's hierarchical autonomy theory explain why persuasive AI is ethically problematic?
is 'substantive autonomy' and how do AI systems threaten it?
is the 'epistemic autonomy' problem specific to AI systems at scale?
does 'relational autonomy' theory extend the critique of AI systems?
distinguishes manipulation from legitimate influence in terms of autonomy?
Apply autonomy theories to derive AI design ethics constraints.
Apply autonomy theories to AI design ethics.
Moral responsibility theory, the problem of many hands, and new liability frameworks.
Philosopher Dennis Thompson identified what he called "the problem of many hands" in organizational ethics: in complex organizations, when something goes wrong, no individual is responsible — each contributed a small, locally rational piece of a harmful aggregate outcome. Modern AI systems are a paradigm case: thousands of engineers make individually reasonable design choices; the resulting system causes harm that no individual engineer intended or could have prevented alone. This doesn't mean no one is responsible — it means responsibility must be conceived differently than the individual attribution that traditional ethical and legal frameworks assume.
Dennis Thompson's 'problem of many hands' applies acutely to AI:
Thompson and others have argued for structural responsibility — responsibility for the institutions, incentive structures, and organizational designs that produce harmful outcomes — alongside (not instead of) individual responsibility:
Rather than asking only 'who caused this?' (retrospective), prospective responsibility asks 'who is responsible for preventing this?' Organizations that design incentive structures that predictably produce harm are prospectively responsible — regardless of whether any individual is fully causally responsible.
5 questions — free, untracked, retake anytime.
is 'the problem of many hands' in organizational ethics?
three elements does traditional moral responsibility require, and why are they difficult to satisfy in AI harms?
is 'structural responsibility' as Thompson conceives it?
distinguishes prospective from retrospective responsibility in AI contexts?
does the problem of many hands imply for AI governance design?
Develop a structural responsibility framework for AI harm.
Develop a structural responsibility framework for AI harm.
The political philosophy of AI values, universal vs. cultural ethics, and the legitimacy of alignment choices.
When Anthropic describes its Constitutional AI, it uses a written constitution of ethical principles. When Chinese AI company ByteDance deploys TikTok, its algorithm reflects different content norms. When the EU regulates AI under the AI Act, it encodes European political values. When US AI systems are trained, they reflect demographic and cultural skews in their training data and RLHF feedback. Every AI system reflects specific values — the question is whether those values are universal, culturally contingent, explicitly chosen, or accidentally encoded. AI systems deployed globally operate across cultures with genuinely different values — and neither ignoring those differences nor relativizing all values is ethically satisfactory.
The challenge of values alignment in a pluralist world:
The entities with most actual power to specify AI values (large AI developers) have the least democratic legitimacy to do so. The entities with most democratic legitimacy (affected communities, democratic governments) have the least power over AI systems deployed globally by foreign corporations.
5 questions — free, untracked, retake anytime.
is the 'values pluralism problem' for globally deployed AI systems?
is the power-asymmetry concern about 'universal values' claims in AI?
is the 'legitimacy gap' in AI values specification?
distinguishes cultural relativism from the values pluralism position in AI ethics?
is democratic legitimacy for AI values important beyond procedural fairness?
Analyze who has legitimate authority to specify AI values.
Analyze the legitimacy problem in AI values specification.
Professional ethics for AI builders, collective responsibility, and the moral obligations of those with technical power.
In 2018, Google employees organized and petitioned against Project Maven — the company's AI contract with the US military for drone targeting. Thousands signed. The company chose not to renew the contract. In 2023, employees at multiple AI companies signed open letters about safety concerns. In 2024, Anthropic published its model spec publicly — attempting to make explicit the values and priorities embedded in Claude. These episodes illustrate a developing field: the professional ethics of AI developers. What obligations do people who build AI systems have — to their employers, to users, to society, and to the future?
Professional ethics frameworks for AI are emerging:
AI builders face complicity questions that most professional fields don't:
Technical capability to build AI systems is rare and comes with corresponding moral weight. People who can build these systems are not just workers following instructions — they are agents making choices with significant societal consequences. With that capability comes responsibility that can't be fully outsourced to employers or regulators.
5 questions — free, untracked, retake anytime.
did the Google Project Maven episode illustrate about AI professional ethics?
can't technical work in AI be 'ethically neutral'?
is the 'power-responsibility principle' for AI builders?
are AI builders' obligations to third parties beyond their direct users?
does the complicity problem ask AI builders to consider?
Develop your personal AI builder ethics framework.
Synthesize the module and develop your AI builder ethics framework.
8 questions covering all lessons. Free, untracked, retake anytime.
contractualism asks AI designers to:
harm in AI means:
veil of ignorance applied to AI design would generate:
fidelity problem with post-hoc explanation tools means:
hierarchical autonomy theory implies that persuasive AI is ethically problematic because:
'problem of many hands' implies that AI governance should:
'legitimacy gap' in AI values means:
power-responsibility principle for AI builders states: