Module 3 · Lesson 1

Opacity by Design

When systems are built to obscure, not illuminate

Why do some consequential AI systems hide how they work — and who pays the price?

In 2016, ProPublica published an investigation into COMPAS — Correctional Offender Management Profiling for Alternative Sanctions — a risk-assessment tool used by judges across the United States to inform bail and sentencing decisions. The investigation found that the algorithm was twice as likely to falsely flag Black defendants as future criminals compared to white defendants. Northpointe, the company behind COMPAS, refused to release the algorithm's weights, citing trade-secret protections. Defendants had no way to challenge scores they could not see.

The case crystallised a fundamental tension: opacity is often not an oversight but a deliberate product choice, and that choice carries severe distributional consequences.

What We Mean by "Transparency in Decision Making"

Transparency in AI decision making is not a single toggle. It is a spectrum of disclosures — from revealing that an algorithm exists, to showing which inputs it uses, to exposing the mathematical relationships between those inputs and outputs. Each layer of disclosure has different legal, commercial, and ethical implications.

Researchers distinguish three levels: algorithmic transparency (knowing the system exists and its general logic), data transparency (knowing what training data shaped it), and outcome transparency (knowing why a specific decision was reached for a specific person). Most public controversy focuses on the third level, because it is the one that affects individual rights.

Why Opacity Persists

Commercial incentives reward opacity. Algorithms are expensive to build and easy to copy; trade-secret law protects them. Regulators have historically lacked the technical capacity to audit what they cannot read. And most affected individuals — loan applicants, criminal defendants, welfare recipients — lack both the legal standing and the technical vocabulary to demand explanations.

The Architecture of Hidden Decisions

Most black-box decisions in high-stakes domains share a recognisable structure. A proprietary model is trained on historical data, deployed via an API or embedded in government software, and its outputs are presented to decision-makers as objective scores. The decision-maker — a judge, a loan officer, a benefits caseworker — often treats the score as authoritative precisely because they cannot interrogate it.

The UK's A-level algorithm controversy of 2020 illustrates this clearly. Ofqual's standardisation model downgraded 39% of teacher-predicted grades, disproportionately harming students from state schools and disadvantaged backgrounds. The model was never published in advance. Students who received degraded grades had no access to the weighting formula that had effectively determined their university places.

COMPAS — Key Facts

Used in at least 10 US states. Scores range 1–10 on recidivism risk. Northpointe maintained trade-secret protection throughout litigation. A New Jersey court ruled in 2017 that defendants had no constitutional right to examine the source code.

UK A-Level Model, 2020

Ofqual's model used school-level historical performance to adjust individual grades. Students at small schools were most affected because of insufficient historical data. Model was reversed within nine days after mass protests.

Key Terms

Algorithmic opacityThe condition in which the logic of an automated decision system is unavailable to those affected by its outputs, whether by technical complexity, proprietary protection, or deliberate design.

Recidivism scoreA numeric estimate of how likely a person is to re-offend, used in US criminal justice to inform pretrial release, sentencing, and parole decisions.

Outcome transparencyThe disclosure of why a specific individual received a specific algorithmic decision, as opposed to general documentation of model architecture.

Standardisation modelA statistical method applied to reconcile predicted grades with historical school performance — used by Ofqual in 2020 when COVID-19 cancelled UK public exams.

The Deeper Problem

Opacity is not neutral. When a system cannot be examined, its errors cannot be identified, its biases cannot be corrected, and its accountability cannot be assigned. The COMPAS case shows that the absence of transparency is itself a policy choice — one that systematically disadvantages those who most need recourse.

Lesson 1 Quiz

Opacity by Design — check your understanding

What did ProPublica's 2016 investigation find about the COMPAS algorithm?

Correct. ProPublica found COMPAS produced false positives for Black defendants at roughly twice the rate compared to white defendants — one of the most cited empirical findings in algorithmic fairness research.

Not quite. ProPublica's core finding was a significant racial disparity in false-positive rates — Black defendants were far more likely to be incorrectly labelled high-risk.

Which of the following best describes "outcome transparency"?

Correct. Outcome transparency is the most granular form — it addresses why this person got this result, which is most relevant to individual rights and legal challenges.

That describes algorithmic or data transparency. Outcome transparency specifically concerns individual-level explanations for individual decisions.

Why was Ofqual's 2020 A-level standardisation model controversial from a transparency standpoint?

Correct. The model was applied to determine university entry outcomes, but students affected by grade downgrades had no prior knowledge of the methodology and no practical means of appeal.

The core issue was non-disclosure of the methodology before it was applied, leaving affected students unable to contest decisions that shaped their futures.

Lab 1 — Interrogating Opacity

Practice identifying when and why algorithmic opacity matters

Your Task

You are examining three real-world scenarios where AI systems were used in high-stakes decisions. Use the chat below to work through each scenario with your AI tutor. Discuss: what information was withheld, who was harmed by that withholding, and what a minimally transparent alternative would look like.

Starter prompts: "What made COMPAS opacity legally defensible but ethically contested?" — "How does the UK A-level case differ from COMPAS in terms of who bore the transparency burden?" — "If you were designing a recidivism tool, what would you publish and why?"

AI Tutor — Opacity & Transparency

Lab 1

Welcome to Lab 1. We're examining algorithmic opacity in high-stakes decisions — COMPAS in criminal justice and Ofqual's A-level model in education. Both involve systems where consequential decisions were made using hidden logic. What aspect of opacity would you like to explore first?

Module 3 · Lesson 2

The Right to an Explanation

How law and regulation have tried to force transparency into automated decisions

Does telling someone "an algorithm decided" satisfy any meaningful legal or ethical standard?

When the European Union's General Data Protection Regulation came into force in May 2018, Article 22 gave individuals the right not to be subject to solely automated decisions with significant effects — and Article 13/14 required that individuals be given "meaningful information about the logic involved." Within weeks of GDPR enforcement, privacy advocates filed complaints against companies ranging from credit bureaus to social media platforms, testing whether "meaningful information" could be satisfied by boilerplate disclosures.

Meanwhile in the United States, Eric Loomis had appealed his Wisconsin sentence all the way to the state Supreme Court, arguing that using a proprietary COMPAS score violated his due-process rights. In 2016, the Wisconsin Supreme Court upheld the sentence, ruling that the score was one factor among many and that sufficient safeguards existed. The US Supreme Court declined to hear the case in 2017.

What the Law Actually Requires

GDPR Article 22 is frequently cited as creating a "right to explanation," but its scope is narrower than commonly understood. It applies only to solely automated decisions — the moment a human reviews (even superficially) an algorithmic output, the protection may not apply. Recourse consists of requesting human review and contesting the decision, not necessarily of receiving a full algorithmic disclosure.

The EU AI Act, provisionally agreed in December 2023, goes further. High-risk AI systems — those used in credit scoring, employment, education, and criminal justice — must provide documentation sufficient for a "meaningful explanation" of outputs, and affected individuals must be notified that a high-risk AI system was used in decisions affecting them.

2016 — Loomis v. Wisconsin

Wisconsin Supreme Court upholds COMPAS-informed sentencing. Rules proprietary protection of algorithm does not violate due process when the score is one factor among many.

2018 — GDPR Enforcement Begins

Article 22 creates qualified right against solely automated decisions with significant effects. "Meaningful information about the logic involved" remains legally undefined.

2020 — ICO Guidance (UK)

UK Information Commissioner's Office publishes guidance stating that explanations must be "meaningful, contextual and tailored to the individual." Generic disclosures found insufficient.

2023 — EU AI Act (Provisional)

High-risk AI systems must provide documentation and individual notification. Penalty regime includes fines up to 3% of global annual turnover for compliance failures.

The "Meaningful Information" Problem

The phrase "meaningful information about the logic involved" appears in GDPR but is not defined. Courts and regulators have struggled to specify what it requires in practice. A 2019 case involving credit insurer Allianz found that a statement saying "your application was assessed using an automated credit-scoring system" was not sufficient; affected individuals were entitled to know which factors were weighted and in what direction.

But full model disclosure creates its own problems. Publishing exact weights enables gaming: loan applicants who know that credit utilisation has a higher weight than account age can optimise superficially without improving creditworthiness. Regulators are therefore caught between the transparency needed for accountability and the opacity needed to prevent manipulation.

Practical Gap

Research by the Alan Turing Institute (2019) found that the majority of GDPR Article 22 requests in the financial sector received responses that courts would likely consider inadequate — usually short paragraphs describing model type and input categories, without factor-level weighting or any individualised explanation.

Key Terms

GDPR Article 22EU regulation giving individuals the right not to be subject to solely automated decisions with significant legal or similarly significant effects, and requiring meaningful disclosure of the logic involved.

Due processThe constitutional guarantee (US) and legal principle (EU/UK) that individuals must have a meaningful opportunity to understand and challenge decisions that affect their legal rights.

High-risk AI (EU AI Act)Systems deployed in eight defined domains — including credit, education, employment, and criminal justice — subject to mandatory transparency, documentation, and human oversight requirements.

Gaming resistanceThe property of a model whose disclosed features cannot easily be gamed by applicants to obtain undeserved positive outcomes without genuine underlying improvement.

Lesson 2 Quiz

The Right to an Explanation — check your understanding

Under GDPR Article 22, when does the right against automated decision-making apply?

Correct. The "solely automated" threshold is critical — the involvement of any genuine human review can break the Article 22 trigger, which is why critics argue it is too easy to circumvent.

Article 22 specifically requires the decision to be solely automated. Human involvement, even minimal, can exclude a decision from Article 22's scope.

What was the outcome of Loomis v. Wisconsin (2016)?

Correct. The ruling was widely criticised by legal scholars, but it established the precedent that COMPAS-style scores do not inherently violate due process when the judge considers them alongside other evidence.

The court sided with the state. The US Supreme Court later declined to hear the case, leaving the Wisconsin ruling in place.

What is the "gaming resistance" tension in algorithmic transparency?

Correct. This is the genuine dilemma: complete feature-weight disclosure enables individuals to manipulate inputs strategically, undermining the accuracy the model is meant to provide.

The tension is about model gaming, not cybersecurity or regulatory arbitrage. Knowing exact weights could let loan applicants optimise cosmetically without genuine creditworthiness improvement.

Lab 2 — The Law's Limits

Explore where legal transparency requirements succeed and fail

Your Task

GDPR Article 22 and the EU AI Act represent the strongest transparency mandates in the world — yet researchers still find them widely circumvented in practice. Use the chat to work through the gaps: What makes a transparency requirement meaningful? What makes it hollow? How would you write the law differently?

Starter prompts: "Why is 'solely automated' such a weak threshold in GDPR?" — "What would a genuinely meaningful credit-scoring explanation look like under GDPR?" — "How does the EU AI Act improve on GDPR for transparency, and where does it still fall short?"

AI Tutor — Legal Transparency Frameworks

Lab 2

Welcome to Lab 2. We're examining whether law can force genuine transparency in AI decision-making — GDPR Article 22, the Loomis ruling, and the EU AI Act all attempt this, with mixed results. What aspect of legal transparency requirements would you like to dig into?

Module 3 · Lesson 3

Technical Tools for Transparency

LIME, SHAP, and the science of explaining what models actually do

Can post-hoc explanation methods produce explanations that are both accurate and human-intelligible?

When Scott Lundberg and Su-In Lee published the SHAP (SHapley Additive exPlanations) paper at NeurIPS 2017, it rapidly became the dominant interpretability tool in industry. By 2020, the major US credit bureaus — Equifax, Experian, and TransUnion — had adopted SHAP or LIME to generate the "adverse action notices" required by the Fair Credit Reporting Act: the four reasons a credit application was declined.

In healthcare, researchers at the University of Washington used SHAP values to audit a commercial sepsis-prediction algorithm deployed at a major hospital network. They found that the model's top-weighted feature was the number of prior nursing notes — a proxy for administrative documentation intensity rather than clinical severity. The model was predicting which patients had more nurses, not which patients were sicker. SHAP made the problem visible; without it, the system would have continued operating invisibly.

How LIME Works

LIME (Local Interpretable Model-agnostic Explanations), introduced by Ribeiro, Singh, and Guestrin in 2016, generates explanations by perturbing a specific input and observing how the model's output changes. It fits a simple interpretable model (typically a sparse linear model) in the local neighbourhood of the prediction, producing feature-importance values for that particular instance.

The key limitation is locality: LIME explains behaviour around a single point and can produce contradictory explanations for nearby points. A loan applicant with income of $45,000 and one of $46,000 may receive meaningfully different LIME explanations despite near-identical underlying model logic.

How SHAP Works

SHAP grounds feature importance in Shapley values from cooperative game theory: each feature is credited with the marginal contribution it makes to the prediction across all possible orderings of features. This produces globally consistent attributions — a feature's SHAP value means the same thing across the entire dataset.

SHAP satisfies three desirable axioms: efficiency (attributions sum to the prediction), symmetry (features with identical contributions receive identical values), and dummy (features with no effect receive zero attribution). TreeSHAP, the variant for gradient-boosted trees, runs in polynomial time and is fast enough for production use.

LIME — Strengths & Limits

Strengths: Model-agnostic, fast, intuitive output. Limits: Locally inconsistent across nearby inputs; instability documented by Alvarez-Melis & Jaakkola (2018); perturbation distribution may not reflect real data manifold.

SHAP — Strengths & Limits

Strengths: Theoretically grounded, globally consistent, additive. Limits: Computationally expensive for neural networks; assumes feature independence in KernelSHAP; Shapley values can be counterintuitive for correlated features.

The Sepsis Algorithm Finding

The University of Washington's audit found that "number of nursing notes" ranked as the model's highest SHAP-value feature — not clinical vitals or lab results. This reflected that sicker patients accumulate more documentation, making note-count a statistical proxy for severity. SHAP exposed a correlation the model was exploiting but that had no causal clinical meaning.

Limitations of Post-Hoc Explanation

Post-hoc methods explain model behaviour, not model correctness. If a model has learned a spurious correlation — as in the sepsis case — SHAP faithfully reports that the model relies on it. The explanation is accurate but the underlying decision process is flawed. This distinction is critical: explainability and fairness are not the same property.

Rudin (2019) argues in "Stop Explaining Black Box Machine Learning Models for High Stakes Decisions" that post-hoc explanations are inherently untrustworthy for high-stakes use because they are approximations of behaviour that may diverge in precisely the cases that matter most. She advocates instead for inherently interpretable models — decision trees, logistic regression, scorecard models — whose logic is the prediction process itself.

Key Terms

LIMELocal Interpretable Model-agnostic Explanations. A post-hoc method that generates feature importance for a single prediction by fitting a simple model to locally perturbed data points.

SHAPSHapley Additive exPlanations. A post-hoc method grounded in game-theoretic Shapley values that attributes each feature's contribution to a prediction in a globally consistent way.

Post-hoc explanationAn explanation generated after a model has made a prediction, which approximates model behaviour without altering the model itself — distinct from inherently interpretable models.

Adverse action noticeA US regulatory requirement under the Fair Credit Reporting Act to inform applicants of the primary reasons a credit decision was unfavourable, typically expressed as up to four top factors.

Lesson 3 Quiz

Technical Tools for Transparency — check your understanding

What did the University of Washington's SHAP audit of a hospital sepsis model reveal?

Correct. This finding demonstrated that SHAP can expose dangerous spurious correlations — the model was statistically correct but clinically misleading, using documentation volume as a proxy for illness severity.

The audit found that note count — not clinical vitals — was the dominant feature. This was a proxy correlation, not a causal clinical signal.

Which theoretical framework underpins SHAP's approach to feature attribution?

Correct. Shapley values — originally developed by Lloyd Shapley for distributing cooperative game payoffs fairly among players — are adapted in SHAP to attribute prediction value across input features.

That describes gradient-based methods (saliency maps) or LIME. SHAP uses Shapley values from cooperative game theory for globally consistent attribution.

According to Cynthia Rudin's 2019 argument, what is the core problem with post-hoc explanations for high-stakes decisions?

Correct. Rudin's key point is that explanations that approximate behaviour can be systematically wrong at distribution boundaries — precisely where high-stakes decisions concentrate.

Rudin's concern is epistemic, not practical. The problem is that post-hoc explanations are approximations and may be least accurate in the edge cases where accuracy matters most.

Lab 3 — Applying SHAP & LIME

Work through explanation method trade-offs with your AI tutor

Your Task

You are a data scientist asked to add explainability to a credit-scoring model at a bank subject to GDPR. The model is a gradient-boosted tree. You must choose an explanation method, generate adverse action notices, and defend your choices to a regulator. Use the chat to work through the technical and ethical trade-offs.

Starter prompts: "Should I use LIME or SHAP for a gradient-boosted credit model — why?" — "How do I turn SHAP values into a plain-language adverse action notice?" — "What are the risks of using SHAP values directly as legally binding explanations?"

AI Tutor — SHAP, LIME & Explanation Methods

Lab 3

Welcome to Lab 3. You're a data scientist adding explainability to a GDPR-regulated credit model. The core tension: SHAP is theoretically principled but can be counterintuitive for correlated features; LIME is fast but locally unstable. What would you like to work through first?

Module 3 · Lesson 4

Building Transparent Systems

Design principles, audit regimes, and the organisations getting it right

What does a genuinely transparent AI decision system look like — and how do you build one?

In November 2021, New York City passed Local Law 144, the first US law requiring bias audits of automated employment decision tools. Companies using AI in hiring or promotion must commission independent bias audits and publish summary results before deploying the system. By January 2023, compliance had exposed a practical gap: several major vendors refused to publish audit results, claiming the requirement was preempted by federal law. The enforcement debate crystallised what transparency legislation alone cannot solve — you can mandate disclosure but not candour.

Amsterdam took a different approach. In 2020, the city government published an Algorithm Register — a public database of every algorithm used by municipal services, including its purpose, data sources, decision impact, and a risk classification. By 2023, over 90 algorithms had been documented. Rotterdam and Helsinki followed with similar registers. The European Commission cited Amsterdam's approach as a model in its AI Act implementation guidance.

Design Principles for Transparent Decision Systems

Transparency is most tractable when it is designed in from the start, not retrofitted after deployment. Several principles have emerged from the research literature and from organisations that have done this well:

1. Use the simplest model that meets performance requirements. Logistic regression with calibrated probabilities is fully transparent. A gradient-boosted tree requires SHAP. A deep neural network may be practically inexplicable at the individual level. The choice of model is a transparency choice.

2. Maintain a model card. Introduced by Mitchell et al. at Google (2019), model cards document intended use, evaluation metrics disaggregated by subgroup, known limitations, and recommended deployment conditions. Several major organisations now publish them as standard.

3. Conduct pre-deployment disparate impact analysis. Before deployment, measure whether the model produces statistically significant differences in outcomes across protected groups. The legal standard in US employment law is the 4/5ths rule: if the selection rate for any group is less than 80% of the highest group's rate, disparate impact is indicated.

90+
Algorithms documented in Amsterdam's public Algorithm Register by 2023
4/5
The legal threshold for disparate impact in US employment decisions (80% rule)
2019
Year Google's Mitchell et al. published the model card framework, now widely adopted

Algorithm Registers as Infrastructure

Amsterdam's Algorithm Register works because it treats algorithmic transparency as a public infrastructure problem, not a disclosure compliance exercise. Each entry includes not just what the algorithm does but who commissioned it, what decision it informs, whether humans review outputs, what appeals process exists, and when it was last audited. The register is searchable and machine-readable.

The practical lesson is that transparency requires standardised vocabulary. When every organisation can define "meaningful explanation" however it chooses, comparisons are impossible and accountability is undermined. Registers force a common schema.

NYC Local Law 144 — What It Requires

Employers using automated employment decision tools in NYC hiring or promotion must: (1) commission an independent bias audit within one year of deployment, (2) make a summary of results publicly available on their website, (3) notify candidates and employees that an AEDT was used, and (4) provide, on request, the data categories the tool relies on. First civil penalties were issued in 2023.

What Audits Cannot Catch

Audits of deployed systems face a fundamental limitation: they measure the model's behaviour on the data it has seen. If the model is deployed on a population that differs from the audit sample — in demographic composition, economic conditions, or the tasks they perform — the audit findings may not apply. This is the distribution shift problem, and it is particularly acute in credit and hiring systems that operate in dynamic labour and financial markets.

A hiring algorithm audited against 2019 applicant data may not reflect the model's behaviour against 2024 applicants who have optimised their applications using AI-assisted tools. Transparency obligations should therefore include ongoing monitoring, not just pre-deployment audit — a requirement the EU AI Act addresses through its post-market monitoring obligations for high-risk systems.

Key Terms

Model cardA structured document describing a machine learning model's intended use, performance metrics disaggregated by subgroup, known limitations, and deployment recommendations, introduced by Mitchell et al. (2019).

Algorithm registerA publicly accessible database documenting algorithms used by an organisation — particularly government — including purpose, data sources, decision impact, and audit status. Pioneered by Amsterdam in 2020.

NYC Local Law 1442021 New York City law requiring independent bias audits and public disclosure for automated employment decision tools used in hiring or promotion within NYC.

Distribution shiftThe condition in which the statistical properties of data on which a model is deployed differ from the properties of the data on which it was trained or audited, potentially invalidating earlier performance assessments.

The Practical Takeaway

Transparency is not a single act — it is a system of practices: choosing interpretable models where possible, documenting them with model cards, registering them publicly, auditing them before deployment, monitoring them in production, and providing individuals with meaningful recourse. Each element compensates for the gaps in the others.

Lesson 4 Quiz

Building Transparent Systems — check your understanding

What is the primary purpose of Amsterdam's Algorithm Register?

Correct. The register is an accountability infrastructure — a structured, public, machine-readable record of algorithmic activity in government that enables scrutiny by journalists, researchers, and citizens.

The register is a transparency and accountability tool, not a testing platform or opt-out system. It documents what algorithms exist and what they do.

What is the 4/5ths (80%) rule in US employment discrimination law?

Correct. The 4/5ths rule is the EEOC's Uniform Guidelines standard for adverse impact — it provides a threshold for when statistical disparities in hiring outcomes trigger scrutiny.

The 4/5ths rule measures selection-rate ratios across protected groups — if any group's selection rate falls below 80% of the highest group's rate, adverse impact is indicated.

Why does distribution shift pose a threat to the validity of pre-deployment bias audits?

Correct. An audit is a snapshot — if the model subsequently encounters a population that differs from the audit sample (due to demographic change, economic shifts, or applicant behaviour change), the audit findings are no longer reliable.

The threat is that the real population on which the model operates may differ from the population on which it was audited, making the audit's conclusions invalid for actual deployment conditions.

Lab 4 — Design a Transparent System

Apply module concepts to a real design challenge

Your Task

A city government wants to deploy an algorithm to prioritise housing maintenance inspections in low-income neighbourhoods. The algorithm uses publicly available data: complaint history, building age, inspection records, and census demographics. You are the responsible AI lead. Design the transparency architecture for this system — audit, register entry, explanation approach, and appeals process.

Starter prompts: "What model type should I choose to maximise transparency for housing inspection prioritisation?" — "How would I structure this algorithm's entry in a public register?" — "What appeals process should residents have if they believe the algorithm wrongly deprioritised their building?"

AI Tutor — Transparent System Design

Lab 4

Welcome to Lab 4. You're designing transparency architecture for a municipal housing inspection algorithm — model choice, public register documentation, explanation methods, and appeals. This pulls together everything in Module 3. Where would you like to start your design?

Module 3 Test

Transparency in Decision Making — 15 questions · Pass at 80%

1. ProPublica's 2016 investigation found COMPAS was how much more likely to falsely flag Black defendants as high-risk compared to white defendants?

Correct. The 2× false-positive rate disparity is the core empirical finding of the ProPublica investigation.

ProPublica found the false-positive rate for Black defendants was approximately twice that for white defendants.

2. What legal protection did Northpointe use to prevent defendants from accessing COMPAS's algorithm?

Correct. Northpointe invoked trade-secret law throughout the litigation, and courts generally upheld this protection.

Northpointe relied on trade-secret law, arguing that commercial algorithms are proprietary business information.

3. In the UK's 2020 A-level grading crisis, what percentage of teacher-predicted grades were downgraded by Ofqual's algorithm?

Correct. Ofqual's standardisation model downgraded 39% of centre-assessed grades, disproportionately harming state-school students.

Ofqual's model downgraded 39% of teacher-predicted grades — a figure widely reported during the resulting protests.

4. GDPR Article 22's right against automated decision-making is triggered only when decisions are made in what manner?

Correct. The "solely automated" threshold is a critical limitation — human review, even superficial, can remove a decision from Article 22's scope.

The trigger is "solely automated" decisions with significant effects. Human review — even nominal — can break this threshold.

5. The Loomis v. Wisconsin case established which principle?

Correct. The Wisconsin Supreme Court ruled that the score's proprietary nature was acceptable because the judge used it as one factor, not as a sole determinant.

The ruling actually upheld the use of a proprietary score, finding no due-process violation when the score was one among many sentencing factors.

6. LIME generates local explanations by doing which of the following?

Correct. LIME creates a local perturbation sample, gets the model's predictions for those perturbed points, and fits a sparse linear model to approximate local behaviour.

LIME perturbs the input and fits a simple linear approximation locally — it is not gradient-based and is distinct from SHAP's Shapley-value approach.

7. Which three axioms does SHAP satisfy that most other attribution methods do not?

Correct. These three Shapley axioms are what distinguish SHAP as theoretically principled — they guarantee consistent and fair attribution across features.

The three Shapley axioms are efficiency, symmetry, and dummy — they guarantee that attributions are consistent, fair, and sum to the full prediction value.

8. Cynthia Rudin's (2019) alternative to post-hoc explanation methods is to use what?

Correct. Rudin argues that for high-stakes decisions, the right solution is to build interpretable models rather than explain opaque ones — transparency baked in, not bolted on.

Rudin advocates for inherently interpretable models — decision trees, logistic regression, scorecards — where the model's logic is transparent by design, not approximated after the fact.

9. What did the University of Washington's SHAP audit of a hospital sepsis prediction model reveal about its primary feature?

Correct. The model had learned that sicker patients generate more nursing documentation — a true correlation but a causally meaningless one, making the model's logic clinically unsound.

SHAP revealed that note count was the top feature — the model had picked up that sicker patients generate more documentation, using this as a proxy for illness severity.

10. NYC Local Law 144 requires employers using automated employment decision tools to do which of the following?

Correct. Law 144 requires independent audit and public disclosure of results — it does not require source code review or consent forms, but does require candidate notification.

Law 144 requires independent bias audits and public publication of results — and notification to candidates that an AEDT was used.

11. What is the US employment law 4/5ths (80%) rule used to detect?

Correct. The 4/5ths rule is the EEOC's standard statistical threshold for detecting adverse impact in selection procedures, including algorithmic ones.

The 4/5ths rule compares selection rates across groups — if any group is selected at less than 80% the rate of the highest group, adverse impact is flagged.

12. Amsterdam's Algorithm Register was notable for being which of the following?

Correct. The register's value lies in its public accessibility, standardised schema, and machine-readability — enabling journalistic, academic, and civil society scrutiny.

Amsterdam's register is public-facing and searchable — it treats algorithmic transparency as public infrastructure, not internal compliance documentation.

13. Model cards, as introduced by Mitchell et al. at Google (2019), include which of the following elements?

Correct. Model cards are structured documentation templates — their key contribution is disaggregated performance metrics, which reveal performance disparities that aggregate metrics conceal.

Model cards document intended use, subgroup-disaggregated performance, known limitations, and deployment guidance — not source code or legal disclaimers.

14. What is the core risk of distribution shift for pre-deployment bias audits?

Correct. Audits are snapshots. If the model is deployed against a shifted population — due to demographics, behaviour, or economic context — the audit findings may not hold.

The risk is that the deployment population differs from the audit sample, making the audit's conclusions about fairness and accuracy potentially inapplicable to real users.

15. Which of the following best captures the "gaming resistance" tension in transparency design?

Correct. This is the genuine dilemma regulators face: enough transparency to enable accountability, but not so much that it enables strategic gaming that degrades model accuracy.

The tension is about applicant gaming — knowing exact feature weights lets people optimise cosmetically without genuinely improving their underlying situation, which undermines the model's purpose.