AI Risk for Business Leaders · Module 4 · Lesson 1

The GDPR and AI: Data Protection as Legal Liability

When an algorithm decides your credit score or job application, regulators want to know why — and they will fine you if you can't explain it.

In November 2023, the Swedish data protection authority issued a €12.3 million fine against Spotify for GDPR violations related to inadequate disclosure of how user data was being processed — including data fed into recommendation algorithms. The case illustrated a structural tension that now confronts every business deploying AI: the right of individuals to understand decisions made about them collides directly with the opacity of machine learning systems.

The EU's General Data Protection Regulation, operative since May 2018, was not written with transformers or gradient-boosted trees in mind. Yet its principles — lawfulness, transparency, data minimisation, purpose limitation — apply with full legal force to every AI system that processes personal data. Businesses that treat GDPR as a cookie-consent checkbox while deploying AI-driven hiring, pricing, or credit tools are accumulating significant unpriced legal risk.

Article 22 and the Right to Explanation

Article 22 of the GDPR grants individuals the right not to be subject to a decision based solely on automated processing where that decision produces legal or similarly significant effects. This includes credit scoring, insurance underwriting, recruitment screening, and loan origination. When such processing does occur — with consent or contractual necessity — the data controller must provide meaningful information about the logic involved, the significance, and the envisaged consequences.

The word "meaningful" has generated enormous legal debate. In 2022, the Austrian Data Protection Authority ruled that a fitness app's transfer of user data to Google Analytics — even anonymised data — constituted a violation because it could not guarantee data wouldn't be accessed under US surveillance law. The principle extends to AI: if you cannot explain the variables your model uses, their weights, and how they combine to produce an output about a specific individual, you may be unable to demonstrate GDPR compliance.

The practical implication for business leaders is stark: deploying a black-box model against EU residents for consequential decisions is a compliance gamble. Interpretable models, post-hoc explanation tools (SHAP, LIME), and documented model cards are now elements of legal risk management — not merely good ML engineering practice.

Data Minimisation vs. the Data Hunger of AI

Machine learning models generally improve with more data. GDPR's data minimisation principle says you should collect only what is adequate, relevant, and limited to the processing purpose. These two imperatives are in direct tension. Businesses building or fine-tuning AI systems on customer data must conduct Data Protection Impact Assessments (DPIAs) — mandatory under Article 35 for high-risk processing — before deployment, not after an incident.

In 2022, the Italian Data Protection Authority (Garante) ordered Clearview AI to stop processing Italian residents' facial recognition data and imposed a €20 million fine — the maximum permitted under GDPR. Clearview had scraped billions of public images to train a facial recognition system without any lawful basis for that processing. The case established that training data provenance is itself a compliance question, not merely a downstream application issue. If you fine-tune a model on customer data, you must have a lawful basis for each processing activity involved in that fine-tuning.

Enforcement Reality

Between 2018 and 2024, GDPR fines exceeded €4.5 billion in aggregate. Meta has been fined over €1.3 billion for transatlantic data transfers alone. The Irish Data Protection Commission and French CNIL have both opened investigations explicitly targeting AI training data practices. Fines can reach 4% of global annual turnover — for a company with €10 billion revenue, that is €400 million at maximum exposure.

Lawful Basis and Consent for AI Processing

Six lawful bases exist under GDPR Article 6. For AI deployments the most commonly invoked are: legitimate interests (Article 6(1)(f)), contract performance (6(1)(b)), and consent (6(1)(a)). Each carries different obligations. Legitimate interests requires a balancing test — the controller's interest must not override the fundamental rights of the data subject. Consent must be freely given, specific, informed, and unambiguous; bundling AI profiling consent into terms of service has been repeatedly rejected by regulators.

For special category data — health, biometric, racial or ethnic origin — the bar is higher still: you need an Article 9 condition in addition to an Article 6 basis. AI systems that infer health status from behavioural data, or that use facial recognition, almost certainly process special category data. The 2023 enforcement action against TikTok by Ireland's DPC, resulting in a €345 million fine, highlighted how AI-driven recommendation systems that process children's data attract heightened scrutiny even when data is nominally behavioural rather than explicitly categorised.

Key Terms

DPIA (Data Protection Impact Assessment): Mandatory pre-deployment risk assessment for high-risk AI processing under GDPR Article 35. Must identify risks to data subject rights and describe mitigation measures.

Article 22: Right not to be subject to solely automated decisions with significant effects. Requires human review on request and meaningful explanation of logic.

Lawful Basis: One of six GDPR-recognised justifications for processing personal data. Without one, processing is unlawful regardless of business purpose.

Lesson 1 Quiz

3 questions — untracked, retake anytime.

1. Under GDPR Article 22, what must an organisation provide when making solely automated decisions with significant effects on individuals?

✓ Correct. Article 22 requires controllers to provide meaningful information about the logic involved — not source code or full model documentation, but enough for the individual to understand and contest the decision.

✗ Incorrect. Article 22 requires meaningful information about the logic, significance, and consequences — a standard focused on explainability and individual rights, not technical documentation or source code.

2. The Italian DPA fined Clearview AI €20 million primarily because:

✓ Correct. The Garante found no lawful basis for Clearview's mass scraping and processing of biometric data, establishing that training data provenance is itself a GDPR compliance question.

✗ Incorrect. The core violation was the absence of any lawful basis for processing biometric data — not accuracy, DPO appointment, or transfer mechanisms.

3. Which GDPR principle most directly conflicts with the data appetite of machine learning model training?

✓ Correct. Data minimisation directly collides with ML's need for large, diverse datasets. Models typically improve with more data, while GDPR requires limiting collection to what is necessary — creating a structural tension organisations must actively manage.

✗ Incorrect. While other principles are important, data minimisation is the principle most structurally in tension with ML development practices, which benefit from large and varied training datasets.

Lab 1: GDPR Compliance Analyser

Practice mapping AI deployment scenarios to GDPR obligations and risk.

Scenario-Based GDPR Risk Analysis

You are a senior legal or compliance officer advising on AI deployments. Use this lab to stress-test real scenarios against GDPR requirements: lawful basis, Article 22 obligations, DPIA triggers, special category data risks, and enforcement exposure. Describe your AI use case and get structured compliance analysis.

Try asking: "We want to use an AI to screen CV applications from EU candidates and auto-reject the bottom 60%. What GDPR obligations apply and what is our exposure?"

GDPR AI Compliance Lab Lab 1

AI Risk for Business Leaders · Module 4 · Lesson 2

The EU AI Act: A Risk-Based Regulatory Architecture

Europe's landmark AI law creates four tiers of regulation — understanding which tier your AI sits in determines your compliance obligations, your timelines, and your liability exposure.

The EU AI Act entered into force on 1 August 2024, becoming the world's first comprehensive binding legal framework for artificial intelligence. Its prohibitions on unacceptable-risk AI began applying six months later; obligations for general-purpose AI model providers began in August 2025; and the full high-risk AI system requirements apply from August 2026. For businesses already deploying AI or planning to deploy, the compliance clock is running.

Unlike GDPR — which is fundamentally about data — the AI Act regulates the AI system itself: how it is built, deployed, monitored, and documented. It imposes different obligations on providers (those who develop or place AI on the market) and deployers (those who use AI in a professional context). Crucially, a company that purchases an AI system from a vendor and deploys it in a consequential context may still bear deployer obligations — including maintaining human oversight and conducting fundamental rights impact assessments for certain high-risk systems.

The Four-Tier Risk Architecture

Unacceptable Risk (Prohibited): A small category of AI practices banned outright from the EU market. These include social scoring by public authorities, real-time remote biometric identification in public spaces (with narrow law enforcement exceptions), subliminal manipulation that impairs free will, and AI that exploits vulnerabilities of specific groups. Companies building or selling these systems into EU markets face penalties of up to €35 million or 7% of global annual turnover.

High Risk: The largest and most consequential category for business. High-risk AI systems are those used in: critical infrastructure, education and vocational training, employment and HR (hiring, promotion, task management), essential private services (credit scoring, insurance risk assessment), migration and border control, administration of justice, and systems that are safety components of products regulated under EU product law. All high-risk systems must comply with extensive requirements before market placement.

Limited Risk: Systems with specific transparency obligations — primarily chatbots and deepfake-generating systems. Users must be informed they are interacting with AI or viewing AI-generated content. No pre-market compliance requirements, but post-deployment transparency is mandatory.

Minimal Risk: The vast majority of AI applications — spam filters, AI-powered spreadsheet features, most recommendation engines not falling into high-risk categories. No mandatory compliance obligations beyond existing law.

High-Risk Requirements: What Business Leaders Must Know

If your organisation is a provider of a high-risk AI system — or a deployer of one used in an HR, credit, or essential services context — the AI Act imposes a suite of obligations that represent a significant operational and legal commitment:

Risk Management System: A documented, continuous process identifying and mitigating risks throughout the AI system's lifecycle. Not a one-time assessment — a living programme.

Data Governance: Training, validation, and test datasets must be relevant, representative, free of errors, and complete. This requirement alone has significant implications for organisations using historical datasets that may embed past discriminatory practices.

Technical Documentation: Providers must maintain detailed technical documentation demonstrating compliance. This documentation must be available to regulators on request.

Transparency and User Information: High-risk systems must provide deployers with instructions for use, including the system's purpose, accuracy levels, human oversight measures, and known limitations.

Human Oversight: High-risk systems must be designed to allow natural persons to understand, monitor, and override automated outputs. This is not optional human-in-the-loop theatre — it must be effective and documented.

Accuracy, Robustness, and Cybersecurity: Systems must achieve appropriate levels of performance and be resilient to adversarial inputs, errors, and inconsistencies.

Conformity Assessment: Before market placement, high-risk systems must undergo conformity assessment — either self-assessment or third-party assessment depending on the category. A CE mark equivalent is required.

Deployer vs. Provider: The Liability Split

Under the AI Act, providers (developers/vendors) bear primary compliance obligations. But deployers — companies that put the AI to work in their business — carry obligations too: conducting fundamental rights impact assessments for high-risk AI, ensuring effective human oversight, logging system operation, and reporting serious incidents. A company that buys an off-the-shelf AI hiring tool and uses it without modification is a deployer — and is legally responsible for how it deploys that tool.

General Purpose AI Models and Frontier Risk

The AI Act includes a dedicated regime for general-purpose AI (GPAI) models — large language models and foundation models that can be adapted to a wide range of downstream tasks. Providers of all GPAI models must maintain technical documentation, comply with EU copyright law, and publish summaries of training data. Providers of models with systemic risk — currently defined as models trained using more than 10^25 FLOPs — face additional obligations: model evaluation, adversarial testing, incident reporting to the European AI Office, and cybersecurity measures.

As of 2024, the models in scope for systemic risk designation included GPT-4, Gemini Ultra, Claude 3, and Llama 3 400B. Businesses building products on top of these models are downstream deployers — they inherit obligations but also need to understand what their GPAI provider is and is not warranting about the underlying model's compliance.

Penalty Structure

Prohibited AI practices: Up to €35 million or 7% of global annual turnover, whichever is higher.

High-risk system non-compliance: Up to €15 million or 3% of global annual turnover.

Providing incorrect information to authorities: Up to €7.5 million or 1% of global annual turnover.

For SMEs, the lower of the monetary amount or percentage applies. Penalties are administered by national market surveillance authorities coordinated by the European AI Office.

Lesson 2 Quiz

3 questions — untracked, retake anytime.

1. Under the EU AI Act, which of the following AI deployments would most likely be classified as HIGH RISK?

✓ Correct. AI systems used in employment and HR — including candidate ranking and screening — are explicitly listed in Annex III of the AI Act as high-risk, triggering the full suite of conformity, documentation, and human oversight requirements.

✗ Incorrect. AI used in employment decisions — including candidate ranking — is explicitly classified as high-risk under Annex III of the EU AI Act. Playlist recommendation, customer service chatbots, and ticket categorisation fall under limited or minimal risk categories.

2. A company purchases an AI credit-scoring tool from a third-party vendor and uses it to make lending decisions for EU customers. Which statement best describes their legal position under the EU AI Act?

✓ Correct. Deployers of high-risk AI systems carry their own legal obligations under the AI Act — they cannot outsource all responsibility to the vendor. Human oversight, impact assessments, and incident reporting duties fall on the deployer.

✗ Incorrect. The AI Act imposes specific duties on deployers, independent of provider compliance. These include ensuring effective human oversight, conducting fundamental rights impact assessments, and reporting serious incidents.

3. What threshold currently defines a "systemic risk" general-purpose AI model under the EU AI Act, triggering additional obligations including adversarial testing and incident reporting?

✓ Correct. The AI Act defines systemic risk GPAI models by their training compute — above 10²⁵ FLOPs. This threshold currently captures the largest frontier models including GPT-4, Gemini Ultra, and Claude 3.

✗ Incorrect. The AI Act's systemic risk threshold is training compute: 10²⁵ FLOPs. Parameter count, capability descriptions, and user reach are not the defined threshold — though the European AI Office may update the threshold as compute scaling continues.

Lab 2: EU AI Act Risk Classifier

Classify AI systems into the Act's risk tiers and map the resulting compliance obligations.

AI Act Compliance Planning Tool

Use this lab to classify your AI systems or proposed deployments under the EU AI Act's four-tier risk architecture. Describe the AI system — its purpose, who uses it, what decisions it informs, and whether it operates in the EU market. Get a risk-tier classification with rationale, the specific legal obligations that apply, and a prioritised action list for compliance.

Try asking: "We're building an AI model that analyses employee performance data to recommend who should be considered for promotion. We operate in Germany and France. What tier is this and what do we need to do?"

EU AI Act Risk Classifier Lab 2

AI Risk for Business Leaders · Module 4 · Lesson 3

Discrimination, Bias Liability, and Employment Law

Neutral-looking algorithms can produce discriminatory outcomes — and regulators, courts, and plaintiffs are beginning to hold companies legally accountable when they do.

Amazon built a machine learning hiring tool designed to automate CV screening. Trained on ten years of submitted applications — the vast majority from men, reflecting the male-dominated tech industry — the model learned to systematically downgrade CVs from women. It penalised CVs that included the word "women's" (as in "women's chess club") and downgraded graduates of all-women's colleges. Amazon scrapped the project in 2018, having never used the tool in actual hiring decisions. But the episode became one of the most cited examples in AI bias literature — a system learned to replicate historical discrimination with algorithmic precision.

Amazon's case was a self-discovered internal failure. Other organisations have faced external enforcement. In 2023, the EEOC — the US Equal Employment Opportunity Commission — published guidance explicitly warning that employers using AI for hiring decisions remain liable under Title VII, the ADA, and the ADEA even if the discriminatory output was generated by a third-party vendor's algorithm. The vendor does not absorb your liability.

Disparate Impact: How Neutral Systems Produce Illegal Outcomes

Anti-discrimination law in both the US and EU distinguishes between disparate treatment (intentional discrimination) and disparate impact (facially neutral practices that disproportionately harm a protected class). The disparate impact doctrine — established by the US Supreme Court in Griggs v. Duke Power Co. (1971) and embedded in EU Equal Treatment Directives — does not require proof of discriminatory intent. If an AI selection tool produces statistically significant adverse effects on a protected group, and cannot be justified as job-related and consistent with business necessity, it may constitute illegal discrimination.

The four-fifths rule (or 80% rule), used by the EEOC as a rule of thumb, flags adverse impact when a protected group's selection rate is less than 80% of the rate for the group with the highest selection rate. An AI hiring tool that selects 40% of white male applicants but only 25% of Black female applicants has an adverse impact ratio of 62.5% — well below the 80% threshold. The burden then shifts to the employer to demonstrate job-relatedness and business necessity, and that no equally valid, less discriminatory alternative exists.

In 2023, the city of New York enacted Local Law 144, requiring employers using AI hiring tools to conduct and publish annual bias audits. The law was the first of its kind in the US and signalled a regulatory direction that has since been followed by Colorado (insurance algorithms), Illinois (AI video interview analysis), and Maryland (facial recognition in hiring) with their own AI-specific employment laws.

The HireVue and iCIMS Cases: Vendor Liability and Audit Requirements

HireVue, a major AI-based video interview assessment platform, faced a 2019 complaint from the Electronic Privacy Information Center (EPIC) to the FTC, alleging that its system made employment assessments based on video and voice data using criteria that were not validated for job-relatedness. HireVue subsequently dropped facial recognition from its platform in 2021, citing algorithmic explainability concerns. The episode illustrates how regulatory and reputational pressure can force product pivots that significantly affect any business that has built on top of the vendor's capabilities.

For business leaders, the HireVue case generates two practical risk exposures: First, vendor dependency risk — a vendor product change can disrupt your hiring process midstream. Second, co-liability risk — using a third-party AI tool does not insulate you from discrimination claims. The EEOC's 2023 technical assistance document makes clear that employers are responsible for the discriminatory impact of any tool they use, even if they did not build it.

The Corinthia Bank Precedent — EU Employment AI

In 2022, the Austrian Labour Court ruled that an employer's use of an AI-based scheduling system that systematically assigned less desirable shifts to employees who had taken parental leave constituted indirect sex discrimination under EU law, since parental leave is disproportionately taken by women. The system was neutral on its face — it optimised operational coverage. The outcome was discriminatory. The court required the employer to demonstrate that the scheduling algorithm's business justification was proportionate and that no less discriminatory scheduling approach existed.

Proactive Steps: Bias Audits, Documentation, and Disparate Impact Testing

The legal and reputational risk of AI discrimination is asymmetric: the cost of an undetected discriminatory system that reaches regulatory attention or litigation is vastly higher than the cost of proactive bias testing. Business leaders deploying AI in HR, credit, insurance, or any decisioning role affecting protected groups should consider the following:

Pre-deployment disparate impact testing: Run your model's outputs against demographic breakdowns of your applicant or customer population before go-live. Document the analysis and the decisions made in response to any adverse impact findings.

Regular post-deployment audits: Populations shift, model drift occurs, and the interaction of your AI with real-world data may produce impacts not visible in pre-deployment testing. New York's Local Law 144 mandates annual independent bias audits with public disclosure — treat this as a template for best practice, regardless of jurisdiction.

Vendor contracts and representations: Require vendors of AI decisioning tools to provide bias audit results, demographic performance data across protected classes, and contractual representations about job-relatedness validation. Ensure your contract includes audit rights.

Human review processes: For high-stakes decisions — hiring, termination, credit denial — ensure documented human review exists. This does not mean a human rubber-stamps the AI; it means a human can explain the decision independently of the algorithmic output.

Regulatory Landscape — US AI Discrimination Enforcement

EEOC (Employment): Issued 2023 technical assistance on AI and Title VII/ADA/ADEA. Employers liable for discriminatory AI outputs regardless of vendor origin.

CFPB (Credit): 2022 circular clarified that "black box" credit models cannot be used to deny credit without providing specific, comprehensible adverse action reasons — explainability is a legal requirement under ECOA and FCRA.

HUD (Housing): Fair Housing Act applies to algorithmic property advertising targeting and tenant screening tools. Facebook settled a DOJ housing discrimination complaint in 2022 related to its ad targeting algorithms.

Lesson 3 Quiz

3 questions — untracked, retake anytime.

1. Under the disparate impact doctrine, an employer using an AI hiring tool can face discrimination liability even without discriminatory intent if:

✓ Correct. Disparate impact does not require intent — it requires demonstrating that a facially neutral practice produces statistically significant adverse effects on a protected group. The employer must then justify the practice as job-related and necessary, with no less discriminatory alternative available.

✗ Incorrect. Disparate impact liability turns on statistical adverse effects that cannot be justified as job-related and necessary — not on complaints alone, vendor audit status, or whether demographic data was an explicit input.

2. What is the EEOC's "four-fifths rule" and how does it apply to AI hiring tools?

✓ Correct. The four-fifths (80%) rule is the EEOC's primary statistical threshold for identifying adverse impact. If a protected group's selection rate is less than 80% of the highest-selected group's rate, adverse impact is presumed and the employer must justify the practice.

✗ Incorrect. The four-fifths rule is a statistical threshold: a protected group's selection rate must not fall below 80% of the highest-selected group's rate. It does not govern training data sourcing, model accuracy requirements, or audit frequency.

3. In 2022, the CFPB issued a circular clarifying that lenders using AI credit models must:

✓ Correct. The CFPB circular clarified that ECOA and FCRA require specific, comprehensible adverse action reasons — explainability is a legal obligation, not an engineering preference. Black-box models that cannot produce these reasons are non-compliant.

✗ Incorrect. The CFPB's key clarification was on adverse action notices: credit decisions must be accompanied by specific, comprehensible reasons that the applicant can understand — making AI explainability a legal requirement in credit decisioning.

Lab 3: Bias and Discrimination Risk Advisor

Identify bias and discrimination exposure in your AI deployment and build a mitigation plan.

AI Discrimination Risk Assessment

This lab helps you identify discrimination and bias risks in your AI deployment, understand your legal exposure under employment, credit, and fair housing laws, and develop practical mitigation strategies. Describe your AI use case — including what decisions it supports, what data it uses, and what population it affects — and get a structured risk and remediation analysis.

Try asking: "We use a third-party AI platform that scores job candidates based on psychometric data from an online assessment. We haven't conducted a bias audit. What are our legal risks and what should we do first?"

Bias & Discrimination Risk Lab Lab 3

AI Risk for Business Leaders · Module 4 · Lesson 4

Intellectual Property, Liability, and Emerging Litigation Fronts

AI generates text, images, and code — but who owns it, who is liable when it goes wrong, and what does the first wave of AI litigation tell us about where the law is heading?

In December 2023, The New York Times filed suit against OpenAI and Microsoft in the Southern District of New York, alleging that millions of Times articles were used without authorisation to train ChatGPT and related models. The complaint included exhibits showing ChatGPT reproducing substantial portions of copyrighted Times articles verbatim — a phenomenon called "memorisation" in ML literature. The Times alleged both direct copyright infringement and contributory infringement, and sought statutory damages that, if awarded at maximum per-work rates, could theoretically reach billions of dollars.

The Times case is one of dozens of IP lawsuits now working through US courts. Artists' suits against Stability AI, Midjourney, and DeviantArt (Kelly v. Stability AI, filed 2023), musicians' suits regarding AI music generation trained on copyrighted recordings, and Getty Images' suit against Stability AI in both the US and UK courts collectively define the contours of the most consequential unresolved legal question in AI: does training a model on copyrighted material constitute infringement, and if so, what is the remedy?

US copyright law does not protect ideas, only expression. But training on copyrighted works involves copying that expression — temporarily or persistently — into model weights. The central legal question is whether this constitutes fair use under 17 U.S.C. § 107. The four-factor fair use analysis — purpose and character of use, nature of the copyrighted work, amount taken, and effect on the market — will be applied to AI training for the first time in binding court decisions expected between 2025 and 2027.

In the Google Books case (Authors Guild v. Google, 2d Cir. 2015), the Second Circuit found Google's scanning of millions of books for search indexing to be fair use — largely because the use was transformative and did not substitute for the original. AI training proponents argue a similar logic applies: a model doesn't reproduce a book, it learns patterns. Critics, including the Times, argue that memorisation of verbatim content, and competition with original creators in the AI output market, undermine the fair use defence.

Outside the US, the position is clearer and more restrictive. The EU's Directive on Copyright in the Digital Single Market (DSM Directive, 2019) permits text and data mining for research purposes but allows rightholders to opt out of TDM for commercial AI training. Several major publishers, including Axel Springer and Le Monde, have done exactly that. A business using an AI system trained on data scraped from EU sources without proper opt-out mechanisms may face copyright liability under EU law even if the training occurred outside the EU.

AI-Generated Content and IP Ownership

A separate but related question: who owns content generated by AI? The US Copyright Office has taken a clear position: copyright does not protect works generated by AI without sufficient human authorship. In 2023, the Office declined to register "A Recent Entrance to Paradise" — an image generated entirely by the AI system DABUS — and subsequent guidance confirmed that the degree of human creative control determines copyrightability. AI-generated images, text, or music with minimal human input cannot be copyright protected in the US.

This has significant commercial implications. If your organisation uses generative AI to produce marketing copy, product descriptions, software code, or creative assets, and those outputs lack sufficient human authorship, you may not be able to obtain copyright protection for them. Competitors could freely copy your AI-generated content without infringement. Your contracts with clients that warrant ownership of delivered work may be breached if the delivered work is unprotectable AI output.

For code specifically: GitHub Copilot has faced multiple lawsuits alleging that it reproduces copyrighted open-source code without attribution, in violation of licence terms. A 2022 class action (Doe v. GitHub/Microsoft/OpenAI) raised claims under the DMCA's Section 1202 (circumvention of copyright management information by stripping licence notices from reproduced code). The case signals that businesses using AI code generation tools face potential licence compliance exposure if the generated code is substantially similar to licensed open-source material.

The Getty Images v. Stability AI Cases

Getty Images sued Stability AI in both Delaware (US) and the UK in 2023, alleging that Stable Diffusion was trained on over 12 million Getty images without licence, and that the model can generate outputs bearing distorted versions of Getty watermarks — evidence of training on watermarked images. The UK case proceeded under English copyright law, which does not have the same fair use doctrine as the US. As of 2024, both cases remain ongoing but have established that AI companies cannot assume public accessibility of images implies licence to train on them.

AI Product Liability: When Outputs Cause Harm

Beyond IP, a rapidly developing area of legal exposure is product liability for AI outputs that cause harm. In 2023, a US attorney was sanctioned by a federal court after submitting a brief citing non-existent cases hallucinated by ChatGPT (Mata v. Avianca). While the attorney bore direct professional responsibility, the episode foreshadowed a class of harm where AI-generated output is relied upon for consequential purposes and proves false or dangerous.

The EU AI Liability Directive, proposed in 2022 and still in legislative process as of 2024, would create a presumption of causation where an AI system operated in a high-risk context and caused damage — shifting the burden to the developer or deployer to disprove the causal link. Combined with the EU Product Liability Directive update (also 2022), which explicitly extends product liability to software including AI, European businesses face a legal environment where demonstrable harm from an AI output creates a credible liability claim without requiring the plaintiff to unpack the technical causation chain.

In the US, existing product liability doctrine applies. The key questions courts will address are: Is AI a product or a service? (Different liability frameworks apply.) Who is the "manufacturer" in the supply chain — the model developer, the API provider, or the business deploying the model? The answers will shape the AI industry's liability architecture for decades.

Practical Risk Management for IP and Liability

Training data: Audit and document the provenance of any training data used in proprietary models. Ensure licences permit AI training use. Where EU DSM opt-outs apply, respect them.

Generated content: Flag AI-generated content internally. For commercially important content — campaigns, code, product designs — add sufficient human creative input and document it to support copyright claims.

Code generation: Implement policies requiring review of AI-generated code against known open-source licence obligations. Consider tools that flag potential licence conflicts in generated code.

Output verification: For any AI output used in professional, legal, medical, or financial contexts, implement mandatory human verification protocols. Document the verification process. Do not deploy AI-generated professional advice without human expert review.

Lesson 4 Quiz

3 questions — untracked, retake anytime.

1. The US Copyright Office's 2023 guidance on AI-generated content concluded that:

✓ Correct. The Copyright Office has consistently held that copyright requires human authorship. AI-generated works without sufficient human creative input are not protectable — meaning organisations cannot claim copyright in purely AI-generated content.

✗ Incorrect. The Copyright Office's position is that human authorship is required for copyright protection. AI cannot hold copyright, but works are also not automatically public domain if sufficient human creativity is involved in the creative process.

2. Under the EU's DSM Directive (2019), what right do rightholders have regarding the use of their content for commercial AI model training?

✓ Correct. The DSM Directive permits TDM for research but allows rightholders to opt out for commercial purposes. Publishers including Axel Springer and Le Monde have exercised this right, meaning AI training on their content without a licence may constitute infringement under EU law.

✗ Incorrect. The DSM Directive created a specific opt-out right allowing rightholders to reserve their works from commercial TDM. This has been exercised by major publishers and creates real licensing obligations for commercial AI developers using EU content.

3. What is the core commercial risk for organisations that deploy AI-generated content — such as marketing copy or product images — without adding sufficient human creative input?

✓ Correct. Without sufficient human authorship, AI-generated content cannot be copyrighted — competitors can copy it without infringement. Client contracts that warrant clear IP title to delivered work may also be breached if the delivered content is unprotectable. This is a significant but underappreciated commercial risk.

✗ Incorrect. The primary commercial risk is the inability to obtain copyright protection — meaning competitors can freely reproduce the content. Additionally, client contracts warranting IP ownership of deliverables may be breached. Training data rightholders do not automatically claim ownership of outputs.

Lab 4: IP and Liability Exposure Advisor

Map your AI-related intellectual property and product liability exposure and build mitigation strategies.

AI IP and Liability Risk Analysis

Use this lab to analyse your organisation's exposure to IP infringement claims (training data, generated content, code), product liability from AI outputs that cause harm, and related contractual risks. Describe your AI use case — what you generate, what data you used, and how you deploy outputs — and receive structured legal risk analysis and practical mitigation steps.

Try asking: "We use GPT-4 via the API to generate product descriptions for our e-commerce site. We sell these to clients under contracts that say we warrant clear IP ownership of all deliverables. What are our IP risks and what should we do?"

IP & Liability Risk Lab Lab 4

Module 4 Test

15 questions. Score 80% or above to pass. All lessons covered.

1. GDPR Article 22 grants individuals a right not to be subject to solely automated decisions with significant effects. Which of the following decisions would most clearly trigger this right?

✓ Correct. Mortgage rejection has clear legal and financial significance — it directly and substantially affects an individual. This is precisely the category of automated decision GDPR Article 22 was designed to regulate.

✗ Incorrect. Article 22 applies to decisions with legal or similarly significant effects. Mortgage rejection clearly meets this threshold; content recommendations, ticket routing, and internal search results generally do not.

2. A Data Protection Impact Assessment (DPIA) under GDPR Article 35 is required when:

✓ Correct. DPIAs are triggered by high-risk processing — which explicitly includes systematic and extensive profiling with significant effects, large-scale processing of special category data, and systematic monitoring of public areas. AI systems engaged in such activities must have a completed DPIA before deployment.

✗ Incorrect. DPIAs are required for high-risk processing — not based on record counts, deployment of any AI, or duration of processing. The key triggers are systematic profiling with significant effects, large-scale special category data, and public area monitoring.

3. The Italian DPA (Garante) fined Clearview AI the maximum permitted €20 million for processing facial recognition data of Italian residents. What did this case establish about AI training data?

✓ Correct. The Clearview case established that lawful basis applies to training data collection and processing — not merely to the deployment of the finished model. This has significant implications for any organisation training models on scraped or third-party data.

✗ Incorrect. The Clearview case's key principle is that lawful basis must exist for training data processing itself — not just downstream use. Public availability of images does not create a lawful basis for biometric processing under GDPR.

4. Under the EU AI Act, which of the following is classified as an UNACCEPTABLE RISK AI practice and is therefore prohibited outright?

✓ Correct. Social scoring by public authorities is explicitly listed among the prohibited AI practices under the EU AI Act. CV screening is high-risk (not prohibited), and product recommendation is minimal risk. Non-disclosed chatbots face transparency obligations but are not prohibited.

✗ Incorrect. Social scoring by public authorities is explicitly prohibited under the EU AI Act. CV screening AI is classified as high-risk with compliance requirements but is not banned. Product recommendations and chatbots fall into lower risk categories.

5. What is the maximum penalty for a provider placing a prohibited AI system on the EU market under the EU AI Act?

✓ Correct. The AI Act's highest penalty tier — €35 million or 7% of global turnover — applies to violations of the prohibited practices provisions. High-risk system non-compliance carries €15 million/3%, and providing incorrect information carries €7.5 million/1%.

✗ Incorrect. Prohibited AI practice violations carry the highest penalty: €35 million or 7% of global annual turnover. This is deliberately set higher than GDPR's 4% maximum to reflect the severity of prohibited AI harms.

6. A company purchases an off-the-shelf AI performance management tool from a vendor and uses it to flag underperforming employees for potential dismissal. Under the EU AI Act, the company is best characterised as:

✓ Correct. The purchasing company is a deployer under the AI Act and bears specific obligations — they cannot delegate all responsibility to the vendor. High-risk AI deployers must ensure human oversight, conduct impact assessments, and report serious incidents.

✗ Incorrect. The company is a deployer — not a provider, exempt party, or co-provider in the technical legal sense. Deployers bear their own set of AI Act obligations for high-risk AI systems, independent of what the vendor has done.

7. Amazon's AI hiring tool (2014–2018) systematically downgraded CVs from women primarily because:

✓ Correct. The Amazon tool illustrates proxy discrimination: the model didn't use gender directly, but learned to treat gender-correlated signals — attending women's colleges, using the word "women's" — as negative indicators because successful historical applicants were predominantly male.

✗ Incorrect. The discrimination was unintentional and arose from training data that reflected historical industry demographics. The model learned to replicate patterns in the training data — patterns that correlated with gender — without explicit gender inputs.

8. New York City's Local Law 144 (2023) requires employers using AI in hiring to:

✓ Correct. Local Law 144 requires annual independent bias audits with public disclosure — making New York the first US jurisdiction to mandate bias auditing of AI hiring tools. It has become a template for AI employment regulation nationwide.

✗ Incorrect. Local Law 144 requires annual independent bias audits with public disclosure. It does not mandate consent forms, human-only committees above salary thresholds, or pre-deployment registration of AI tools.

9. The CFPB's 2022 circular on AI credit models clarified that lenders must provide specific adverse action reasons to denied applicants. What is the core implication for AI model selection in lending?

✓ Correct. The CFPB circular made clear that model accuracy does not override the legal requirement for specific, comprehensible adverse action notices. A highly accurate black-box model that cannot explain why it denied a loan is legally non-compliant under ECOA and FCRA.

✗ Incorrect. The CFPB circular's key implication is that explainability is legally required in credit decisioning — model type, pre-approval, and certifications do not substitute for the legal obligation to provide specific adverse action reasons.

10. The New York Times v. OpenAI lawsuit (2023) centred on which core legal theory?

✓ Correct. The Times's lawsuit centred on copyright infringement — specifically that ChatGPT was trained on copyrighted articles without licence, and that the model can reproduce substantial verbatim portions of those articles (memorisation), directly competing with the Times's own content licensing business.

✗ Incorrect. The core claim is copyright infringement based on unauthorised training use and output memorisation. Trademark, defamation, and CFAA claims were not the primary basis of the New York Times lawsuit against OpenAI.

11. Under the EU's DSM Directive, which statement about AI training and copyright is accurate?

✓ Correct. The DSM Directive allows TDM for research but includes an opt-out right for rightholders to prevent commercial use of their content for AI training. This opt-out has been exercised by major European publishers, creating real licensing obligations for commercial AI.

✗ Incorrect. The DSM Directive does not create a blanket EU fair use equivalent. It permits TDM for research but allows commercial opt-outs — and explicitly does not harmonise with the US fair use doctrine, which is why EU and US copyright outcomes for AI may diverge significantly.

12. General-purpose AI model providers (such as those developing large language models) are required under the EU AI Act to:

✓ Correct. All GPAI providers must maintain documentation, comply with EU copyright law, and publish training data summaries. Systemic risk model providers (above 10²⁵ FLOPs) additionally must conduct evaluations, adversarial testing, and report incidents to the European AI Office.

✗ Incorrect. GPAI providers face documentation, copyright compliance, and transparency obligations — not pre-approval requirements, strict liability for all outputs, or access restrictions to certified EU businesses.

13. HireVue dropped facial recognition from its interview assessment platform in 2021. What are the two primary risk lessons for businesses that had integrated HireVue's facial recognition capabilities?

✓ Correct. The HireVue case illustrates two distinct risks: vendor dependency (a regulatory or reputational pressure on your vendor can immediately affect your operations) and co-liability (you cannot outsource discrimination liability to your AI vendor — EEOC guidance makes employers responsible for their tools' discriminatory impacts).

✗ Incorrect. The primary lessons are vendor dependency risk (vendor product changes affect your processes) and co-liability risk (you bear discrimination liability for your AI tools regardless of who built them). Consent and GDPR issues may also apply but are not the primary lessons from the HireVue situation.

14. What is the core risk of using AI-generated code from tools like GitHub Copilot without human review for open-source licence compliance?

✓ Correct. The GitHub Copilot class action raised exactly this issue: AI-generated code may reproduce licensed open-source code without attribution, potentially violating licence terms and DMCA Section 1202 (which prohibits removing copyright management information including licence notices). Organisations need code review policies to catch this.

✗ Incorrect. The primary risk is that AI-generated code may reproduce licensed open-source material — requiring licence compliance — and may strip licence notices in violation of DMCA Section 1202. The code does not automatically inherit licences, and AI provider ownership claims are not the primary risk in this context.

15. Which combination of regulatory frameworks creates the most comprehensive legal compliance obligation for a European bank deploying an AI model to make automated credit decisions for retail customers?

✓ Correct. Automated credit decisioning by a European bank sits at the intersection of all three frameworks: GDPR Article 22 requires individual rights and explanation; the EU AI Act classifies credit scoring as high-risk requiring conformity assessment and human oversight; and EU Equal Treatment Directives prohibit disparate impact discrimination. Managing all three simultaneously is the compliance challenge.

✗ Incorrect. The most complete compliance picture for a European bank's automated credit AI requires: GDPR Article 22 (individual rights in automated decisions), EU AI Act high-risk obligations (conformity, documentation, oversight — credit scoring is explicitly listed), and Equal Treatment Directives (non-discrimination). These three frameworks together define the compliance obligation.