Module 5 · Lesson 1

AI Music Generation: From Suno to Udio

How text-to-music tools create full productions from a single prompt — and what that means for creators.

What happens when anyone can generate a radio-ready track in thirty seconds?

When Ghostwriter977 posted "Heart on My Sleeve" to TikTok and YouTube in April 2023, the track sounded indistinguishable from a collaboration between Drake and The Weeknd. Neither artist had touched it. Within 72 hours the song accumulated millions of streams, triggered DMCA takedowns by Universal Music Group, and sparked a congressional inquiry into AI-generated music. The track was built using voice-cloning AI layered over a produced beat — a workflow that, by mid-2024, any creator could replicate with consumer tools.

The incident crystallised a question the music industry had been quietly avoiding: when AI can reproduce any artist's voice and style with high fidelity, what counts as original work?

What Text-to-Music Tools Actually Do

Text-to-music platforms such as Suno (launched publicly December 2023) and Udio (April 2024) accept a natural-language prompt and return a complete audio production — vocals, instrumentation, mixing — in under a minute. The underlying models are trained on vast corpora of recorded music and learn to associate stylistic descriptors ("upbeat lo-fi hip hop, melancholic lyrics about city lights") with corresponding audio features.

Suno's architecture conditions a diffusion-based audio model on both text embeddings and a learned musical "grammar" that handles structure (verse, chorus, bridge). Udio uses a similar approach but allows stems — isolated vocal or instrumental tracks — giving producers more flexibility for downstream editing in a DAW.

By June 2024, the Recording Industry Association of America (RIAA) filed lawsuits against both Suno and Udio, alleging that their training datasets contained copyrighted recordings without licence. The cases remained active through 2025 and are expected to set precedent for the entire generative audio space.

Legal Context

In June 2024 the RIAA, representing major labels including Universal, Sony, and Warner, sued Suno for $150,000 per infringed work. The suits allege the models were trained on the labels' catalogues without permission or payment — the same legal theory applied to image generators in the 2023 Getty vs. Stability AI case.

Key Capabilities and Limitations

What these tools do well: rapid prototyping of musical ideas, generating background music for video or podcast content, exploring genre combinations that would require a large session band to test conventionally, and producing royalty-free commercial music for small creators who cannot afford licensing fees.

Persistent limitations: outputs frequently exhibit what engineers call "hallucinated lyrics" — phonetically plausible syllables that mean nothing. Long-form coherence remains weak; a four-minute track generated in a single pass often loses its harmonic or rhythmic identity mid-way. Live-instrument nuance — the expressive imperfections that make a jazz piano recording feel human — is still largely absent.

Producers at studios including Hypnosis Music (London) have reported using Suno and Udio as demo sketch tools — generating a reference track to communicate a brief to session musicians, rather than as a final deliverable. This "AI as translator" workflow has become common in advertising music houses.

Workflow Reality

For commercial work in 2024–2025, AI-generated music is most defensible legally when the human creator writes all lyrics, edits the arrangement in a DAW, re-records key elements with live instruments, and uses the AI output strictly as a structural scaffold. This preserves both copyright protection and artistic authorship.

Key Terms

Text-to-MusicA generative AI paradigm that produces audio from natural-language descriptions, covering melody, harmony, rhythm, and timbre simultaneously.

StemsIsolated audio tracks (e.g., vocals only, drums only) that can be mixed independently in a digital audio workstation.

Voice CloningAI technique that replicates a specific person's vocal timbre and style from a sample, enabling synthesis of new performances in that voice.

DMCA TakedownA copyright-infringement notice under the Digital Millennium Copyright Act that compels platforms to remove infringing content.

Lesson 1 Quiz

AI Music Generation — five questions

1. Which event in April 2023 most directly triggered major-label legal action against AI music tools?

Correct. "Heart on My Sleeve" by Ghostwriter977 went viral in April 2023, was taken down by Universal Music Group, and directly preceded the RIAA's June 2024 lawsuits against Suno and Udio.

Not quite. The pivotal event was "Heart on My Sleeve" by Ghostwriter977 — an AI track that mimicked Drake and The Weeknd — going massively viral before Universal issued DMCA takedowns.

2. What are "stems" in the context of AI music tools like Udio?

Correct. Stems are isolated components of a mix — for example, a vocal track with no instrumentation — allowing producers to import them into a DAW and mix them against other elements.

Not quite. Stems are the isolated audio tracks (vocals, drums, bass, etc.) exportable from tools like Udio, giving producers fine-grained control in a digital audio workstation.

3. Which organisation filed lawsuits against Suno and Udio in June 2024?

Correct. The RIAA, representing Universal, Sony, and Warner, filed suit against both Suno and Udio in June 2024, alleging unlicensed use of copyrighted recordings in training data.

Not quite. The RIAA — representing the major labels — filed the suits, alleging the AI models were trained on copyrighted recordings without licence or payment.

4. How have producers at studios like Hypnosis Music used AI music tools in real commercial workflows?

Correct. Commercial studios have found AI-generated music most useful as a rapid reference or communication tool — an "AI as translator" approach — rather than a finished product.

Not quite. Producers at firms like Hypnosis Music use AI generation to sketch a demo that communicates a brief, then hand the concept to session musicians for the final recording.

5. What is a persistent technical limitation of AI text-to-music tools as of 2024–2025?

Correct. AI music generators still struggle with maintaining structural and harmonic consistency over longer durations, often drifting in style or losing the original rhythmic feel mid-track.

Not quite. The key limitation is long-form coherence — a four-minute AI track generated in a single pass frequently loses its character mid-way, unlike a human-composed and -produced track.

Lab 1: Prompting for AI Music

Practice crafting effective text-to-music prompts and understanding their trade-offs

Your Mission

You're helping a small documentary filmmaker find background music for a 3-minute emotional scene set in a rain-soaked city at night. You have access to a text-to-music tool but no budget for licensing. In this lab, you'll work with an AI assistant to craft and refine prompts, understand what makes a music generation prompt effective, and think through the legal and creative trade-offs involved.

Start by describing the scene to the assistant and asking for help crafting a Suno or Udio prompt. Then explore: what words reliably shape mood, tempo, and instrumentation? What should you avoid to stay clear of style-copying issues?

AI Music Prompt Coach

Lab 1

Welcome to Lab 1. I'm your AI music prompt coach. Tell me about the scene you need music for — mood, setting, emotional arc — and we'll craft and refine a text-to-music prompt together. We'll also think through what's legally safest when you're generating commercial-use music.

Module 5 · Lesson 2

AI Voice Cloning and Podcast Production

How tools like ElevenLabs, Descript, and Adobe Podcast are reshaping audio storytelling — and where the ethical lines sit.

If an AI can speak in any voice, who controls what that voice says?

When the Screen Actors Guild–American Federation of Television and Radio Artists (SAG-AFTRA) went on strike in July 2023, one of its central demands was a prohibition on studios using AI voice clones of actors without consent and compensation. The union had documented cases where production companies had cloned voice performances from existing recordings to generate new lines — avoiding re-hire costs entirely. The five-month strike ended in November 2023 with a deal that included AI provisions, marking the first major collective bargaining agreement to address generative voice AI.

The strike forced the entire audio-production industry to confront a question that podcast producers, audiobook narrators, and radio talent had been asking quietly: if a studio can clone your voice once, do they ever need to hire you again?

The Voice Cloning Landscape

ElevenLabs, founded in 2022, emerged as the dominant consumer voice-cloning platform. By early 2024 it offered "Instant Voice Cloning" — upload 30 seconds of audio, receive a model that replicates timbre, pacing, and emotional inflection. The platform was implicated in a January 2024 incident where a robocall using a cloned voice of President Biden urged New Hampshire Democratic primary voters not to vote. ElevenLabs terminated the responsible account within hours and tightened identity verification.

Descript took a different approach, marketing "Overdub" — a voice clone feature embedded in a podcast editing environment. Overdub requires the user to record themselves reading a consent script before any clone is activated, creating at least a procedural consent layer. The feature was designed explicitly for podcasters who want to fix spoken errors by typing corrections rather than re-recording.

Adobe Podcast (now part of Adobe Premiere) added "Enhance Speech" in 2023 — not cloning, but AI-powered audio clean-up that removes background noise, room reverb, and microphone imperfections from any recording. This tool democratised broadcast-quality audio for creators with consumer microphones and home studios.

The Biden Robocall — January 2024

A political operative used ElevenLabs to clone President Biden's voice and distribute a robocall to New Hampshire voters discouraging primary participation. The Federal Communications Commission subsequently proposed new rules requiring disclosure of AI-generated audio in political advertising. The case became a landmark in AI misuse documentation.

Practical Podcast Production Workflows

For independent podcast producers, AI has introduced four genuinely transformative capabilities: automated transcription (tools like Whisper from OpenAI achieve near-human accuracy on clear speech), noise removal (Adobe Enhance Speech, Auphonic), filler-word deletion (Descript's "remove filler words" feature scans transcripts and cuts "um," "uh," and "like" without manual editing), and AI-generated chapter summaries and show notes (Riverside.fm's AI summary feature was adopted by over 10,000 shows by late 2023).

Larger productions use AI differently. Spotify announced in September 2023 that it would use AI voice translation to dub popular podcast episodes into Spanish, French, and German — preserving the original host's vocal character across languages. The pilot used ElevenLabs technology and was tested with podcasts by Lex Fridman, Dax Shepard, and Steven Bartlett.

Consent Framework

Ethical voice-cloning practice requires: (1) explicit recorded consent from the voice owner before any clone is created, (2) clear disclosure to audiences when a cloned voice is used, (3) a defined scope — the clone may not be used for purposes the owner did not agree to, and (4) compensation terms negotiated in advance. SAG-AFTRA's 2023 AI agreement codified versions of all four requirements.

Key Terms

Voice CloningAI synthesis of a specific individual's vocal characteristics from a sample, enabling new speech to be generated in their voice.

OverdubDescript's consent-gated voice clone feature that lets podcasters correct spoken errors by typing replacement text.

Enhance SpeechAdobe Podcast's AI audio clean-up tool that removes noise and reverb without requiring re-recording.

Automated TranscriptionAI conversion of spoken audio to text, used as a foundation for editing, chapter generation, and search indexing.

Lesson 2 Quiz

AI Voice Cloning and Podcast Production — five questions

1. What did SAG-AFTRA's November 2023 strike agreement establish that was historically significant for AI?

Correct. The SAG-AFTRA deal was the first major union contract to specifically address generative AI voice cloning — requiring consent, scope limitation, and compensation provisions.

Not quite. The deal was historically significant because it was the first major collective bargaining agreement to formally address AI voice cloning, requiring consent, defined scope, and compensation terms.

2. What procedural safeguard does Descript's "Overdub" feature require before activating a voice clone?

Correct. Descript requires users to record a consent script in their own voice before Overdub can be activated — creating a procedural layer that links the clone definitively to its owner's consent.

Not quite. Descript's consent mechanism requires the user to record themselves reading a specific consent script — a simple but meaningful procedural safeguard.

3. Spotify's September 2023 AI voice translation pilot preserved the original host's vocal character across which languages?

Correct. Spotify's pilot used ElevenLabs technology to dub episodes by Lex Fridman, Dax Shepard, and Steven Bartlett into Spanish, French, and German — preserving each host's vocal style.

Not quite. Spotify's 2023 pilot translated episodes into Spanish, French, and German, using ElevenLabs to maintain the original host's vocal character in each dubbed language.

4. What did Adobe Podcast's "Enhance Speech" feature primarily offer creators?

Correct. Enhance Speech is an audio clean-up tool — it makes consumer-microphone recordings sound broadcast-quality by removing noise and reverb, without any voice cloning.

Not quite. Adobe Podcast's Enhance Speech cleans up recordings by removing noise, room reverb, and microphone imperfections — it's an audio restoration tool, not a voice cloning tool.

5. What was the key ethical problem illustrated by the January 2024 Biden robocall incident?

Correct. The robocall demonstrated that mainstream consumer tools had made high-quality voice impersonation trivially accessible — creating a serious disinformation threat that the FCC moved to regulate.

Not quite. The incident showed that consumer-grade voice-cloning tools had become accessible enough that a political operative could produce a convincing presidential impersonation and distribute it as disinformation.

Lab 2: Ethical Voice AI Decisions

Work through real consent and disclosure scenarios for podcast and audio production

Your Mission

You host an independently produced true-crime podcast. Your lead narrator — who recorded 40 episodes with you — has moved abroad and is unavailable for re-recording. You're considering using an ElevenLabs voice clone to record corrections and new episodes. In this lab, explore the ethical and legal dimensions of that decision with the AI assistant, including consent requirements, disclosure obligations, and what SAG-AFTRA's framework says about independent creators.

Start by describing your situation. Ask the assistant to walk you through what ethical voice-cloning practice requires — then push on edge cases: what if you have a signed contract? What if your narrator agrees verbally but not in writing? What must you disclose to listeners?

Voice AI Ethics Advisor

Lab 2

Welcome to Lab 2. I'm your Voice AI Ethics Advisor. Tell me about your situation — who owns the voice, what rights you have, and what you're planning to do — and we'll work through the ethical, legal, and practical considerations together. No situation is too edge-case to explore.

Module 5 · Lesson 3

AI for Sound Design and Scoring

How AI tools are entering professional film, game, and interactive audio pipelines — and where they're replacing versus augmenting human composers.

When a film composer's assistant is an AI that can generate 500 cue variants overnight, what does the human bring to the table?

The Writers Guild of America's May–September 2023 strike produced landmark AI provisions that, while aimed at screenwriting, set a template the film music industry watched closely. Simultaneously, Hollywood composers' union Local 47 began negotiating AI provisions of its own, after reports emerged that at least two major streaming productions had used AI-generated score elements — synthesised from existing soundtracks — without composer knowledge or credit. The productions involved adaptive score tools that auto-generated ambient underscore for scenes the human composer had not been contracted to score.

The incidents illustrated a particular risk in AI scoring: the technology is most likely to displace work at the margins — the incidental, atmospheric cues that make up the bulk of a score's runtime but earn composers the majority of their backend royalties.

The Tools Reshaping Game and Film Audio

AIVA (Artificial Intelligence Virtual Artist), founded in Luxembourg in 2016, was among the first AI composers to receive performing rights society registration. By 2023 it had been used on over 300 commercial game soundtracks and was the preferred AI scoring tool at several mid-tier game studios seeking rapid iteration of ambient music without session musician costs. AIVA allows users to set key, time signature, tempo, and instrumentation, then generates multi-part MIDI arrangements that can be exported to any DAW.

Mubert operates differently — instead of composing new pieces, it uses a library of AI-generated stems tagged by mood and energy to assemble real-time adaptive music. Game developers integrate Mubert's API directly into their game engines, allowing the soundtrack to respond to player state without pre-composed trigger cues. Several indie games released in 2023–2024 used Mubert for their entire ambient layers.

Meta's AudioCraft (released open-source August 2023) included MusicGen — a model capable of generating short musical passages from text and reference audio. Meta's decision to open-source the model meant that within weeks, developers had fine-tuned versions for specific genres: film noir orchestral, chiptune, and cinematic trailer music, among others.

The Adaptive Score Disruption

Traditional game music is composed in discrete loops triggered by game state — a system composers have been paid to create since the 1980s. Adaptive AI scoring replaces the trigger-based loop architecture with real-time generation that responds continuously. This eliminates not just the composition fee but the ongoing royalties composers earn each time a loop is used in a shipped game.

Where Human Composers Remain Essential

Professional film composers interviewed by The Hollywood Reporter in mid-2023 consistently identified thematic development — the craft of writing a musical idea and then transforming it across a film's emotional arc — as the domain AI tools cannot yet replicate. A leitmotif that starts as a tender piano phrase in Act One and resurfaces as a full orchestral swell during the climax requires narrative understanding that present AI systems lack.

Sound designers have found a different balance. AI tools like Soundraw and Boomy handle generative background texture efficiently, freeing human sound designers to focus on the distinctive, character-defining sounds that make a film or game's audio world feel original — creature vocalisations, unique environmental reverbs, weapon sounds that reflect character personality.

The practical reality for working composers in 2024 is a two-tier market: AI handles ambient and functional music at scale; human composers are contracted for thematic content, emotional peak moments, and anything requiring narrative coherence across more than a few minutes.

AIVA and Copyright

Music generated by AIVA on paid plans is assigned to the user as the copyright owner — a model AIVA achieved by ensuring the user, not the AI, holds the composition credit. This is legally distinct from outputs generated by Suno or Udio, where copyright ownership remains contested pending the RIAA litigation outcomes.

Key Terms

Adaptive ScoreA film or game soundtrack that responds in real time to narrative or player state, rather than using pre-composed discrete loops.

LeitmotifA recurring musical theme associated with a character, place, or idea that develops and transforms across a narrative.

MIDIMusical Instrument Digital Interface — a protocol representing musical notes and performance data as digital instructions, not recorded audio.

MusicGenMeta's open-source audio generation model released in 2023, capable of producing music from text prompts and reference audio.

Lesson 3 Quiz

AI for Sound Design and Scoring — five questions

1. What type of scoring work did reports indicate AI tools were most likely to displace from human composers in 2023?

Correct. The specific concern was that AI would erode the ambient and incidental cues — not the prestige thematic material — which paradoxically generate the majority of a composer's backend streaming royalties.

Not quite. Reports highlighted incidental and atmospheric underscore — the ambient cues that fill most of a score's runtime — as the category most vulnerable to AI replacement, since these are functional rather than thematic.

2. How does Mubert's approach to game music differ from AIVA's?

Correct. Mubert's API assembles music from pre-generated stems tagged by mood — a real-time recombination approach. AIVA generates new multi-part compositions via MIDI based on user parameters.

Not quite. Mubert uses a library of AI-generated stems assembled in real time based on player state. AIVA composes original MIDI arrangements from parameters like key, tempo, and instrumentation.

3. Meta's AudioCraft / MusicGen was released in what form in August 2023, and why did this matter?

Correct. Open-sourcing MusicGen meant that within weeks, developers worldwide were fine-tuning it for specific genres — dramatically accelerating the proliferation of specialised music AI models.

Not quite. Meta released AudioCraft open-source, which meant developers could immediately fine-tune versions for film noir orchestral, chiptune, cinematic trailer music, and other specific genre needs.

4. According to Hollywood composers quoted in The Hollywood Reporter in 2023, which creative capability remains beyond current AI scoring tools?

Correct. Composers identified thematic development — a leitmotif that begins as a tender piano phrase and returns as a full orchestral swell — as requiring narrative understanding AI currently lacks.

Not quite. The capability AI lacks is thematic development — the craft of writing a musical idea and transforming it across a film's entire emotional arc in a way that reflects narrative understanding.

5. On AIVA's paid plans, who holds the copyright to generated music — and why is this legally significant?

Correct. AIVA's paid-tier copyright model assigns ownership to the user by structuring human input as compositional authorship — a legally significant contrast to Suno and Udio, where copyright remains contested.

Not quite. AIVA on paid plans assigns copyright to the user — it achieves this by ensuring the human is credited as composer, not the AI. This makes AIVA-generated music more commercially defensible than outputs from Suno or Udio.

Lab 3: AI Score Strategy for a Game

Design an adaptive audio strategy for an indie game using available AI tools

Your Mission

You're the solo developer of an indie horror-adventure game. Your total music budget is $500. You need: an atmospheric ambient layer that responds to player tension, a distinctive main theme, and five short event stings (discovery, danger, safety, puzzle solved, death). In this lab, work with the AI assistant to design a realistic audio strategy that combines AI tools with your budget constraints — and think through the copyright implications of each tool choice.

Begin by describing your game's mood and world. Ask the assistant which tools (AIVA, Mubert, Suno, Udio, MusicGen) would suit which part of your need. Then explore: how do you handle the adaptive layer technically? What do you still need a human musician for?

Game Audio Strategy Advisor

Lab 3

Welcome to Lab 3. I'm your Game Audio Strategy Advisor. Tell me about your game — genre, mood, setting, player experience — and we'll map out a realistic AI-assisted audio pipeline that fits your budget. I'll help you decide which tools to use for which elements and flag any copyright considerations you need to plan around.

Module 5 · Lesson 4

Rights, Royalties, and the Future of Audio AI

Navigating copyright, licensing, and the evolving business models shaping who gets paid when AI makes music.

When AI generates a hit song, who owns it — and who should?

At a 2023 industry summit documented by Music Week, Radiohead's Thom Yorke drew an explicit comparison between generative AI music tools and the Napster era: "The argument is exactly the same — it's someone else taking your life's work, packaging it, and offering it for free or almost free." The comparison was apt in structure if not in detail. Napster distributed existing recordings without payment; AI tools consume existing recordings as training data — also without payment — and produce new outputs that compete directly with the recordings they were trained on.

The distinction matters legally. Napster's liability was clear: distribution of copyrighted files. AI training-data liability is contested: the industry must establish whether training constitutes infringement, whether outputs are "substantially similar" to training data, and whether fair use applies at the scale of billions of audio samples. Those questions were actively before federal courts as of mid-2025.

The Copyright Landscape in 2024–2025

The U.S. Copyright Office issued a February 2023 guidance stating that works generated entirely by AI — without sufficient human authorship — are not eligible for copyright protection. This creates a foundational tension: if AI-generated music cannot be copyrighted, creators who use it lose the ability to exclusively control or license their output.

However, the Copyright Office also acknowledged that human-AI collaborative works may be protectable to the extent of human authorship. A musician who writes all lyrics, directs the AI to generate an instrumental scaffold, then edits, arranges, and masters the result has contributed sufficient human authorship to likely secure copyright — though the exact threshold remains legally untested.

The 2023 AI Act in the European Union introduced transparency requirements that directly affect audio AI: high-risk AI systems — including those capable of producing content that could be mistaken for human-created — must disclose AI involvement. This will affect how AI-generated music is labelled on streaming platforms for EU audiences.

The Royalty Stack Problem

Traditional music royalties flow through two channels: the master recording (owned by labels or artists) and the composition (owned by songwriters and publishers). When AI generates both the composition and the "recording" simultaneously, neither channel has a clear rights-holder. Streaming platforms in 2024 began requiring AI-generated track disclosures, but royalty routing for such tracks remained unresolved — meaning some AI-generated tracks were earning streaming revenue with no established mechanism for distributing it.

Emerging Business Models

Universal Music Group's "artist-centric" model, announced October 2023, proposed that streaming platforms prioritise royalty payments for tracks with demonstrated human engagement metrics — a de facto disadvantage for AI-generated catalogue. Several major platforms, including Deezer, adopted variants of this model in 2024.

SoundCloud's AI music policy, updated February 2024, was among the first to explicitly distinguish between human-created, AI-assisted, and AI-generated tracks — and to route royalties differently for each category. AI-generated tracks without a discernible human author receive no performance royalties under the updated policy.

Licensing AI training data has emerged as a new revenue stream for rights-holders. In May 2024, Universal signed what was reported as a licensing agreement with an unnamed AI company to provide access to its catalogue for training purposes — a model some analysts predicted would become standard, creating a "music training data market" analogous to stock photography.

Practical Guidance for Creators

In 2024–2025, the safest approach for commercial audio work: (1) use AI for structural scaffolding, not final output; (2) document every human creative decision you make on top of AI output; (3) disclose AI involvement to any platform, client, or publisher that asks — most now have explicit policies; (4) avoid prompting AI to reproduce the style of a specific named artist, as this is the most legally exposed territory; and (5) check the specific copyright terms of whichever tool you use — AIVA, Mubert, and Soundraw handle ownership differently from Suno and Udio.

Key Terms

Master RecordingThe specific fixed audio recording of a piece of music — owned separately from the underlying composition, typically by the label or the recording artist.

Composition CopyrightThe copyright in the melody and lyrics of a musical work, owned by the songwriter or publisher, independent of any specific recording.

Fair UseA U.S. copyright doctrine permitting limited use of copyrighted material without permission, assessed case-by-case based on purpose, nature, amount used, and market impact.

Artist-Centric ModelA streaming royalty framework that prioritises tracks with demonstrated human listener engagement over raw stream counts, disadvantaging bot-driven or AI-generated content.

Lesson 4 Quiz

Rights, Royalties, and the Future of Audio AI — five questions

1. What did the U.S. Copyright Office's February 2023 guidance establish about fully AI-generated works?

Correct. The Copyright Office's 2023 guidance stated that AI-generated works lacking sufficient human authorship cannot receive copyright protection — a foundational issue for any creator seeking to commercialise AI music.

Not quite. The Copyright Office ruled that works produced entirely by AI — without sufficient human creative input — are ineligible for copyright. This means pure AI-generated music can be freely copied by anyone.

2. What analogy did Thom Yorke draw between Napster and generative AI music tools at a 2023 industry summit?

Correct. Yorke's analogy was structural: Napster took recordings and distributed them without payment; AI tools consume recordings as training data without payment and produce competing outputs — the harm mechanism differs but the economic injury is parallel.

Not quite. Yorke's comparison: both Napster and AI tools take artists' work without payment and offer something competitive for free or cheap. Napster distributed; AI ingests as training data — different mechanism, similar injury.

3. What did SoundCloud's February 2024 policy update establish for AI-generated tracks?

Correct. SoundCloud distinguished between human-created, AI-assisted, and AI-generated tracks — and specified that fully AI-generated tracks with no identifiable human author earn no performance royalties under its updated policy.

Not quite. SoundCloud's 2024 policy stopped performance royalties for AI-generated tracks lacking a human author — one of the first platforms to operationalise the human-authorship distinction in royalty routing.

4. What was the "artist-centric model" announced by Universal Music Group in October 2023?

Correct. The artist-centric model rewards tracks that humans actually choose to listen to, rather than raw stream counts — which can be inflated by bot activity or bulk AI-catalogue uploads.

Not quite. The artist-centric model is a royalty framework that prioritises engagement metrics from real human listeners, effectively disadvantaging AI-generated tracks that accumulate streams without genuine audience interest.

5. According to the lesson's practical guidance, which prompting behaviour is described as the most legally exposed territory?

Correct. Prompting an AI to generate music "in the style of [specific named artist]" is the highest-risk behaviour because it most directly implicates the artist's identifiable stylistic elements and creates the clearest path to an infringement claim.

Not quite. The guidance identifies prompting AI to reproduce a specific named artist's style as the most legally exposed territory — it's the prompt type most likely to produce output that infringes on the artist's protectable stylistic identity.

Lab 4: Navigating Audio AI Rights

Work through real rights and royalty scenarios for AI-generated audio in commercial contexts

Your Mission

You run a small YouTube channel (95,000 subscribers) that publishes weekly video essays. A brand has approached you for a sponsored video and wants original background music that "sounds like Hans Zimmer's Interstellar score" — specifically because they love the emotional tone. You're considering using Suno to generate it. In this lab, work through the legal, ethical, and practical dimensions: what's the risk of that specific request? What alternatives exist? How should you disclose AI involvement to the brand and to YouTube?

Start by describing the brief you've received. Ask the assistant to help you evaluate the legal risk of the "sounds like Hans Zimmer" request, then explore alternatives that achieve the emotional brief without the legal exposure. Finally, draft a disclosure statement you could send to the sponsoring brand.

Audio Rights Advisor

Lab 4

Welcome to Lab 4. I'm your Audio Rights Advisor. Tell me about the sponsorship brief and the music request you've received. We'll assess the legal risk, find safer alternatives that still hit the emotional target, and work out what you need to disclose — to the brand, to YouTube, and to your audience. Let's start with the brief.

Module 5 Test

Music, Podcasts, and Audio With AI — 15 questions · 80% to pass

1. Which AI music platform was the subject of a June 2024 RIAA lawsuit alleging unlicensed use of copyrighted recordings in its training data?

Correct. The RIAA sued both Suno and Udio in June 2024 on behalf of Universal, Sony, and Warner, alleging their models were trained on copyrighted recordings without licence.

The RIAA sued Suno (and Udio) in June 2024, alleging their models ingested copyrighted recordings without licence. AIVA and Mubert were not part of those suits.

2. What is the primary legal question distinguishing the AI music training-data debate from the Napster case?

Correct. Unlike Napster (which distributed files), AI training-data liability hinges on whether ingestion for training constitutes infringement and whether fair use doctrine applies at massive scale.

The key legal distinction is whether using copyrighted audio as training data constitutes infringement — and whether fair use applies at the scale these models require.

3. The "Heart on My Sleeve" incident featured an AI track that convincingly mimicked which two artists?

Correct. "Heart on My Sleeve" by Ghostwriter977 convincingly mimicked Drake and The Weeknd, triggering DMCA takedowns by Universal Music Group and a congressional inquiry.

"Heart on My Sleeve" mimicked Drake and The Weeknd — convincingly enough to accumulate millions of streams before Universal issued DMCA takedowns.

4. What does Descript's "Overdub" feature primarily enable podcast creators to do?

Correct. Overdub lets podcasters type corrections into the transcript, which the tool then voices using their consent-gated voice clone — eliminating the need to re-record verbal mistakes.

Overdub lets creators fix spoken errors by typing new text — the tool voices the correction in the creator's own cloned voice. Consent must first be given by reading a specific script.

5. Spotify's September 2023 AI voice translation pilot used which company's technology?

Correct. Spotify used ElevenLabs to dub podcast episodes by Lex Fridman, Dax Shepard, and Steven Bartlett into Spanish, French, and German — preserving each host's vocal character.

Spotify's translation pilot used ElevenLabs technology to preserve the host's vocal character while dubbing episodes into Spanish, French, and German.

6. What is AIVA's approach to copyright ownership for music generated on its paid tier?

Correct. AIVA's paid-tier model assigns copyright to the user by framing the human as the composer of record — a legally significant arrangement that makes AIVA outputs more commercially defensible.

On AIVA's paid plans, copyright goes to the user — achieved by structuring the human's input as authorship, not the AI's. This contrasts with the contested ownership status of Suno and Udio outputs.

7. What feature of Udio distinguishes it from Suno for professional music production workflows?

Correct. Udio's stem export capability — allowing producers to take isolated tracks into a DAW — gives it a practical professional workflow advantage over Suno's single mixed-output approach.

Udio's distinguishing feature is stem export — the ability to receive isolated vocal and instrumental tracks for independent mixing in a DAW, which Suno does not offer in the same way.

8. How does Mubert's adaptive music system work in game development?

Correct. Mubert's game-engine API assembles music in real time from a library of AI-generated, mood-tagged stems — responding continuously to player state without pre-composed discrete loops.

Mubert uses a library of AI-generated stems tagged by mood and energy. Its API integrates with game engines to assemble these stems in real time as player state changes — no pre-composed loops required.

9. What did the January 2024 Biden robocall incident lead the FCC to propose?

Correct. The FCC responded to the Biden robocall by proposing rules that would require disclosure when AI-generated audio is used in political advertising — a direct regulatory response to the incident.

The FCC's response to the Biden robocall was to propose disclosure requirements for AI-generated audio in political advertising — not a ban, but a transparency mandate.

10. What was Meta's AudioCraft / MusicGen's most significant impact after its August 2023 release?

Correct. Because MusicGen was open-sourced, developers had fine-tuned genre-specific versions — film noir orchestral, chiptune, cinematic trailer — within weeks of release, dramatically accelerating specialised audio AI.

The open-source release was the key — it enabled a worldwide developer community to fine-tune MusicGen for specific genres almost immediately, proliferating specialised models far faster than a commercial release would have.

11. According to the U.S. Copyright Office's 2023 guidance, under what condition might a human-AI collaborative work be copyright-protectable?

Correct. The Copyright Office acknowledged that human-AI collaborative works may be protectable to the extent of human creative authorship — the human must contribute meaningful creative decisions, not merely prompt an AI.

The Copyright Office's guidance says human-AI works may be protected to the extent they reflect human authorship — the human's creative decisions layered on top of AI output must be meaningful and identifiable.

12. What did SAG-AFTRA's 2023 AI agreement specifically address regarding voice cloning?

Correct. The SAG-AFTRA agreement required explicit consent, defined scope of use, and negotiated compensation for any AI voice cloning of union members — the first such provisions in a major collective bargaining agreement.

SAG-AFTRA's 2023 AI provisions required consent, scope definition, and compensation terms for voice cloning — it did not ban the technology but placed it under a consent and payment framework.

13. What does Universal Music Group's "artist-centric model" prioritise in streaming royalty distribution?

Correct. The artist-centric model rewards genuine human engagement rather than raw stream counts — creating a structural disadvantage for AI-generated content that accumulates streams without real listener interest.

The artist-centric model prioritises human engagement metrics — it's designed to route royalties toward tracks people genuinely choose to listen to, rather than AI-generated catalogue that accumulates streams without authentic audience interest.

14. What legitimate commercial use have studios like Hypnosis Music (London) found for AI music generation tools in professional workflows?

Correct. Professional studios use AI music generation as a communication tool — generating a demo that conveys the brief to session musicians who then record the final version. This "AI as translator" workflow avoids copyright risk.

Studios like Hypnosis Music use AI to generate reference demos that translate a creative brief into audible form — the sessions musicians then record the real version. AI is translator, not final deliverable.

15. What new revenue stream for music rights-holders emerged from AI companies' need for training data in 2024?

Correct. Licensing music catalogues for AI training emerged as a new revenue stream in 2024 — Universal reportedly signed a deal in May 2024, and analysts predicted a "music training data market" analogous to stock photography would develop.

The emerging model is catalogue licensing for training data — Universal reportedly signed such a deal in May 2024, creating a potential ongoing revenue stream analogous to how stock photography libraries license images for AI training.