In 1877, Thomas Edison pressed a stylus into tin foil and recorded himself reciting "Mary Had a Little Lamb." Within two decades, the phonograph had fractured the entire economics of live music: orchestras that once commanded concert halls found themselves competing with their own recorded performances, sold for cents at department stores. When the Palace Theatre in London installed electric arc lighting in 1881, stage managers discovered for the first time that they could control mood with precision β and overnight the gaslighter's craft became obsolete. The introduction of synchronized sound in film, beginning commercially with The Jazz Singer in October 1927, eliminated roughly 22,000 theatre-pit musician jobs in the United States within three years.
The pattern is unmistakable: a new technology arrives, practitioners insist it cannot replace the irreducibly human thing they do, and then the technology reshapes not just who gets paid but what "performance" itself means. Today, AI voice synthesis can clone a living actor's voice from minutes of audio. Choreography software trained on motion-capture libraries generates novel movement sequences. In 2023, the writers' and actors' strikes in Hollywood placed AI restrictions at the center of contract negotiations for the first time β not as a hypothetical concern but as an immediate demand by studios already running experiments.
This course is not a forecast of doom nor a celebration of disruption. It is a map of what is actually happening β the real tools, the real deployments, the real labor disputes, and the real creative possibilities β so that performers, directors, designers, and producers can make informed decisions rather than reactive ones. You will finish with a working understanding of where AI currently sits in casting, composition, choreography, and live production, and with a clearer sense of what questions to ask when the technology lands in your own rehearsal room.
If you finish every module, here's who you become:
On the opening night of To Be With Hamlet at the Tribeca Film Festival in April 2017, audiences donned virtual reality headsets and stood β literally stood β inside a performance space shared with a live actor. The AI-driven spatial audio system tracked each viewer's head position in real time, rendering a version of the soliloquy that changed based on where you were standing relative to the performer. The director, Gabo Arora, described it not as a gimmick but as an attempt to collapse the boundary between watcher and watched. The technology was invisible to most people in the room. That invisibility, it turns out, is the defining condition of AI in performing arts today.
That same year, theatre researchers at Georgia Tech demonstrated Shimon, a marimba-playing robot that improvised in real time with human musicians using a neural network trained on jazz and classical corpora. The system didn't just play along β it listened, anticipated phrase endings, and chose responses that felt stylistically coherent. Critics who attended the demonstration used the word "musicianly." The word choice was not accidental.
The phrase "AI in performing arts" covers a sprawling range of applications that have almost nothing in common technically. It is useful to distinguish four broad categories before going further.
Production AI refers to tools that assist with design, scheduling, and logistics. ETC's Eos lighting consoles have incorporated machine-learning cue suggestions since roughly 2019. Casting platforms like Spotlight and Casting Networks use algorithmic ranking systems to surface candidates. Stage management software now predicts rehearsal overruns from historical data.
Creative AI refers to systems that generate or co-generate artistic content β scripts, scores, choreography notation, set designs. GPT-class language models were used by several experimental theatre companies in 2022 and 2023 to generate dialogue that human writers then edited. The UK-based company Punchdrunk explored generative narrative branching in immersive productions.
Performance AI refers to AI systems that are themselves present during a live event β responsive lighting, adaptive soundscapes, real-time translation overlays, or interactive digital characters. These systems must operate with latency constraints under 30 milliseconds to feel live rather than lagged.
Labour-displacing AI refers specifically to applications that substitute for human performers or creative workers. Voice synthesis capable of reproducing a named actor's voice is the most contested current example. Background-replacement AI, which generated controversy during the 2023 SAG-AFTRA negotiations, allows studios to capture a performer's likeness once and reuse it indefinitely without additional compensation.
The Writers Guild of America went on strike on May 2, 2023. Among their demands: a prohibition on using AI to generate scripts that would then be "written" by a credited human writer at a lower rate, and a requirement that studios disclose when AI-generated material is given to writers for editing. The strike lasted 148 days. The eventual agreement, ratified in October 2023, prohibited AI from being credited as a writer and established that AI-generated material does not constitute "source material" under guild rules β meaning it cannot reduce a writer's credit or payment.
SAG-AFTRA struck beginning July 14, 2023, explicitly over AI provisions. The union's specific concerns included digital replicas β AI-generated likenesses of performers β and synthetic performers β entirely AI-generated characters that could replace background artists. The agreement reached in November 2023 required informed consent and "meaningful" additional compensation for digital replica use, and established that background performers could not have their likenesses scanned without knowing what the scan would be used for.
These are not abstract labor-law details. They represent the first time that AI use in the performing arts was governed by enforceable contract language β a genuine historical inflection point equivalent, in scope if not in drama, to the invention of the screen actors' guild itself in 1933.
The 2023 strike agreements established legal precedent for AI consent in performance. Every contract negotiated after them β in theatre, opera, dance, and broadcast β is now written in their shadow.
AI in performing arts sits at the intersection of two genuinely difficult problems: the creative problem (can a machine generate something worth experiencing?) and the labour problem (who owns the output, and who gets paid?). This course addresses both β but they are not the same question, and conflating them produces bad analysis.
You've learned the four-part taxonomy: Production AI, Creative AI, Performance AI, and Labour-displacing AI. Now apply it. Think of a specific performing arts context you work in or care about β a theatre company, a dance ensemble, a concert venue, a film set, an opera house β and discuss with the AI assistant how different types of AI might already be present, or might arrive soon.
The assistant will help you think through specific use cases, potential benefits, and genuine risks for your context. Push back if something doesn't seem right. Ask follow-up questions.
In September 2023, the estate of the late actor Burt Reynolds confirmed that it had received inquiries from production companies wishing to use AI voice synthesis to recreate Reynolds's voice for projects he never participated in. Reynolds had died in September 2018. His estate declined. But the inquiry revealed something significant: the technical capability to reconstruct a recognizable voice now preceded any legal framework governing whether doing so required permission from the estate, the original studio, or no one at all.
That same year, voice actor Flula Borg publicly raised concerns about a different scenario: living performers discovering their voices had been used, without consent, as training data for commercial synthesis tools. The platform ElevenLabs, which launched publicly in January 2023, could produce voice clones of startling quality from as little as one minute of sample audio. By mid-2023, a cloned version of President Biden's voice had been used in robocall disinformation. The performing arts and political contexts were suddenly in the same legal and ethical space.
Modern AI voice synthesis systems β including ElevenLabs, Microsoft's VALL-E (demonstrated January 2023), and tools built into Adobe Premiere β operate through a process called neural voice cloning. They analyze a short sample of a target voice to extract what researchers call a "speaker embedding" β essentially a mathematical fingerprint of that person's vocal characteristics: pitch range, resonance, rhythm, breath patterns, and accent markers.
Once a speaker embedding exists, the system can generate entirely new speech in that voice from any text input. VALL-E demonstrated that three seconds of audio was sufficient to produce a recognizable clone. The quality improves with longer samples, but the threshold for "recognizable" is now dramatically lower than it was even in 2021, when comparable systems required hours of studio-quality recordings.
The implications for performing arts are direct. A voice actor who has recorded hundreds of hours for audiobooks, video games, or animation has, inadvertently, created a vast training corpus for anyone who can access those recordings. That the recordings exist for an entirely different purpose provides no legal protection under current US law unless a specific contract clause addresses it.
In February 2023, voice actor Tim Friedlander, president of the National Association of Voice Actors (NAVA), testified before a US House Judiciary subcommittee that AI voice cloning represented an existential threat to the profession β not because it was futuristic, but because it was already happening in commercial work.
As of early 2024, there is no federal US law specifically governing AI voice or likeness cloning. Protection comes from a patchwork of existing doctrines, all of which have significant gaps when applied to AI.
Right of Publicity laws, which exist in roughly 35 US states, protect individuals from commercial use of their name, image, or likeness without consent. California's statute is among the strongest. However, most right-of-publicity law was written for photograph and video contexts and has not yet been consistently applied to audio-only cloning or to training data use.
The NO FAKES Act, proposed in the US Senate in October 2023, would create a federal right against AI-generated digital replicas of voice or likeness without consent, with protections extending 70 years after death. As of early 2024, it has not been enacted.
Tennessee's ELVIS Act (Ensuring Likeness Voice and Image Security), signed into law March 2024, became the first state law in the US specifically to prohibit AI voice cloning without consent β named, pointedly, for Elvis Presley's estate, which is headquartered in Nashville.
In the UK, voice and likeness are addressed partly through passing off law (using someone's voice to imply their endorsement of something) and partly through data protection regulations. The UK's Intellectual Property Office published a consultation on AI and copyright in 2022 but had not finalized guidance by early 2024.
For working performers, the practical implications are immediate. Existing contracts β particularly for voiceover, animation, and audiobook work β were not written with AI synthesis in mind. Most contain no clause restricting the use of recorded material as AI training data. Until contracts are renegotiated or new legislation passes, performers bear the risk.
The SAG-AFTRA 2023 agreement addressed digital replicas for its covered productions, but it does not govern non-union work, independent productions, or uses of material recorded before the agreement. NAVA has published model contract language specifically addressing AI use, which freelance voice actors can request be included in new contracts.
The creative dimension is also genuinely complex. Some performers β including Anthony Hopkins's team and the estate of Judy Garland β have licensed voice and likeness for specific AI uses. The question is not whether AI voice synthesis is ever acceptable, but whether the performer's informed consent and fair compensation are present.
The consistent demand from performing arts unions, legal scholars, and ethicists is the same: informed consent before use, and meaningful compensation proportionate to the commercial value generated. The technology is not inherently unethical. The deployment of it without consent is.
The legal landscape around AI voice synthesis is moving fast, but contracts can address gaps in legislation right now. In this lab, you'll work with the AI assistant to draft a short contract clause β or evaluate an existing one β governing AI voice use.
You might bring a real contract scenario you've encountered, ask the assistant to explain what a strong clause should include, or test a draft clause you've written against the principles from the lesson. The assistant will challenge vague language and ask what specific protections you're trying to create.
In November 2022, a short play called AI: When a Robot Writes a Play premiered at the McCarter Theatre in Princeton, New Jersey. The text had been generated by GPT-3 and then edited β lightly β by a human dramaturg, Shayok Misha Chowdhury. The production was advertised explicitly as AI-authored. The reviews were mixed not on artistic grounds but on conceptual ones: was this a play, or a demonstration? Was the human editor's work enough to constitute authorship? The production raised questions that the theatre community had been avoiding for years.
Three months earlier, in August 2022, the US Copyright Office had issued a significant decision: AI-generated images in the graphic novel Zarya of the Dawn by Kristina Kashtanova could not be copyrighted because they lacked human authorship. The Copyright Office subsequently issued guidance confirming that copyright protection requires human creative expression. For performing arts, this created an immediate practical problem: if a generative AI produces a substantial portion of a musical score, that portion may be uncopyrightable β and therefore freely reproducible by anyone.
By 2023, the use of large language models in script development had moved from experiment to common practice in certain sectors. Television writers' rooms, under pressure from shortened seasons and faster turnaround demands, were using tools like ChatGPT to generate outline drafts, test dialogue variations, and identify structural weaknesses in pilot scripts. The WGA's position β reflected in the 2023 agreement β was not that AI should be banned from this process, but that AI-generated material should not be passed off as a writer's original work, and that it should not reduce the human writer's credit or fee.
The distinction the WGA drew is conceptually important: AI as a tool (generating options a human writer selects from) versus AI as a substitute (producing a draft that a human merely edits at a lower pay rate). The former is analogous to a thesaurus or a dramaturgical consultant. The latter is analogous to replacing the writer.
In practice, the line between these is blurry. A writer who prompts an AI model extensively, curates the output, arranges it into scenes, and writes connective tissue may have produced something genuinely original β or may have produced a lightly edited AI document. The craft lies in what the writer contributes that the AI couldn't; the labor question lies in whether that contribution is recognized and compensated.
AI music composition tools have been available in various forms since at least 2016, when Jukedeck launched as the first commercially available AI music generator. Google's Magenta project, also begun in 2016, produced a series of research models demonstrating AI melody generation, harmonization, and style transfer. By 2023, tools like Suno AI and Udio were producing complete songs β vocals, instrumentation, lyrics β from text prompts, at a quality level that crossed the threshold of casual listenability.
For the performing arts specifically, AI composition has found its most interesting applications in three areas. First, in interactive performance, where a score needs to adapt in real time to what performers are doing β responding to tempo, dynamics, and improvisational choices. Second, in film and television scoring, where a composer may use AI to generate thematic sketches quickly across a large volume of scenes. Third, in immersive and installation work, where continuous generative music removes the concept of a "fixed" score entirely.
The copyright question is genuinely unresolved. ASCAP and BMI, the major US performing rights organizations, began reviewing their membership policies in 2023 to address whether AI-generated works could be registered. A piece with substantial AI contribution and minimal human creative selection may not be registerable β meaning it earns no performance royalties.
In April 2023, a song called "Heart on My Sleeve" β using AI-cloned voices of Drake and The Weeknd β went viral on TikTok and Spotify before being pulled. Universal Music Group issued takedowns. The producer, identified only as ghostwriter977, argued the song was commentary. The incident accelerated discussions at major performing rights organizations about how to handle AI-generated music in their catalogs.
Choreography sits in a particularly interesting position relative to AI. Dance notation systems β Laban notation, Benesh notation β are detailed enough to be processed computationally, but the corpus of digitized dance notation is small compared to text or music corpora, which means large AI models trained on movement data are still relatively limited.
The more developed applications use motion capture. In 2022, the choreographer Wayne McGregor worked with Google Arts & Culture to develop "Living Archive," a tool trained on 25 years of motion-capture data from McGregor's own company. The tool could generate movement suggestions in McGregor's style β his vocabulary, essentially β that he then responded to, modified, or rejected in the creation of new work. McGregor described it explicitly as a conversation with his own past self.
The authorship question in choreography is compounded by the fact that choreographic copyright in the US only fully covers work that has been "fixed" β notated or recorded. Improvised movement, however artistically significant, may not be protected. AI tools trained on publicly available performance recordings may be building on unprotected material without legal consequence.
Current US copyright law requires human authorship. AI output, without significant human creative selection and arrangement, is not protectable. For performing arts practitioners, this means that work produced with substantial AI involvement may enter the public domain the moment it is created β with no royalty protections, no licensing control, and no ability to prevent others from copying it.
The copyright and credit questions raised in Lesson 3 don't have clean answers β they depend on the specifics of how AI was used, how much human selection was involved, and what contracts say. In this lab, you'll work through a concrete creative scenario and apply the principles from the lesson.
Describe a specific creative project β real or hypothetical β where you might use or have used AI. The assistant will ask you probing questions about your contribution, the AI's role, and what copyright and credit implications follow. There are no right answers; the goal is clearer thinking.
In 2019, Catie Cuan, a roboticist at Stanford, published research demonstrating that audiences watching a robot perform simple movement sequences attributed emotional intent to the robot based solely on its motion dynamics β how it accelerated, paused, and changed direction. The robot had no face, no voice, and no representational appearance. And yet audiences consistently reported that it seemed "nervous" or "confident" or "sad." This is not evidence that robots can perform emotion; it is evidence that human audiences are extraordinarily skilled at projecting emotion onto motion. The distinction matters enormously when evaluating AI in live performance.
The performing arts have always rested on a paradox: the audience knows the performer is not really dying, not really in love, not really afraid β and yet they feel it. That paradox β sometimes called the "Stanislavski problem" β requires the performer to genuinely access internal states that then communicate through unpredictable channels of voice, micro-expression, breath, and presence. Current AI systems do not have internal states. They have outputs that can resemble those channels. The difference is not philosophical hairsplitting; it is an engineering reality that shapes what AI can and cannot do in live performance today.
Genuine Spontaneity. Improvisation in live performance is not random variation; it is responsive meaning-making in real time between performers and audience. AI systems can generate varied outputs, but they do not perceive a room, cannot sense the particular tension of a specific audience on a specific night, and cannot make the kind of risk-laden choice that gives improvisation its stakes. The Upright Citizens Brigade Theatre's training methods, for instance, rest on performers listening and responding to what is actually happening β not to a statistical average of what tends to happen. AI operates on statistical averages.
Embodied Presence. Philosophers of theatre from Jerzy Grotowski to Anne Bogart have argued that live performance is constituted by the shared physical presence of performer and audience β what Grotowski called the "holy actor" who risks something real. AI has no body to risk. Audience responses to live performance involve measurable physiological changes β heart rate synchronization between audience members, elevated cortisol levels during climactic moments β that appear linked to the biological reality of a human performer in the space. These effects are not fully replicated by screen mediation or robotic avatars under current conditions.
Contextual Repair. Experienced performers handle unexpected disruptions β a set piece that doesn't move, a missed cue, an audience member who speaks β by incorporating them. This is not just skill; it requires a real-time model of what the audience understands and what the story requires at that moment. Current AI performance systems operate on fixed response trees or probabilistic outputs that cannot genuinely adapt to a novel disruption of this kind.
Ethical Accountability. When a performer makes an artistic choice β a controversial staging, an interpretation of a character that challenges an audience β they can be held accountable, can explain themselves, and can revise their work in response to critique. An AI system that generates a staging choice has no capacity for that dialogue. The question of who is accountable for AI-generated artistic decisions (the programmer? the producer? the prompter?) is unresolved.
The limitations above are not arguments against using AI; they are arguments for using it in the right places. AI's genuine strengths in performing arts align precisely with where human limitations tend to bind.
Scale and variation. A human choreographer cannot generate 500 variations on a movement phrase in an afternoon. An AI system trained on motion data can. The choreographer's role becomes curatorial β selecting, refining, rejecting β which may itself be a creative act of high value.
Consistency across repetition. A recorded film score plays exactly the same way at the 500th screening as at the first. Generative AI can produce a score that is subtly different every time while remaining within a specified stylistic range β giving each audience a nominally unique experience without requiring live musicians for each showing.
Accessibility of production tools. Composing software that required a $10,000 hardware synthesizer in 1985 now runs on a laptop. AI music tools are moving the threshold further: a composer who cannot afford session musicians can approximate an orchestral texture for a fringe production. This democratization is real and has already changed the economics of small-scale performing arts production.
A 2023 study by researchers at the Royal Academy of Dramatic Art (RADA) in London found that trained actors could reliably distinguish AI-generated performance coaching from human coaching β specifically, by identifying the absence of responsive specificity: AI coaching gave accurate general advice but could not identify the specific physical mannerism that was undermining a scene. Human coaches could.
The word "yet" in the lesson title deserves honest examination. Some of the limitations described above β scale, consistency, accessibility β have already fallen to AI capability. Others β embodied presence, ethical accountability β may be permanently beyond AI's reach because they depend on properties of human consciousness that AI systems do not share and may never share.
The middle category is genuinely uncertain: contextual repair, genuine spontaneity, and the specific quality of performer-audience biological synchrony. Whether these yield to more sophisticated AI systems is not a question anyone can answer with confidence in 2024. The honest position is that the track record of "this will never be automated" predictions in creative fields is poor. Photography was going to end painting. Recorded music was going to end live concerts. Neither happened β but both transformed the field permanently.
What performing arts practitioners need is not a prediction but a practice: continuously reassessing which parts of their work AI can now do, which it cannot, and what the distinction means for how they train, how they contract, and what they choose to put in front of an audience.
AI has arrived in the performing arts through four simultaneous channels: production tools, creative generation, live performance systems, and labour displacement. The legal and contractual frameworks are still catching up. What AI cannot yet do β genuine spontaneity, embodied presence, contextual repair, ethical accountability β defines the space where human performers remain irreplaceable. How long that space holds is not a given. Knowing precisely what fills it is the beginning of a strategic response.
The module ends with a question about practice: what do you actually do with this knowledge? In this final lab, you'll conduct a personal "AI audit" for your performing arts work β identifying where AI capability currently sits relative to what you do, and starting to build a strategic response.
The assistant will ask you to be specific about your role, your work, and your concerns. It will then help you identify where AI is currently a useful tool, where it poses a genuine risk to your practice or livelihood, and where it simply isn't relevant yet. The goal is a clearer picture β not reassurance and not alarm.