Story generation at scale — mechanics, limitations, and the narrative uncanny valley.
When OpenAI released GPT-3 in 2020, researchers and writers immediately began experimenting with long-form narrative generation. The results were striking in their local coherence and unsettling in their long-range incoherence. The model could write beautiful prose for paragraphs, then contradict established facts, forget character names, or introduce inexplicable plot elements. Researchers called this the "narrative coherence cliff" — the point where local statistical fluency could no longer mask the absence of a governing intent.
Language models generate narrative by predicting token sequences over a finite context window. Three structural properties shape their narrative output:
Readers often report a sense of unease with AI-generated narrative — something that sounds right but feels wrong. This may reflect the model's lack of a governing narrative intent: the text is pursuing statistical likelihood, not meaning. Plot events occur because they are likely given prior text, not because they serve a larger purpose.
The same property that makes AI narrative convincing at the sentence level — statistical fluency — makes it hollow at the structural level. Good narrative requires intention, not just probability.
5 questions — free, untracked, retake anytime.
causes the 'narrative coherence cliff' in AI-generated stories?
is 'recency bias' in language model narrative generation?
does 'distributional averaging' mean for AI narrative voice?
do readers sometimes feel the 'narrative uncanny valley' with AI-generated text?
maintain long-range narrative coherence in AI-generated stories, writers need:
Develop scaffolding strategies for AI-assisted long-form narrative.
Analyze the structural limitations of AI narrative and develop strategies for overcoming them.
Co-authorship theory, emergent narrative, and the problem of creative credit.
Researcher Ross Goodwin drove from New York to New Orleans in 2017 with a camera, GPS, microphone, and clock all feeding real-time data into a neural network trained on road trip literature. The network generated a novel in real time as he drove. The result — "1 the Road" — was published as a book. The question of who wrote it has no clean answer: Goodwin designed the system, drove the route, curated the inputs. The network generated the text. The training data supplied the aesthetic vocabulary.
Some of the most interesting AI-collaborative narrative happens when the output is not fully controlled — where the system generates something the human didn't anticipate and couldn't have written. This "emergent narrative" is distinct from prompted generation: the AI is not executing a directive but producing unexpected material that the human then responds to, curates, or incorporates.
When is a work "AI-generated" vs. "AI-assisted" vs. "human-authored with AI tools"? The field has no consensus. Practical distinctions that have emerged:
The Goodwin case shows why: the human conceived, drove, curated, and published. The AI generated. The training data supplied. Credit is distributed and none of the traditional categories fully fit.
5 questions — free, untracked, retake anytime.
makes Ross Goodwin's '1 the Road' project an example of emergent narrative?
is curation considered a creative act in AI-collaborative work?
question best distinguishes 'AI-assisted' from 'AI-generated' work?
the Goodwin case, who or what contributed to authorship of '1 the Road'?
does 'responsive authorship' mean in AI-collaborative creative work?
Experience and analyze the authorship in AI-collaborative storytelling.
Experience and analyze emergent co-authorship.
Diffusion architecture, GANs, training data politics, and the commodification of style.
In 2023, artist Greg Rutkowski discovered that his name had become one of the most-used style prompts in Stable Diffusion — "in the style of Greg Rutkowski" appeared in millions of AI-generated images. Rutkowski, a living Polish fantasy artist, had not consented to his style being used as a training signal. "I'm afraid to search my own name," he told a reporter. The Stable Diffusion model had been trained on his work scraped from the internet without permission or payment.
Diffusion models work through a two-phase process: a forward process that progressively adds noise to training images until they become pure noise, and a reverse process trained to remove that noise step by step, guided by a conditioning signal (the text prompt). What the model learns is the mapping between text descriptions and visual statistical patterns.
Style is not copyrightable under current law — you cannot copyright "impressionism" or "the style of Vermeer." But the ability to generate convincing approximations of a living artist's distinctive style at near-zero cost has commercial implications that existing copyright law was not designed to address.
An artist's style is their commercial identity — the thing clients pay for. A tool that can generate "a Greg Rutkowski painting" on demand doesn't violate his copyright but does compete directly with his livelihood. Current law has no mechanism to address this.
5 questions — free, untracked, retake anytime.
diffusion models, what does the 'reverse process' learn to do?
does CLIP do in text-to-image generation?
does the Greg Rutkowski case represent a gap in current copyright law?
is LoRA fine-tuning in the context of image generation?
phrase 'commodification of style' in AI art means:
Develop a policy position on AI art and artists' rights.
Analyze the Rutkowski case and develop a policy position on AI art and living artists.
Deepfake architecture, detection arms races, and the epistemology of synthetic media.
In 2024, a finance worker in Hong Kong transferred $25 million after attending a video conference with what appeared to be his company's CFO and other executives — all of whom were, in fact, AI-generated deepfakes. The attacker had used publicly available photos and video of the real executives to create convincing synthetic avatars. No technical detection was used by the victim; the deception was social and visual.
Deepfake video typically uses GANs (Generative Adversarial Networks): a generator that produces synthetic images and a discriminator that tries to detect fakes. They train against each other — the generator improves by fooling the discriminator; the discriminator improves by catching the generator. After sufficient training, the generator produces output the discriminator (and often humans) cannot reliably distinguish from real.
Deepfake detection tools have consistently lagged behind generation quality. Detection relies on artifacts — compression anomalies, facial inconsistencies, blinking patterns — that disappear as generation improves. Researchers have moved toward content provenance as a more durable approach: cryptographic signatures embedded at creation that verify authenticity, rather than trying to detect forgery after the fact.
The Coalition for Content Provenance and Authenticity (C2PA) is developing an open standard for embedding cryptographic provenance information into media files at creation — a kind of digital chain of custody. Adopted by Adobe, Microsoft, and major camera manufacturers. The challenge: forged media doesn't carry this signature, so absence of provenance becomes suspicious but not definitive.
5 questions — free, untracked, retake anytime.
does GAN training produce increasingly convincing deepfakes?
do deepfake detection tools consistently lag behind generation quality?
is the C2PA provenance approach, and why is it more durable than detection?
the Hong Kong $25M deepfake case, what made the attack successful?
does 'absence of C2PA provenance' not definitively prove a media file is synthetic?
Develop technical and policy responses to synthetic media threats.
Analyze the deepfake problem from both technical and policy perspectives.
Copyright doctrine, fair use, authorship theory, and the legal frontier of AI creative output.
The legal landscape in 2023-24: The US Copyright Office published guidance stating that AI-generated content lacks human authorship and cannot be copyrighted, while human creative selection and arrangement of AI output can be. The Authors Guild sued OpenAI. Getty Images sued Stability AI. Sarah Silverman sued Meta and OpenAI. The New York Times sued OpenAI and Microsoft, alleging that verbatim memorization of their content constituted infringement — producing in discovery evidence that the model could reproduce NYT articles verbatim when prompted. No cases had reached final verdict as of early 2025.
Three distinct legal questions have emerged:
Copyright law was built on Romantic authorship theory — the idea of a singular creative genius producing original work. AI challenges this at every level: training (no single author), output (no human author), style (drawn from many humans). Legal philosopher Shyamkrishna Balganesh has argued that AI output requires a new authorship category that can accommodate distributed, human-machine creative processes.
If a human writes detailed prompts, curates AI output extensively, edits and arranges the results — how much human creative input is sufficient for copyright protection? The Copyright Office has not answered this question, and the courts haven't either.
5 questions — free, untracked, retake anytime.
did NYT v. OpenAI introduce as evidence that potentially undermines the 'transformativeness' fair use argument?
current US Copyright Office guidance, what determines whether AI-assisted work can be copyrighted?
does 'style is not copyrightable' create a gap in legal protection for AI and living artists?
does Shyamkrishna Balganesh's argument about AI authorship propose?
'fair use' defense for AI training on copyrighted works primarily rests on which argument?
Develop a policy framework for AI copyright and training data rights.
Analyze the unresolved legal questions and develop a policy framework.
Augmentation theory, creative labor markets, and the political economy of AI creativity.
McKinsey's 2023 economic analysis estimated that generative AI could automate 60-70% of the tasks currently performed by knowledge workers. For creative workers specifically, the pattern was asymmetric: senior creative directors and art directors saw their productivity multiply; junior illustrators, stock photo contributors, entry-level copywriters, and session musicians saw their markets shrink rapidly. The same technology augmented at the top and replaced at the bottom of the creative labor market simultaneously.
AI creative tools do not affect all creative workers equally. The pattern that has emerged:
The junior roles being eliminated are not just jobs — they are training grounds. Illustrators learn by doing thousands of illustrations; copywriters learn by writing thousands of pieces of copy. If AI eliminates the market for junior-level execution work, it potentially breaks the pipeline that produces future senior creatives.
If AI eliminates the entry-level roles that train future concept-level creative directors, who does the concept-level work in ten years? The augmentation argument assumes a pipeline that AI may be disrupting.
5 questions — free, untracked, retake anytime.
was the asymmetric pattern of AI creative tools on the labor market in 2023?
does the elimination of junior creative roles create a structural risk beyond just job loss?
'concept-level workers benefit' argument assumes:
does McKinsey's 60-70% task automation estimate mean for creative workers specifically?
does it mean to say AI 'breaks the learning pipeline' in creative work?
Analyze AI's asymmetric impact on creative labor markets.
Analyze the political economy of AI creative tools and develop a policy response.
Consent, training data, artists' rights, and the moral philosophy of AI creativity.
Sarah Andersen, Kelly McKernan, and Karla Ortiz — three artists whose styles had been heavily used as prompts in AI image generation — filed a class action lawsuit against Stability AI, Midjourney, and DeviantArt in 2023. Their attorney argued that the models had been trained on their work without consent, and that the ability to generate images "in their style" constituted a form of theft. The defendants argued that training is transformative and that style isn't copyrightable. The case raised questions that copyright law wasn't designed to answer.
AI image models were trained on billions of images scraped from the internet — including work from living artists who had made their work publicly accessible for viewing, not for AI training. The ethical question precedes the legal one: even if scraping is legal, is it ethical to use an artist's creative output to build a commercial tool that competes with them?
Beyond copyright: does training on an artist's work without consent violate something that copyright law doesn't protect? Philosopher Nils-Hennes Stear argues that there is a moral right to creative attribution and creative integrity that is distinct from copyright — and that using someone's creative work to train a competitive tool violates both, regardless of legal status.
Making creative work publicly accessible does not constitute consent to have it used for every downstream purpose. The same principle applies to data: sharing data publicly is not the same as consenting to that data being used to train a commercial AI competitor.
5 questions — free, untracked, retake anytime.
is the central ethical argument of artists suing AI image companies, beyond copyright?
does an 'opt-in' framework for AI training data mean?
distinguishes the Adobe Firefly and Getty Images approach from standard practice?
Nils-Hennes Stear argues that training on artists' work without consent violates:
is the key principle regarding public accessibility and consent to AI training?
Develop an ethical framework for AI art and training data.
Develop your ethical and policy framework for AI training data and artists' rights.
Practical prompting, iterative creation, and building a human-AI creative practice.
Author Robin Sloan has written publicly about using language models in his creative process since 2021. His practice: generate large amounts of raw material (thousands of words of rough prose, alternative scenes, character dialogue in different registers), then curate aggressively, extracting the phrases and images that feel genuinely surprising or useful. "I'm not looking for the AI to write my novel," he said in an interview. "I'm looking for it to show me things I wouldn't have thought of, so I can decide if they're interesting."
The most productive AI-assisted creative practices share common features:
A practical framework for AI-assisted narrative creation:
AI is a raw material generator. You are the author. The work is yours when you've provided the direction, made the curatorial decisions, and written the voice. That's what authorship has always meant.
5 questions — free, untracked, retake anytime.
does Robin Sloan use AI for in his creative process?
does generating 5-10x more than you need improve AI-assisted creative work?
is the purpose of an external world-state document in AI-assisted long-form narrative?
'voice protection' principle in AI-assisted creative work means:
the iterative creation protocol, what is the function of Phase 1 (Direction)?
Develop your personal AI-assisted creative framework.
Develop your personal framework for AI-assisted creative work.
8 questions covering all lessons. Free, untracked, retake anytime.
structural property causes AI narrative to lose coherence over long distances?
Ross Goodwin's '1 the Road', authorship was:
fine-tuning enables:
C2PA provenance standard addresses synthetic media by:
learning pipeline problem in AI creative labor means:
ethical principle 'public accessibility is not consent to training' means:
the iterative creation protocol, Phase 4 (Voice) is:
NYT verbatim memorization evidence in its lawsuit against OpenAI was significant because: