On January 1, 2019, Gabonese president Ali Bongo Ondimba appeared in a televised New Year's address — the first time the public had seen him in months following a suspected stroke. Within days, analysts and opposition leaders began claiming the video was a deepfake. The Gabonese military cited the suspicious video as partial justification for a coup attempt on January 7. Independent investigators later concluded the video was probably authentic, but the damage was done: a real video had been dismissed as fake, triggering real political violence.
The episode illustrated a phenomenon researchers now call the "liar's dividend" — even if deepfakes themselves cause harm, the mere existence of the technology lets bad actors dismiss genuine evidence by simply saying "that's a deepfake."
The word "deepfake" combines deep learning — a form of AI — with fake. The technique was first popularized in late 2017 when a Reddit user named "deepfakes" began posting AI-generated face-swap pornography using celebrity faces. The underlying method is called a Generative Adversarial Network (GAN).
A GAN pits two neural networks against each other. The generator creates synthetic images; the discriminator tries to detect which images are fake. Through millions of training rounds, the generator keeps improving until its fakes fool the discriminator — and often, human eyes as well. Modern deepfakes also use diffusion models and face-swapping encoders that can map one person's facial geometry onto another person's video in real time.
Audio deepfakes — sometimes called voice clones — work similarly, training on recordings of a person's speech to produce new utterances they never made. In 2023, robocalls using a cloned voice of President Biden were sent to New Hampshire voters telling them not to vote in the primary election. The Federal Communications Commission traced the calls and fined the perpetrators $6 million.
Deepfakes are not hypothetical threats. The nonprofit Security Research Labs documented over 500,000 deepfake videos online in 2023. Roughly 96% depicted non-consensual intimate imagery — a serious crime in many jurisdictions. A smaller but growing share targets politicians, journalists, and corporate executives for fraud and disinformation.
Deepfakes emerged from open-source AI research, spread rapidly without regulatory guardrails, and now affect elections, financial markets, and personal safety. Understanding the technology — not just the danger — is the first step toward effective detection and defense.
You're investigating the history and mechanics of deepfake technology. Use the AI assistant below to explore questions about GANs, the timeline of deepfake development, and the real-world incidents from Lesson 1. Ask at least three questions to complete the lab.
On March 16, 2022, a deepfake video appeared on hacked Ukrainian TV and news websites showing President Volodymyr Zelensky telling Ukrainian soldiers to "lay down their weapons and surrender." The video was technically crude — Zelensky's head appeared too large for his body, his speech pattern was unnatural — but it still spread rapidly on Telegram and Facebook before platforms could remove it.
Zelensky responded within hours with a real video from his office, mocking the fake. Meta and YouTube removed the deepfake, but not before it had been viewed hundreds of thousands of times. Military analysts noted the video's primary goal was not to fool soldiers but to create confusion and fear — to make Ukrainians doubt what was real.
Deepfakes are not made by a single type of actor. Research from the Sensity AI threat intelligence firm categorizes creators into four rough groups:
The largest category by volume. Predominantly targets women. Apps like DeepNude (banned in 2019) and its successors allow users to remove clothing from photos. UK law criminalized this in 2023; the US DEFIANCE Act passed in 2024.
In 2020, criminals used a deepfake audio of a company director's voice to authorize a $35 million bank transfer at a Hong Kong bank. Similar CEO voice fraud has been reported in the UK and UAE.
State-sponsored and independent actors use deepfakes to discredit political opponents. Slovakia's 2023 election saw audio deepfakes of liberal candidate Michal Šimečka supposedly discussing vote-buying — released 48 hours before the election, too close to deadline to debunk effectively.
Legal, labeled deepfakes for comedy shows, art projects, and film production. These legitimate uses create public familiarity with the technology — but also normalize the sight of famous faces saying unexpected things.
Researchers at MIT Media Lab found that false information spreads six times faster on Twitter/X than true information — and deepfakes add a visual dimension that text corrections cannot easily counter. Three platform dynamics accelerate deepfake spread:
Algorithmic amplification: Outrage and novelty drive engagement. A video of a world leader appearing to surrender generates more clicks than a quiet fact-check post. Platforms optimized for engagement reward sensational content.
Cross-platform laundering: A deepfake posted on Telegram can be screenshot and reshared on WhatsApp, Instagram, and TikTok — each hop losing the original context and making removal harder.
Correction asymmetry: The original deepfake can be shared indefinitely for free. Effective corrections require coordinated fact-checkers, platform cooperation, and media coverage — all of which cost time and money and reach fewer people than the original fake.
Two days before the September 30 election, audio deepfakes circulated on Facebook appearing to show progressive candidate Michal Šimečka discussing how to rig the election. Meta's fact-checkers could not respond in the mandatory 48-hour pre-election media silence window. Šimečka lost by a narrow margin. Researchers cannot prove the deepfakes changed the outcome, but they document it as the first election where a deepfake surfaced too close to voting day to be debunked in time.
Creating a convincing deepfake in 2017 required thousands of training images and days of GPU time. By 2023, apps like FaceSwap, DeepFaceLab, Reface, and commercial services on Telegram bots could produce face-swapped videos in minutes from a single photograph. Voice cloning services like ElevenLabs require as little as three seconds of audio to clone a voice — a fact that led the company to implement abuse controls after its voice engine was used to clone public figures without consent.
The democratization of deepfake creation means the threat is no longer limited to well-funded state actors. A determined individual with a consumer-grade laptop can now produce media that, without specialized detection tools, is indistinguishable from authentic footage.
Deepfakes spread through a combination of algorithmic amplification, platform cross-posting, and the fundamental asymmetry between viral misinformation and slow corrections. Understanding who makes them — and why the platforms they travel through reward engagement over accuracy — is essential to understanding why detection and media literacy matter.
You're mapping the ecosystem of deepfake spread. Use the AI to explore platform dynamics, case studies from the lesson, and the structural reasons corrections fail. Ask at least three questions to complete the lab.
In 2016, the US Defense Advanced Research Projects Agency (DARPA) launched the Media Forensics (MediFor) program with a specific mandate: build AI tools that could automatically assess the integrity of photos and videos. Over five years and tens of millions of dollars, DARPA-funded teams at MIT Lincoln Laboratory, SRI International, and other research groups developed detection systems that could identify compression artifacts, inconsistent lighting, and pixel-level anomalies invisible to human eyes.
The program produced genuinely capable tools — but also surfaced a fundamental problem. As detection methods improved, so did the deepfakes. Adversarial training — feeding deepfake generators feedback from detectors — produced synthetic media specifically optimized to evade those detectors. The technology became a true arms race.
Trained observers can spot deepfakes using visual cues that AI models still struggle to perfectly replicate. However, these cues become less reliable as the technology improves. Key tells that worked in 2020–2022 include:
Each of these tells has been substantially reduced or eliminated in state-of-the-art deepfakes made in 2023–2024. The detection cues above are useful for spotting lower-quality or quickly made fakes, but should never be the sole basis for concluding a video is real. Absence of visible artifacts does not confirm authenticity.
Several AI-powered detection tools are now publicly available or widely deployed by platforms:
Microsoft's Video Authenticator (2020) analyzes videos frame-by-frame for subtle blending artifacts and produces a confidence score. Microsoft released it specifically to counter election deepfakes but acknowledged it would become less effective as deepfake quality improved.
Deepware Scanner and FakeCatcher (Intel, 2022) use different methods — FakeCatcher analyzes blood-flow signals (rPPG) detectable in pixel color changes in real human skin that synthetic faces cannot replicate. Intel claimed 96% accuracy in testing, though independent verification in real-world conditions is harder to achieve.
In 2023, researchers at the University of Buffalo published a corneal reflection method: the eyes of real people reflect a consistent image of the environment; deepfake eyes show inconsistent or physically impossible reflections. This method was highly accurate at launch but was expected to be patched by generative models within months.
Every publicly released detector becomes a training signal for the next generation of deepfakes. This is not theoretical — researchers demonstrated in 2020 that adversarially trained deepfakes specifically designed to fool Microsoft's Video Authenticator could reduce its accuracy from 90% to below 50% with minimal degradation in visual quality visible to humans.
This means detection cannot be a static solution. The most reliable long-term approaches rely less on spotting artifacts and more on provenance — tracking where media came from, not just what it looks like.
Both human and automated deepfake detection work — imperfectly, temporarily, and only as long as deepfake technology isn't specifically trained to evade them. Smart detection combines visual inspection with source verification, platform context, and metadata analysis. No single method is reliable on its own.
You're a media forensics analyst reviewing suspicious videos. Use the AI to practice applying detection methods from Lesson 3. Describe a hypothetical video scenario and ask what detection approaches to use, or explore why each method has limitations. Ask at least three questions to complete the lab.
In 2021, Adobe, Microsoft, Intel, Twitter, and the BBC co-founded the Coalition for Content Provenance and Authenticity (C2PA). Their goal was not to detect fakes after the fact but to create a cryptographic chain of custody for authentic media from the moment of capture.
By 2024, Sony and Leica had released cameras that embed C2PA Content Credentials — a cryptographic signature — into every photo at the moment of shutter press. News agencies including the Associated Press began testing the system. The credentials travel with the image and can be verified at any C2PA-compliant platform, showing where a photo was taken, on what device, and whether it has been edited.
The system is not foolproof — a deepfake could be photographed with a C2PA camera, acquiring false credentials — but it represents the first serious infrastructure for media provenance at scale.
Laws governing deepfakes have developed unevenly. As of 2024, the major legislative actions include:
Signed into law in July 2024. Allows victims of non-consensual intimate deepfakes to sue creators and distributors in federal court. Specifically covers AI-generated content. Passed with bipartisan support following the Taylor Swift incident.
Requires deepfakes to be labeled as AI-generated content. Platforms must disclose when content is synthetic. Violations carry fines of up to 3% of global annual turnover. Effective from 2025.
Criminalized sharing non-consensual intimate deepfakes. The 2024 Criminal Justice Bill went further, also criminalizing the creation of such content — not just its distribution.
China's Cyberspace Administration requires deepfakes to carry visible labels and prohibits using them to spread false news. Platforms must verify users before allowing deepfake creation — one of the strictest frameworks globally.
Three technical approaches are being deployed at scale:
Invisible watermarking: Google's SynthID (2023) embeds imperceptible watermarks into AI-generated images and audio produced by Google's Gemini and Imagen tools. The watermark survives screenshots, compression, and mild editing, and can be detected by SynthID tools. Adobe's Content Authenticity Initiative uses a visible "Content Credentials" badge that links to provenance data.
Cryptographic provenance (C2PA): As described above, this embeds signed metadata at the moment of creation. The AP, Reuters, and major broadcasters have committed to using C2PA-compliant cameras and editing software for news photography.
Platform-level detection: Meta deploys automated deepfake detection and labels AI-generated content in its political advertising policy. YouTube requires creators to disclose when uploaded content uses AI to "realistically alter or generate" people or events.
No policy or technology eliminates the need for individual verification habits. For any surprising video of a public figure: (1) Check if the original source is a verified account or official channel. (2) Search for the same clip on established news sites — if only fringe outlets have it, that's a red flag. (3) Look for a C2PA "Content Credentials" badge on platforms that support it. (4) Use reverse image/video search (InVID, Google, TinEye) to find the original context. (5) Apply the deepfake visual checklist from Lesson 3 — not as definitive proof but as one signal among many.
Beyond law and technology, the deepfake problem has a consent dimension. In 2023, the Screen Actors Guild (SAG-AFTRA) went on strike partly over studios' attempts to use AI to replicate actors' likenesses without compensation or consent. The resulting contract established that studios must obtain explicit, informed consent and negotiate compensation for any AI replication of a performer's voice or appearance.
This principle — that a person's likeness requires consent — is increasingly being recognized in law. California's AB 2602 (2024) specifically protects performers from posthumous AI replication of their likeness without estate consent. These frameworks establish that the ethical use of deepfake-adjacent technology begins with who gave permission, not just whether the output looks realistic.
Defending against deepfakes requires three layers working simultaneously: technical standards (C2PA, watermarking), legal frameworks (DEFIANCE Act, EU AI Act), and individual media literacy habits (source verification, visual inspection, provenance checking). No single layer is sufficient — the most resilient defense combines all three.
You're advising a news organization on deepfake defense policies. Use the AI to explore C2PA implementation, legal obligations under the EU AI Act, and how to train staff in verification habits. Ask at least three questions to complete the lab.