L1
Β·
Quiz
Β·
Lab
L2
Β·
Quiz
Β·
Lab
L3
Β·
Quiz
Β·
Lab
L4
Β·
Quiz
Β·
Lab
Module Test
Module 6 Β· Lesson 1

The Dual-Use Dilemma

When the same discovery heals and destroys
How has history shown that knowledge with peaceful applications can be weaponized β€” and what does AI add to this ancient problem?

In the autumn of 2011, virologist Ron Fouchier at Erasmus Medical Center in Rotterdam submitted a paper to Science describing how his team had engineered H5N1 avian influenza to transmit between ferrets via respiratory droplets. A parallel team led by Yoshihiro Kawaoka at the University of Wisconsin submitted similar findings to Nature. Both papers were immediately flagged by the U.S. National Science Advisory Board for Biosecurity β€” not for fraud, but for being too dangerous to publish in full. The research was legitimate science aimed at understanding pandemic risk. The methods were also a potential blueprint for a catastrophic biological weapon.

The debate that followed β€” over whether the journals should redact the methods sections β€” was the most visible public confrontation with dual-use research of concern in the modern era. Both papers were eventually published, largely intact.

What Dual-Use Research Means

The concept predates modern science. The same metallurgy that forged plows forged swords. The Haber-Bosch process, developed by Fritz Haber and Carl Bosch in the early twentieth century, enabled nitrogen fixation that feeds roughly half of humanity today β€” and also enabled industrial-scale production of explosives and chemical weapons. Haber personally supervised Germany's first large-scale chlorine gas attacks at Ypres in April 1915.

In contemporary policy, dual-use research of concern (DURC) refers specifically to research that, while conducted for legitimate purposes, could be misused to threaten public health, safety, security, or other significant values. The U.S. government formalized this definition in its 2012 DURC policy, requiring institutional review for certain life-sciences research categories.

The challenge is not that bad actors do bad research. It is that good actors doing good research produce knowledge that bad actors can exploit. This asymmetry is the core of the dual-use problem.

The Mousepox Case β€” 2001

Australian researchers Ron Jackson and Ian Ramshaw, working on a mouse contraceptive, inadvertently created a hyper-lethal mousepox virus by inserting the IL-4 gene. The modified virus killed mice that had been vaccinated against the normal strain. The implications for smallpox β€” a pathogen with a known genome and extinct in the wild β€” were immediately alarming. The researchers published their findings in Journal of Virology, later stating they had not considered the dual-use implications before submission.

Why AI Changes the Calculus

Prior to AI, dual-use knowledge posed a diffusion problem: dangerous methods existed in academic papers and specialist communities, but accessing, understanding, and operationalizing them required significant technical expertise and often expensive laboratory infrastructure. The barrier was not the knowledge itself but the human capital required to act on it.

Large language models and AI-assisted design tools compress this barrier. In 2023, a study commissioned by the Johns Hopkins Center for Health Security found that an AI chatbot could provide "meaningful uplift" to individuals seeking to synthesize dangerous pathogens β€” not by providing information unavailable in literature, but by providing synthesis, guidance, and troubleshooting in a conversational format that dramatically reduced the expertise threshold. The study was partially redacted before publication.

Similar dynamics apply to cyber intrusion tooling, disinformation generation, materials science for improvised weapons, and autonomous system design. In each case, AI does not necessarily introduce new knowledge β€” it democratizes access to existing knowledge in ways that legacy export controls and classification regimes were not designed to handle.

DURC Dual-Use Research of Concern β€” legitimate research that could be directly misapplied to pose significant threats.
Uplift The degree to which AI assistance increases an actor's capability beyond what they could achieve without it.
Information Hazard (infohazard) True information whose disclosure creates risk of harm, regardless of intent.
The Spectrum of Dual-Use Concern

Not all dual-use risk is equivalent. Policymakers and researchers have developed rough taxonomies. At one end: basic science with theoretical misuse potential (e.g., published protein folding data). At the other: specific synthesis routes for select agents with no plausible civilian application in the form described.

The 2017 Nunn-Lugar Cooperative Threat Reduction program extension explicitly recognized AI-enabled biology as a new threat vector, funding detection research at DOE national laboratories. DARPA's Safe Genes program, launched the same year, developed safeguards for gene-editing technologies that are inherently dual-use.

For AI specifically, the spectrum runs from: general-purpose language models that can answer chemistry questions, to specialized models trained on restricted literature, to purpose-built AI systems for molecular design such as those used in legitimate pharmaceutical discovery β€” and potentially in bioweapons design if the objective function is reversed.

AlphaFold & Biosecurity

DeepMind's AlphaFold 2, released publicly in 2021, solved the protein structure prediction problem that had stumped biology for fifty years. The database now contains over 200 million predicted structures. The biosecurity community immediately noted that the same tool enabling vaccine development could assist in designing novel protein-based toxins or engineering pathogen proteins for enhanced virulence or immune evasion. No restrictions were placed on public access.

Governance as It Stood Before AI

Pre-AI dual-use governance relied on several interlocking mechanisms: export controls (the Wassenaar Arrangement, Export Administration Regulations) restricting transfer of controlled technologies; select agent regulations limiting who may work with dangerous pathogens; pre-publication review by institutional biosafety committees; and classification of the most sensitive government-funded research.

Each mechanism assumed a relatively slow diffusion of technical knowledge through a credentialed professional community. AI disrupts that assumption at multiple points simultaneously β€” by making existing published literature more actionable, by enabling non-credentialed actors to navigate technical domains, and by potentially generating novel dangerous knowledge rather than merely retrieving existing knowledge.

Lesson 1 Quiz

The Dual-Use Dilemma β€” check your understanding
The Fouchier/Kawaoka H5N1 controversy in 2011–12 centered on which specific concern?
Correct. NSABB flagged both papers not for fraud but because the methods sections described how to make H5N1 airborne-transmissible β€” dual-use of concern in its clearest form.
Incorrect. The research was scientifically valid. The concern was that accurate methods could be weaponized, not that they were false.
Fritz Haber's career illustrates the dual-use problem because he:
Correct. Haber-Bosch enables modern agriculture; Haber also deployed chlorine gas at Ypres. One scientist, one chemical knowledge base, two radically different applications.
Incorrect. Haber co-developed nitrogen fixation (feeding humanity) and personally led Germany's first industrial chemical weapons campaign in WWI β€” the same chemistry, opposite purposes.
In the context of AI and dual-use risk, "uplift" refers to:
Correct. Uplift is the key policy concept β€” AI's marginal contribution to a threat actor's capability is what matters for risk assessment, not merely whether harmful information exists somewhere.
Incorrect. Uplift measures the capability gap AI closes for a would-be bad actor β€” how much closer to a harmful goal AI assistance brings someone who couldn't otherwise get there.
Which statement best describes how AI disrupts pre-existing dual-use governance frameworks?
Correct. Legacy governance assumed that dangerous technical knowledge was difficult to operationalize without specialist training. AI compresses that barrier by making existing knowledge more accessible and actionable.
Incorrect. The primary concern is democratization β€” making existing dangerous knowledge actionable for actors who previously lacked the expertise to use it, not necessarily generating wholly new knowledge.

Lab 1 β€” Mapping the Dual-Use Spectrum

Discuss dual-use concepts with your AI tutor Β· Complete 3 exchanges to finish

Your Task

Explore the concept of dual-use research of concern with a focus on how AI changes the traditional uplift calculus. Consider the Fouchier case, AlphaFold, and the expertise-threshold problem.

Suggested starting point: "Explain why AlphaFold's public release raised biosecurity concerns even though it's a tool for understanding protein structures, not making pathogens."
AI Tutor β€” Dual-Use Research
Module 6 Β· L1
Welcome to Lab 1. We're exploring the dual-use dilemma β€” specifically how AI tools like AlphaFold or large language models change the risk calculus for dangerous knowledge. What aspect would you like to dig into first?
Module 6 Β· Lesson 2

AI-Enabled Biology: The Sharpest Edge

Molecular design, gain-of-function, and the speed of artificial synthesis
How have AI tools for drug discovery and protein design blurred the line between pharmaceutical research and bioweapon development β€” and what have real incidents revealed?

In March 2022, researchers at Collaborations Pharmaceuticals published a paper in Nature Machine Intelligence describing an experiment they had conducted as a thought exercise for a biosecurity conference. Using their AI drug-discovery model, MegaSyn, they inverted the objective function β€” rather than optimizing for low toxicity, they tasked the model with identifying molecules with high toxicity potential. Within six hours, the model had generated approximately 40,000 candidate molecules. Many scored higher on predicted toxicity than known chemical warfare agents, including VX nerve agent. The team declined to publish the full output. The paper's conclusion was stark: the same AI infrastructure used to discover life-saving drugs could be trivially repurposed for offensive chemistry.

How AI Drug Discovery Works β€” and Why It's Dual-Use

Modern AI drug discovery platforms β€” including SchrΓΆdinger's physics-based modeling suite, Recursion Pharmaceuticals' phenomics platform, and academic tools like RoseTTAFold β€” operate on a core paradigm: given a target protein and a desired interaction profile, generate candidate molecules likely to bind and produce a desired effect. The AI learns from vast databases of known molecule-effect relationships.

The dual-use problem emerges from a simple observation: the model does not know or care whether the desired effect is therapeutic or lethal. "Inhibit this receptor" and "maximally disrupt this receptor" are both valid query framings. The molecular design process is the same. The objective function is what differs.

Beyond small molecules, AI is transforming protein engineering. Tools like ProteinMPNN and RFdiffusion (both from the Baker Lab at University of Washington) can design novel proteins with specified structural and functional properties. In 2023, the Baker Lab demonstrated de novo protein binders for influenza hemagglutinin β€” a major advance for antivirals. The identical approach could be used to design proteins that enhance pathogen binding to human receptors.

The Gain-of-Function Debate

Gain-of-function (GOF) research deliberately enhances pathogen characteristics β€” transmissibility, virulence, immune evasion β€” to study pandemic risk and develop countermeasures. The NIH imposed a funding moratorium on certain GOF research in 2014, lifted in 2017 with new oversight requirements under the P3CO (Potential Pandemic Pathogen Care and Oversight) framework. AI accelerates GOF by enabling computational prediction of which mutations would achieve desired functional changes before any laboratory work is done β€” expanding what can be explored at near-zero cost before expensive wet-lab confirmation.

The Nucleic Acid Synthesis Problem

Even a perfectly designed biological agent requires physical synthesis. For pathogens, this means synthesizing the nucleic acid sequences that encode the agent. The commercial DNA synthesis industry has grown dramatically, with companies like Twist Bioscience, IDT, and Genscript capable of producing long DNA sequences on demand.

In 2022, the Nuclear Threat Initiative (NTI) published an assessment finding that biosecurity screening practices varied widely across synthesis providers and that the International Gene Synthesis Consortium's voluntary screening protocol covered only a fraction of global capacity. AI compounds this risk by generating optimized sequences that might evade signature-based screening β€” sequences with the same functional effect as a dangerous pathogen gene but different enough in sequence to avoid detection flags.

The Biden administration's Executive Order 14110 (October 2023) addressed this directly, requiring federal agencies to develop minimum standards for nucleic acid synthesis screening β€” the first federal policy intervention specifically targeting AI-enabled bioweapons risk in the synthesis supply chain.

Gain-of-Function Research that alters an organism to give it new or enhanced abilities, including increased transmissibility or virulence.
De Novo Protein Design Creating entirely new protein sequences with specified functional properties, now tractable via AI tools like RFdiffusion.
Nucleic Acid Synthesis Screening Checking DNA/RNA synthesis orders against databases of dangerous sequences before fulfillment.
Biosecurity Community Response

The biosecurity community has not been passive. The Johns Hopkins Center for Health Security, NTI Bio, and SecureBio (a nonprofit dedicated to AI biosecurity) have each published frameworks for evaluating AI biosecurity risk. SecureBio's Biological Hazard Assessment for AI (BHA-AI) framework, developed with former intelligence community officials, evaluates AI systems across five dimensions: knowledge provision, task completion, access facilitation, quality uplift, and speed uplift.

DARPA's Biological Technologies Office has funded work on metagenomic monitoring β€” environmental surveillance systems capable of detecting engineered pathogens in real time. The logic: if AI makes creation easier, detection infrastructure must become faster and broader.

Leading AI laboratories have begun implementing domain-specific safeguards. Anthropic, OpenAI, and Google DeepMind have each published biosecurity policies restricting detailed synthesis guidance for dangerous pathogens. Whether these policies are effective under adversarial prompting remains an active area of red-team research.

The Nucleotide BLAST Limitation

Traditional screening compares submitted sequences against known dangerous sequences using BLAST (Basic Local Alignment Search Tool). AI-designed sequences can be functionally equivalent to dangerous agents while being sufficiently different in sequence to evade BLAST matches. In 2023, MIT researchers demonstrated this in a controlled study β€” designing functional analogs to toxin-encoding genes that passed standard screening. Their paper argued for AI-based screening to counter AI-based evasion.

Lesson 2 Quiz

AI-Enabled Biology β€” check your understanding
The Collaborations Pharmaceuticals MegaSyn experiment demonstrated that AI drug-discovery tools could be weaponized by:
Correct. The same model, same training data, same infrastructure β€” only the objective was changed. The model generated ~40,000 high-toxicity candidates in six hours, many exceeding VX in predicted lethality.
Incorrect. No classified data was involved. The team simply reversed the optimization target from "low toxicity" to "high toxicity" using the same drug-discovery AI platform already in production use.
Why does AI pose a specific challenge to nucleic acid synthesis screening systems that use sequence-comparison tools like BLAST?
Correct. BLAST-type screening looks for known dangerous sequences. AI can generate novel sequences with the same dangerous function but different enough nucleotide composition to pass screening β€” demonstrated in controlled research settings.
Incorrect. The concern is about sequence design, not database intrusion. AI-generated functional analogs can evade signature matching because they look different while doing the same dangerous biological thing.
The NIH's 2014 moratorium and subsequent 2017 P3CO framework addressed which category of research?
Correct. The P3CO framework established review requirements for research that enhances pathogen transmissibility, virulence, or immune evasion β€” exactly the characteristics AI can now help predict computationally before wet-lab work begins.
Incorrect. The P3CO framework governs potential pandemic pathogen gain-of-function research β€” experiments that deliberately enhance pathogen characteristics like transmissibility or lethality. AI now makes the computational phase of such work dramatically faster and cheaper.
Executive Order 14110 (October 2023) addressed AI-enabled bioweapons risk specifically by:
Correct. EO 14110 was the first federal policy to directly target the AI-synthesis supply chain intersection, directing agencies to establish baseline screening standards for DNA synthesis providers.
Incorrect. EO 14110 targeted the synthesis supply chain β€” requiring screening standards for nucleic acid synthesis orders to prevent AI-designed dangerous sequences from being physically produced.

Lab 2 β€” AI Biology Risk Assessment

Discuss AI-enabled biological risks with your AI tutor Β· Complete 3 exchanges to finish

Your Task

Engage with the specific dual-use risks from AI tools in biology β€” drug discovery platforms, protein design, and the synthesis supply chain. Apply the concept of objective-function inversion and screening evasion.

Suggested starting point: "How does inverting an AI drug discovery objective function differ in risk from a scientist manually searching the literature for toxic compounds?"
AI Tutor β€” AI-Enabled Biology
Module 6 Β· L2
Ready to explore AI-enabled biological risks. The MegaSyn case and the synthesis screening problem are both excellent entry points. What would you like to examine first?
Module 6 Β· Lesson 3

Cyber, Autonomy, and Materials

Dual-use beyond biology β€” code, weapons, and things that go wrong at scale
How do AI-enabled dual-use risks in cybersecurity, autonomous systems, and advanced materials differ from the biological case β€” and what governance mechanisms apply?

In July 2023, cybersecurity researchers at SlashNext documented WormGPT, a large language model fine-tuned on malware data and made available on criminal forums for a monthly subscription. Unlike frontier models with safety training, WormGPT had no content restrictions and would assist in writing malicious code, crafting business email compromise attacks, and advising on exploitation techniques. Shortly after, a second tool, FraudGPT, appeared on Telegram offering similar capabilities. Both were advertised as enabling "everything you feared ChatGPT could do." These were not sophisticated nation-state tools β€” they were commoditized, subscription-based, and available to low-skill actors.

AI and Offensive Cyber

The cybersecurity dual-use problem is the oldest in the digital domain. Penetration testing tools are identical in function to attack tools. Vulnerability research produces knowledge that can be used for patching or for exploitation. AI extends this tension across several dimensions:

Code generation: In 2023, researchers at CyberArk demonstrated that GPT-4 could assist in developing polymorphic malware β€” code that continuously rewrites itself to evade signature detection. The researchers noted that producing such code previously required advanced reverse-engineering expertise. The model compressed that requirement significantly.

Social engineering: AI dramatically reduces the cost and improves the quality of phishing content. A 2023 study by IBM Security found AI-generated phishing emails achieved click rates comparable to human-written ones while costing 96% less to produce. Spear-phishing, historically expensive because it required per-target research and writing, becomes near-free.

Vulnerability discovery: AI-assisted fuzzing and code analysis tools can discover software vulnerabilities faster than human analysts. DARPA's Cyber Grand Challenge (2016) demonstrated autonomous systems finding and patching vulnerabilities in real time. The same capability can find vulnerabilities to exploit rather than patch.

The Stuxnet Precedent

Stuxnet (discovered 2010, attributed to the U.S. and Israel) targeted Iranian nuclear centrifuges using four zero-day exploits β€” unprecedented in a single weapon. It demonstrated that cyber capabilities could produce kinetic physical effects on industrial infrastructure. AI-assisted vulnerability discovery and exploit development makes the technical sophistication required to build Stuxnet-equivalent weapons increasingly accessible, though the intelligence and targeting requirements remain high.

Autonomous Weapons and the Dual-Use Question

Autonomous weapons systems β€” military platforms capable of selecting and engaging targets without human intervention β€” represent a fundamentally different dual-use structure than biology or cyber. Here, the civil-military dual-use is not about the same tool being used for different purposes but about commercial AI capabilities being integrated into weapons platforms.

The commercial drone industry illustrates this concretely. DJI drones, designed for photography and inspection, have been extensively modified by non-state actors in conflicts including Ukraine and the Islamic State's operations in Iraq and Syria. In Ukraine (2022–present), both sides have used commercial FPV (first-person view) racing drones fitted with explosives as precision munitions. The AI-assisted stabilization, obstacle avoidance, and target-tracking capabilities developed for consumer use translate directly into weapons capabilities.

Computer vision systems trained for autonomous vehicle navigation can be repurposed for target identification. Reinforcement learning systems trained in simulation can control autonomous platforms. The DOD's Project Maven, launched in 2017, explicitly used commercial computer vision AI (initially Google TensorFlow) to analyze drone surveillance footage β€” and prompted a Google employee petition and eventual Google withdrawal from the contract, illustrating the governance tension when commercial AI firms encounter weapons applications.

LAWS Lethal Autonomous Weapons Systems β€” platforms capable of selecting and engaging targets without real-time human control.
Polymorphic Malware Malicious code that continuously alters its signature to evade detection, now generatable with AI assistance.
Project Maven DOD initiative using commercial AI for military image analysis, which triggered the first major tech-firm ethical crisis over weapons applications.
Advanced Materials and Nuclear Relevance

AI applications in materials science represent a less-discussed but significant dual-use concern. GNoME (Graph Networks for Materials Exploration), released by Google DeepMind in November 2023, predicted the structures of 2.2 million new stable materials β€” 380,000 of which DeepMind assessed as immediately synthesizable. The research community celebrated this as a breakthrough for battery technology, superconductors, and other clean-energy applications.

The nuclear nonproliferation community noted a different implication: GNoME and similar tools could assist in identifying novel materials useful for weapons applications β€” including nuclear weapon components, radiation shielding, and advanced conventional explosives. The Nuclear Threat Initiative published an analysis in early 2024 noting that AI materials discovery tools were not subject to the same DURC review requirements as life-sciences research, representing a governance gap.

Export controls under the Export Administration Regulations (EAR) and the International Traffic in Arms Regulations (ITAR) address specific controlled materials but were not designed to regulate AI models that discover new materials. The question of whether an AI model that could identify weapons-relevant materials constitutes a controlled technology is legally unresolved.

The Wassenaar Arrangement Gap

The Wassenaar Arrangement on Export Controls for Conventional Arms and Dual-Use Goods and Technologies covers specific software and hardware categories. In 2019, participating states discussed but failed to reach consensus on including certain AI capabilities β€” particularly intrusion software and autonomous control technologies β€” in the control lists. The arrangement's consensus-based amendment process has not kept pace with AI development cycles, leaving significant gaps.

Governance Mechanisms Specific to These Domains

Each domain has domain-specific governance architecture of varying effectiveness. Cybersecurity: The Vulnerabilities Equities Process (VEP) governs U.S. government decisions about whether to disclose discovered vulnerabilities to vendors or retain them for offensive use. AI-discovered vulnerabilities in principle fall within the VEP but the process was not designed for the discovery volumes AI enables. Autonomous weapons: DOD Directive 3000.09 (updated 2023) requires human judgment for lethal force decisions, but "human judgment" is interpreted broadly and the directive has no treaty-level international counterpart. The Campaign to Stop Killer Robots has sought a binding international instrument without success at the UN Group of Governmental Experts level. Materials: The 2023 CHIPS and Science Act included provisions for AI research security but did not specifically address AI-enabled materials discovery.

Lesson 3 Quiz

Cyber, Autonomy, and Materials β€” check your understanding
WormGPT and FraudGPT, documented in 2023, are significant for dual-use AI governance because they demonstrated:
Correct. WormGPT and FraudGPT represented commoditization of offensive AI β€” available for monthly subscription on criminal forums, removing the need for advanced coding expertise to conduct sophisticated cyberattacks.
Incorrect. These were commoditized subscription tools available on criminal forums β€” significant precisely because they lowered the skill threshold for cyber offense, not because of nation-state involvement or autonomous operation.
Google's withdrawal from Project Maven in 2018 illustrated which specific tension in AI dual-use governance?
Correct. Over 3,000 Google employees signed a petition opposing Project Maven, and twelve resigned. Google eventually declined to renew the contract β€” the first major case of commercial AI firm workforce governance affecting national security AI programs.
Incorrect. Project Maven worked technically. The issue was that Google employees objected to their employer's AI being used for military targeting, raising a governance question about whether commercial AI firms can or should set limits on weapons applications of their technology.
DeepMind's GNoME materials discovery tool raised dual-use concerns in the nuclear nonproliferation community because:
Correct. GNoME identified 2.2 million new stable materials. Unlike life-sciences DURC, AI materials discovery tools face no mandatory review process β€” a governance gap NTI flagged in 2024.
Incorrect. GNoME used publicly available crystallography data. The concern was the governance gap: life-sciences DURC has review frameworks; AI materials discovery tools do not, even though they could identify materials with weapons applications.
The use of commercial DJI drones as weapons in conflicts in Iraq, Syria, and Ukraine exemplifies which dual-use dynamic?
Correct. DJI drones were designed for photography. The AI-assisted stabilization, obstacle avoidance, and navigation capabilities built for civilian use translate directly to weapons applications β€” a dual-use conversion that no export control regime effectively anticipated.
Incorrect. DJI drones are civilian products. The dual-use problem is that capabilities like AI stabilization and navigation, designed for photography, were repurposed by end users as munition delivery systems β€” commercial AI becoming weapons without any state involvement.

Lab 3 β€” Cyber and Autonomy Risk Mapping

Discuss non-biological dual-use AI risks with your AI tutor Β· Complete 3 exchanges to finish

Your Task

Explore AI dual-use risks in cybersecurity, autonomous weapons, and materials science. Consider governance gaps β€” the Wassenaar Arrangement lag, DOD Directive 3000.09 limitations, and the absence of materials DURC review.

Suggested starting point: "Why is the dual-use problem for autonomous weapons systems structurally different from the dual-use problem for AI biology tools?"
AI Tutor β€” Cyber, Autonomy & Materials
Module 6 Β· L3
Ready to explore non-biological dual-use AI risks. Whether it's WormGPT, Project Maven, DJI drones in conflict zones, or GNoME materials β€” each domain has a different dual-use structure. Where would you like to start?
Module 6 Β· Lesson 4

Governance Frameworks and Open Questions

How states, institutions, and companies are trying to manage what they built
What governance mechanisms exist for dual-use AI research, how effective are they, and where do the most critical gaps remain?

When the United Kingdom announced the AI Safety Institute (AISI) at the November 2023 Bletchley Park AI Safety Summit, its mandate explicitly included evaluating dual-use capabilities in frontier AI models β€” particularly biosecurity and cybersecurity risks. The UK became the first government to establish a permanent, funded body specifically tasked with pre-deployment evaluation of AI systems for dangerous dual-use outputs. The U.S. AI Safety Institute, announced days later under NIST, obtained similar mandate language. Both institutions immediately faced a structural problem: they had no legal authority to require companies to submit models for evaluation before deployment.

The Pre-Publication Review Model and Its AI Adaptation

The life-sciences community's most mature dual-use governance instrument is the pre-publication review β€” institutional biosafety committee evaluation before sensitive research goes public. The model has documented limitations: it depends on researcher compliance, many countries lack comparable institutions, and the review process was not designed for the speed of AI-generated outputs.

The Responsible Disclosure norm in cybersecurity offers a parallel framework: researchers who discover vulnerabilities notify vendors privately before public disclosure, giving time for patches. AI laboratories have adapted this concept through staged deployment (releasing to trusted testers before general release) and capability thresholds (withholding or restricting specific capabilities identified as high-risk). GPT-4's system card, published in March 2023, is the most detailed public example β€” OpenAI described testing for CBRN (chemical, biological, radiological, nuclear) uplift and implementing mitigations before release.

The Voluntary Commitments β€” White House, July 2023

In July 2023, seven leading AI companies β€” Anthropic, Google, Meta, Microsoft, Amazon, OpenAI, and Inflection β€” made voluntary commitments to the White House including: sharing safety information with governments and each other; investing in cybersecurity and insider threat safeguards; and conducting research on CBRN risk from AI. These commitments were voluntary, unverified, and unenforceable. They represented a political signal and a baseline, not a binding governance framework. Critics noted the absence of timelines, metrics, or enforcement mechanisms.

Export Controls as AI Governance β€” BIS Actions

The Bureau of Industry and Security (BIS) at the Commerce Department has been the most active U.S. regulatory actor on AI dual-use governance. Its October 2022 chip export controls β€” restricting sale of advanced AI chips (NVIDIA A100, H100) and chip-manufacturing equipment to China β€” were explicitly framed as dual-use controls, targeting AI capabilities that could enable advanced weapons development and mass surveillance.

In October 2023, BIS tightened these controls significantly with a new rule that closed workarounds China had exploited, added performance thresholds to capture future chip generations, and extended controls to additional countries. The controls represent perhaps the most consequential dual-use AI governance action to date β€” not because they prevent China from developing AI, but because they significantly slow access to leading-edge training infrastructure.

The model for computing controls differs from traditional dual-use governance in an important way: it targets the means of production rather than the knowledge itself. You cannot embargo the understanding of transformer architectures, but you can restrict access to the chips needed to train competitive models at scale.

AISI AI Safety Institute β€” government bodies in the UK and U.S. tasked with evaluating frontier AI models for dangerous capabilities including dual-use risks.
BIS Bureau of Industry and Security β€” the U.S. agency that administers export controls, including AI chip restrictions targeting China.
CBRN Chemical, Biological, Radiological, Nuclear β€” the four categories of weapons of mass destruction, and the primary focus of AI DURC governance concern.
The Information Hazard Publication Problem

One of the most contested governance questions is whether AI safety research itself constitutes an information hazard. When researchers demonstrate that an AI model can be prompted to provide dangerous bioweapon synthesis guidance, publishing a detailed description of the jailbreak enables adversaries to replicate the approach. When they demonstrate synthesis-screening evasion, they publish a technique for evading screening.

The community has not reached consensus on this. A 2023 Center for Security and Emerging Technology (CSET) analysis identified three camps: those who favor full disclosure on the grounds that adversaries already know these techniques; those who favor full suppression; and those who advocate for coordinated disclosure β€” sharing findings with AI developers, government, and affected industries before or instead of public publication.

The Biological Weapons Convention (BWC) review conferences have twice (2011, 2016) discussed but failed to adopt measures specifically addressing dual-use life-sciences research. The BWC has no verification mechanism and no standing scientific advisory body. AI's intersection with the BWC's scope has been raised in Expert Meetings but has not produced binding guidance.

The Compute Governance Frontier

Compute governance β€” controlling access to the hardware needed to train and run frontier AI systems β€” has emerged as a potentially tractable governance lever. The argument: unlike knowledge (which spreads freely), compute is physical, trackable, and manufactured by a small number of firms in a small number of locations. The current semiconductor supply chain runs through TSMC in Taiwan, with ASML in the Netherlands holding monopoly supply of EUV lithography machines essential for advanced chip production.

Proposals under active policy discussion include: Know Your Customer (KYC) requirements for large cloud computing providers, requiring verification of end-user identity and stated purpose for large training runs; on-chip governance mechanisms that could enable remote attestation of how chips are being used; and international monitoring of large compute clusters analogous to nuclear facility inspection regimes. None of these are currently implemented at scale.

The Institute for AI Policy and Strategy (IAPS) and Georgetown CSET have both published technical analyses of compute governance feasibility, generally concluding that the window for implementing effective compute-based controls is narrowing as chip designs proliferate and domestic manufacturing capacity grows in China.

The Openness Dilemma

Meta's release of the LLaMA model weights (February 2023, July 2023 with LLaMA 2) crystallized an unresolved governance debate. Once model weights are public, no subsequent safety measure, content filter, or access restriction applies β€” anyone with sufficient compute can fine-tune the model to remove safety training. Defenders of open release argue that closed models concentrate power dangerously and that open models enable safety research. Critics argue that releasing weights of capable models is an irreversible action whose dual-use risk cannot be contained. Both arguments have merit, and no international governance norm yet addresses this question.

Lesson 4 Quiz

Governance Frameworks and Open Questions β€” check your understanding
The UK AI Safety Institute established at Bletchley Park in 2023 faced which immediate structural limitation in its dual-use governance mandate?
Correct. Both the UK and U.S. AISIs launched as advisory bodies β€” they can evaluate models companies voluntarily submit, but cannot compel pre-deployment review. This is the central enforcement gap in current AI safety governance.
Incorrect. The structural limitation was legal, not technical. Both AISIs had expert staff but could not compel companies to submit models β€” evaluation was voluntary, making the governance framework advisory rather than binding.
BIS's October 2022 AI chip export controls differed from traditional dual-use knowledge governance by targeting:
Correct. You cannot embargo the understanding of transformer architecture, but you can restrict the chips needed to train at frontier scale. This "means of production" approach is a distinctive feature of compute governance β€” controlling physical infrastructure rather than information flows.
Incorrect. The chip controls target hardware β€” NVIDIA A100/H100 chips and chip-manufacturing equipment. This is a "means of production" approach: restricting the physical infrastructure needed to train frontier AI rather than trying to control the freely spreading knowledge of how AI works.
Meta's public release of LLaMA model weights is cited as a dual-use governance challenge primarily because:
Correct. Weight release is irreversible. Content filters, access restrictions, and safety alignment all become ineffective for anyone who downloads the weights β€” they can fine-tune the model with any objective, including removing safety training entirely. This makes weight release a categorically different governance question from API access controls.
Incorrect. The governance problem is about irreversibility. Once LLaMA weights were public, no safety measure applied to anyone who downloaded them β€” fine-tuning can remove safety training. The debate is about whether releasing capable model weights is a responsible action given that dual-use safeguards become unenforceable post-release.
The July 2023 White House voluntary AI commitments from seven major AI companies were criticized primarily on which grounds?
Correct. The voluntary commitments were a political signal β€” valuable for norm-setting but not governance. No verification mechanism existed, no timelines were specified, and no consequences attached to non-compliance. Critics called it a "promise" rather than a policy.
Incorrect. The core criticism was about enforceability. The commitments were voluntary β€” companies could reinterpret or abandon them without consequence. Effective governance requires verification, metrics, and enforcement, none of which the July 2023 commitments included.

Lab 4 β€” Governance Design Challenge

Discuss AI dual-use governance frameworks with your AI tutor Β· Complete 3 exchanges to finish

Your Task

Engage with the governance frameworks and gaps covered in Lesson 4. Evaluate mechanisms like the UK/US AISIs, BIS chip controls, voluntary commitments, and compute governance proposals against the real dual-use risks identified in earlier lessons.

Suggested starting point: "Given that LLaMA model weights are already publicly available, what governance mechanisms β€” if any β€” could still meaningfully reduce dual-use risk from open-weight models?"
AI Tutor β€” Dual-Use Governance
Module 6 Β· L4
Welcome to the governance design lab. We've covered the risks β€” now let's stress-test the responses. The AISI enforcement gap, the LLaMA weight-release problem, compute governance proposals β€” there are no clean solutions here. Where would you like to probe?

Module 6 Test β€” Dual-Use Research

15 questions Β· 80% required to pass
1. The core structural asymmetry in the dual-use problem is that:
Correct.
The core asymmetry: legitimate good-faith research creates knowledge that can be misused. The problem is not bad-faith research β€” it's that good research has dangerous applications.
2. Fritz Haber's career is cited in DURC discussions because he:
Correct.
Haber co-created Haber-Bosch (feeding billions) and directed the Ypres chlorine gas attack β€” the same chemistry serving radically opposite purposes.
3. The 2011–12 H5N1 controversy involved the NSABB recommending:
Correct.
NSABB recommended redacting the methods sections β€” not prosecution or classification. Both papers were ultimately published largely intact after extensive international debate.
4. AlphaFold 2's public release raised biosecurity concerns despite being a protein structure prediction tool because:
Correct.
The same tool that helps design vaccines can help engineer dangerous proteins. AlphaFold doesn't synthesize anything β€” the concern is that it makes the design step dramatically easier for beneficial and harmful applications equally.
5. In the Collaborations Pharmaceuticals MegaSyn experiment, the dual-use risk was created by:
Correct.
Same model, same data β€” only the optimization target changed. The model generated ~40,000 high-toxicity candidates in six hours using standard drug-discovery infrastructure.
6. AI-assisted evasion of nucleic acid synthesis screening works by:
Correct.
BLAST looks for known dangerous sequences. AI can design novel sequences with the same dangerous function but different enough composition to evade those matches β€” demonstrated in controlled MIT research.
7. Executive Order 14110 (October 2023) addressed AI bioweapons risk by:
Correct.
EO 14110 targeted the synthesis supply chain β€” the first federal policy specifically addressing the AI-to-physical-synthesis pathway by requiring screening standards for DNA synthesis orders.
8. WormGPT and FraudGPT (2023) were significant because they demonstrated:
Correct.
WormGPT/FraudGPT were commoditized subscription tools β€” not frontier lab products, not autonomous agents. Their significance was the democratization of offensive AI capability to low-skill actors.
9. Google's withdrawal from Project Maven in 2018 illustrated which governance tension?
Correct.
Over 3,000 employees petitioned; twelve resigned. Google declined to renew. The issue was about values and consent β€” whether commercial AI employees and firms should have a voice in weapons applications of their tools.
10. The NIH P3CO (Potential Pandemic Pathogen Care and Oversight) framework governs:
Correct.
P3CO governs gain-of-function research on potential pandemic pathogens β€” experiments that enhance transmissibility, virulence, or immune evasion. AI now makes the computational phase of such research faster and cheaper, straining the review framework.
11. DOD Directive 3000.09's limitation as an autonomous weapons governance mechanism is that:
Correct.
DOD 3000.09 only binds the U.S., "human judgment" is loosely defined, and there is no binding international instrument on LAWS β€” the UN GGE process has not produced a treaty.
12. BIS's October 2022 AI chip export controls represented a novel governance approach because they targeted:
Correct.
You can't embargo transformer architecture knowledge β€” it's freely published. But you can restrict NVIDIA A100/H100 chips and chip-manufacturing equipment. This hardware-focused approach is the distinctive feature of compute governance.
13. The "openness dilemma" posed by LLaMA weight release centers on:
Correct.
The core issue is irreversibility and the unenforcability of downstream safeguards. Weight release means anyone with compute can fine-tune away safety alignment β€” content filters and access restrictions become meaningless post-release.
14. The July 2023 White House voluntary AI commitments were principally criticized for lacking:
Correct.
The commitments were unverified, unenforceable, and unaccompanied by timelines or metrics. Critics characterized them as political signaling β€” a "promise" rather than a governance mechanism.
15. Compute governance proposals like KYC requirements for cloud providers and on-chip governance mechanisms share which theoretical advantage over knowledge-based DURC governance?
Correct.
The theoretical advantage of compute governance is physical tractability β€” chips are manufactured at a handful of fabs, shipped through trackable supply chains, and concentrated in visible data centers. Knowledge governance tries to contain something that spreads freely; compute governance targets something physical and scarce.