L1
Β·
Quiz
Β·
Lab
L2
Β·
Quiz
Β·
Lab
L3
Β·
Quiz
Β·
Lab
L4
Β·
Quiz
Β·
Lab
Module Test
Module 7 Β· Lesson 1

Accelerationists vs. Decelerationists

The foundational fracture: those who want to push forward and those who want to slow down
Why do brilliant people who agree AI is powerful disagree so sharply about what to do next?

When Marc Andreessen published his "Techno-Optimist Manifesto" in October 2023, he did not mince words. He called those urging AI caution "decelerationists" β€” a pejorative β€” and described them as enemies of human progress. The document was read more than a million times in its first week. The AI field had a new fault line, and it ran right through Silicon Valley.

The Accelerationist Position

The effective accelerationist (e/acc) movement emerged around 2022–2023 as a philosophical counterweight to AI safety culture. Its proponents argue that AI development should proceed as fast as possible because the technology's benefits β€” curing disease, eliminating poverty, expanding human capability β€” vastly outweigh speculative risks. Slowing down, they contend, is itself a moral hazard: every year of delay costs lives that better medicine or agriculture could have saved.

The movement draws intellectual lineage from Nick Land's accelerationist philosophy and economic growth theory. Among its visible advocates are Andreessen Horowitz partner Marc Andreessen, who framed restraint as anti-human, and anonymous X users with handles like @bayeslord who built followings of tens of thousands arguing that "slowdown = death."

The position is not fringe. Yann LeCun, Meta's chief AI scientist, has repeatedly said he believes the AGI-catastrophe scenarios are "preposterous" and that focusing on them crowds out real, present harms from bias and misinformation. His view is institutional: Meta does not do pre-deployment safety evaluations on the scale Anthropic or DeepMind do.

Documented Claim

In a January 2024 interview at Davos, Yann LeCun stated: "The question of whether AI will become more intelligent than humans and take over the world is just not on the agenda for the next decade." He argued alignment concern was a distraction engineered by companies seeking regulatory moats.

The Decelerationist Position

On the opposing side, figures like Yoshua Bengio β€” one of the three "Godfathers of Deep Learning" β€” have moved dramatically toward the cautionary camp. Bengio signed the May 2023 Center for AI Safety statement warning that AI extinction risk should be treated as a global priority comparable to pandemics and nuclear war. He described this shift as a "personal and scientific evolution."

The Pause AI movement, formally organized in early 2023, advocates for a moratorium on training runs more powerful than GPT-4 until safety can be assured. It held protests outside OpenAI's San Francisco offices and the UK AI Safety Summit at Bletchley Park in November 2023. While the movement has not achieved regulatory success, it shaped the framing of multiple government hearings in the US, EU, and UK.

The March 2023 Future of Life Institute open letter calling for a six-month pause on "giant AI experiments" was signed by over 33,000 people including Elon Musk, Apple co-founder Steve Wozniak, and historian Yuval Noah Harari. Notably, few leading AI researchers at major labs signed β€” a revealing absence.

The Asymmetry Problem

Both sides invoke asymmetric risk but in opposite directions. Accelerationists say: the risk of going too slow is certain harm from unsolved problems. Decelerationists say: the risk of going too fast is potentially irreversible catastrophe. Neither side has empirical resolution β€” which is precisely why the debate persists.

Key Terms

e/accEffective Accelerationism β€” the view that rapid AI development is morally obligatory and that safety concern is a form of Luddism or regulatory capture.
Pause AIA formal advocacy movement calling for a moratorium on frontier AI training until safety governance frameworks are in place.
Regulatory moatThe accusation that established AI labs support regulation primarily to prevent new competitors from catching up.

The fracture is not simply philosophical. It has organizational consequences. Companies that align with the accelerationist view β€” Meta, for instance β€” release model weights openly and do not submit to voluntary pre-deployment evaluations. Companies in the cautionary camp β€” Anthropic, DeepMind β€” invest heavily in interpretability and submit to government safety testing. The debate shapes hiring, publishing norms, and investment decisions in real time.

Lesson 1 Quiz

Accelerationists vs. Decelerationists
What term did Marc Andreessen use in his 2023 manifesto to describe those urging AI caution?
Correct. Andreessen's "Techno-Optimist Manifesto" used "decelerationists" as a pejorative for those who advocate slowing AI development.
Not quite. The specific term Andreessen deployed β€” which became a rallying label β€” was "decelerationists."
Which of the following best describes Yann LeCun's publicly stated position on AGI catastrophe risk?
Correct. LeCun has consistently argued that existential risk from AI is not a credible near-term concern and that focusing on it displaces attention from real present harms.
Incorrect. LeCun has been one of the most prominent public critics of AI doom narratives, calling them preposterous at Davos in 2024.
How many people signed the March 2023 Future of Life Institute letter calling for a pause on giant AI experiments?
Correct. The letter accumulated over 33,000 signatures including Musk, Wozniak, and Harari β€” though notably few active AI lab researchers signed.
The actual number was over 33,000. The notable absence of leading lab researchers was widely discussed.
What does the "regulatory moat" accusation claim about AI safety proponents at major labs?
Correct. The "regulatory moat" argument holds that safety regulation, if designed by incumbents, can function as a barrier to entry that benefits established labs competitively.
The regulatory moat argument is specifically about incumbents using safety rules as competitive barriers β€” not about funding or infrastructure monopolies.

Lab 1 β€” Debate Mapping

Stress-test the accelerationist and decelerationist positions with an AI interlocutor

Your Task

You're going to pressure-test both sides of the acceleration debate. Pick one of the prompts below and engage the assistant β€” it will play devil's advocate, push you to find weaknesses, and help you build a more nuanced position.

Try: "Make the strongest possible case for the e/acc position" β€” or β€” "Where is the Pause AI movement's argument weakest?"
Debate Lab
Alignment Β· M7
Welcome to the Debate Mapping lab. I can argue either side of the acceleration debate, identify logical weaknesses in both positions, and help you map where the real empirical disagreements lie. What would you like to explore?
Module 7 Β· Lesson 2

Safety Researchers vs. Capability Researchers

The internal divisions inside AI laboratories β€” between those building the systems and those trying to constrain them
When the people building the most powerful AI also run its safety teams, can the safety function ever be truly independent?

On November 17, 2023, OpenAI's board fired CEO Sam Altman, citing a loss of candor. Within 48 hours, nearly all of the company's 770 employees had signed a letter threatening to resign unless Altman was reinstated. He returned five days later. Three of the four board members who voted to remove him left the board. The episode revealed a company structurally unable to enforce safety governance against its own commercial momentum.

The Structural Tension

Every major AI lab officially values safety. The tension is not between pro-safety and anti-safety people β€” it is between different timelines, risk tolerances, and interpretations of what "safety" demands in practice. Capability researchers typically believe the path to safe AI runs through better AI: more capable systems are easier to align because they can understand nuanced instructions. Safety researchers often believe capability advances outpace alignment advances, creating widening gaps.

This tension became visible at Google DeepMind in 2023 when a significant number of senior researchers who had worked on safety-oriented projects left to found or join competing organizations. Paul Christiano, who led OpenAI's alignment team, departed in 2021 to found the Alignment Research Center. Jan Leike, who ran OpenAI's superalignment team, resigned in May 2024 with a public statement saying: "Safety culture and processes have taken a backseat to shiny products."

Jan Leike's Resignation Statement β€” May 2024

"I have been disagreeing with OpenAI leadership about the company's core priorities for quite some time, until we finally reached a breaking point. Over the past few months my team has been sailing against the wind. Building smarter-than-human machines is an inherently dangerous endeavor. OpenAI is shouldering an enormous responsibility on behalf of all of humanity."

Key Figures and Their Positions

The landscape of who stands where has shifted rapidly:

JL
Jan Leike
Former VP of Alignment, OpenAI β†’ Anthropic
Resigned publicly in May 2024 citing safety culture erosion. Joined Anthropic shortly after. Represents the view that capability progress is outrunning safety progress at frontier labs.
PC
Paul Christiano
Former OpenAI Alignment Lead β†’ Alignment Research Center
Left OpenAI in 2021 to found ARC. Developed the eliciting latent knowledge research agenda. Believes interpretability is necessary for credible safety assurance.
IS
Ilya Sutskever
Co-founder & former Chief Scientist, OpenAI
Voted to remove Altman in November 2023. Later founded Safe Superintelligence Inc. (SSI) in June 2024 with the stated goal of building superintelligence safely without commercial product pressure.
SA
Sam Altman
CEO, OpenAI
Argues that safety and capability advance together, and that OpenAI's mission requires commercial success to fund the research needed to make AI safe. Represents the view that slowdown helps no one.

The Superalignment Bet

In July 2023, OpenAI announced its Superalignment initiative: a commitment to dedicate 20% of its compute to solving the alignment of superintelligent AI within four years. It was a bold framing β€” and a controversial one. Critics noted that 20% of compute at a company racing toward superintelligence might be insufficient, and that the timeline was optimistic to the point of irresponsibility. When the Superalignment team's co-leads β€” Sutskever and Leike β€” both departed within a year, the initiative's credibility collapsed publicly.

The episode illustrates a systemic problem: safety programs at commercial labs are structurally subordinate to product timelines. When commercial pressure intensifies, safety teams can be deprioritized without formal announcement β€” simply by resourcing and attention. Tracking this requires watching departures, not press releases.

Structural Insight

The real indicator of a lab's safety commitment may not be its public statements but its organizational structure: Is safety leadership empowered to delay or halt a deployment? Do safety researchers have equity incentives aligned with the commercial product? Are safety findings published promptly even when unflattering?

Lesson 2 Quiz

Safety Researchers vs. Capability Researchers
What was the key claim in Jan Leike's May 2024 resignation statement from OpenAI?
Correct. Leike's statement said safety culture and processes had "taken a backseat to shiny products" β€” a direct indictment of OpenAI's organizational priorities.
Leike's resignation statement specifically said safety culture had "taken a backseat to shiny products" β€” an organizational critique, not a legal or board one.
What did Ilya Sutskever found after leaving OpenAI in 2024?
Correct. Sutskever founded Safe Superintelligence Inc. (SSI) in June 2024, explicitly structured to pursue superintelligence without commercial product pressure.
Sutskever founded Safe Superintelligence Inc. (SSI). The Alignment Research Center was founded by Paul Christiano.
OpenAI's Superalignment initiative announced in July 2023 committed what percentage of compute to alignment research?
Correct. OpenAI pledged 20% of its compute to the Superalignment initiative β€” a commitment whose credibility came under scrutiny after both co-leads departed within a year.
The announced figure was 20% of compute. Critics questioned whether this was sufficient given the pace of capability development.
According to the lesson, what is a more reliable indicator of a lab's real safety commitment than its public statements?
Correct. The lesson emphasizes organizational structure over headcount or publications β€” specifically whether safety has genuine authority to delay commercial deployments.
The lesson points to organizational structure and genuine authority as the key indicator β€” not team size or publication counts, which can be gamed.

Lab 2 β€” Organizational Autopsy

Analyze the structural incentives that shape safety culture at AI labs

Your Task

You've read about the OpenAI board crisis and the Superalignment collapse. Now dig into the structural reasons. The assistant can help you think through why safety functions fail inside commercial organizations β€” and what structures might actually work.

Try: "Why did the OpenAI board's attempt to enforce safety governance fail so completely?" β€” or β€” "What organizational structure would give safety researchers genuine authority?"
Org Structure Lab
Alignment Β· M7
Ready to analyze the organizational dynamics of AI safety governance. The OpenAI board crisis is a rich case study β€” but we can also look at how DeepMind, Anthropic, and others structure the relationship between safety and capability teams. What would you like to examine?
Module 7 Β· Lesson 3

Open Source vs. Closed Development

The most consequential infrastructure decision in AI policy: who gets access to the weights?
If releasing model weights enables both beneficial innovation and dangerous misuse, how should that tradeoff be decided β€” and by whom?

When Meta released the weights of Llama 2 in July 2023 β€” available for commercial use β€” it was simultaneously celebrated as a democratization of AI and condemned as reckless. Within weeks, fine-tuned versions optimized for generating malware instructions, removing content filters, and producing non-consensual intimate imagery had appeared on Hugging Face. Meta had made a calculation: the benefits of open access outweigh the risks. Not everyone agreed.

The Case for Open Weights

Meta, together with a coalition of researchers and civil liberties organizations, has argued that open-source AI is structurally safer than closed AI for several reasons. First, it enables independent security auditing β€” thousands of researchers can examine a model for vulnerabilities rather than trusting a single company's red team. Second, it prevents monopolization: if only a handful of companies control AI access, they also control what AI can say and do, with no democratic accountability.

The AI Safety Levels debate played out practically when the Biden administration's executive order on AI (October 2023) required companies training models above a compute threshold to report safety results to the government β€” but this threshold exempted most open-source releases. Meta argued this was appropriate; critics argued it was a gap large enough to drive catastrophe through.

Stability AI, which released Stable Diffusion in 2022, was an early test case. Stable Diffusion's open weights were immediately used to generate non-consensual deepfakes of real people. CEO Emad Mostaque initially defended open release on freedom-of-information grounds before the company shifted to a more cautious stance. Mostaque resigned in March 2024 amid governance turmoil.

Pro-Open Weights
The Democratization Argument

Open weights enable competitive balance, independent auditing, academic research, and deployment in low-resource settings where API pricing is prohibitive. Concentration of AI in closed models is itself a safety risk.

Pro-Closed Development
The Containment Argument

Releasing weights is irreversible. Fine-tuning can remove any safety guardrails the original developer installed. Closed APIs allow the developer to monitor, patch, and update models in response to emerging misuse β€” open weights cannot be recalled.

The Responsible Scaling Question

The open/closed debate intersects with responsible scaling policies (RSPs) β€” formal commitments by labs to evaluate models at capability thresholds before proceeding. Anthropic published its RSP in September 2023, defining conditions under which it would pause development. OpenAI and Google DeepMind followed with similar frameworks. Crucially, all three apply to closed models. There is no equivalent mechanism for open-weight releases, where the developer loses control of deployment the moment the weights are published.

The RAND Corporation's 2024 report on open-source AI risk argued that frontier open-weight models with demonstrated chemical weapons synthesis uplift represented a qualitatively different risk category than previous open-source releases. The report recommended compute-based thresholds for mandatory pre-release evaluation β€” a position that Meta and many academic researchers rejected as regulatory overreach.

The Irreversibility Problem

Software vulnerabilities can be patched. Model weights, once public, cannot be un-released. This asymmetry fundamentally changes the risk calculus: a mistake in releasing closed model access can be corrected; a mistake in releasing weights is permanent. Critics of open-weight release argue this asymmetry demands higher caution, not lower.

Timeline of Key Events

Aug 2022
Stable Diffusion released β€” first major open-weight image model. Within days, deepfake misuse documented. Sets the template for open-weight debates.
Feb 2023
Meta releases LLaMA 1 (research only). Weights leak to 4chan within a week, enabling uncensored fine-tunes.
Jul 2023
Llama 2 released commercially. Safety-removed fine-tunes appear within weeks. Debate intensifies in policy circles.
Oct 2023
Biden AI Executive Order sets compute threshold for safety reporting β€” excludes most open-weight models below the threshold.
Apr 2024
Meta releases Llama 3. EU AI Act's treatment of open-weight models is contested; Meta lobbies for exemptions. Final Act text offers limited open-weight concessions.

Lesson 3 Quiz

Open Source vs. Closed Development
What happened within weeks of Meta releasing Llama 2's weights in July 2023?
Correct. Fine-tuned versions optimized for malware instructions, filter removal, and NCII generation appeared on Hugging Face within weeks of the release.
The documented outcome was rapid appearance of safety-removed fine-tunes on platforms like Hugging Face β€” illustrating the irreversibility problem.
What is a "Responsible Scaling Policy" (RSP)?
Correct. RSPs are self-imposed governance frameworks by labs β€” Anthropic pioneered the format in September 2023 β€” defining capability thresholds at which development pauses for evaluation.
RSPs are voluntary, lab-developed policies β€” not government regulations or treaties. Anthropic published the first formal RSP in September 2023.
According to the lesson, what is a key structural argument against open-weight AI releases compared to closed API access?
Correct. The irreversibility asymmetry is the central structural argument: closed API mistakes can be patched; weight releases cannot be un-done.
The core argument is irreversibility β€” once weights are public, any misuse cannot be stopped by the original developer, unlike a closed API where access can be revoked.
What did the RAND Corporation's 2024 report on open-source AI risk specifically recommend?
Correct. RAND recommended compute thresholds triggering mandatory pre-release safety evaluation β€” a targeted intervention rather than a blanket ban.
RAND's recommendation was targeted: compute-based thresholds for mandatory pre-release evaluation, not a blanket ban on open source.

Lab 3 β€” Policy Design Thinking

Draft governance frameworks for open-weight AI releases

Your Task

The open vs. closed debate requires practical policy answers, not just philosophical positions. In this lab, you'll work through what a workable governance framework for open-weight releases might look like β€” and where the hard tradeoffs lie.

Try: "Design a compute threshold policy for open-weight releases that balances safety with research access" β€” or β€” "What are the enforcement problems with any open-weight governance framework?"
Policy Design Lab
Alignment Β· M7
Welcome to the Policy Design lab. We'll work through the concrete governance challenges of open-weight AI β€” compute thresholds, enforcement mechanisms, international coordination, and the tradeoffs between safety and democratization. What aspect would you like to tackle first?
Module 7 Β· Lesson 4

The Race Dynamics Problem

Why competitive pressure between labs and nations may be the deepest obstacle to AI safety
If every actor knows that building powerful AI recklessly is dangerous, but each fears that slowing down hands the advantage to someone less careful β€” what happens?

In his 2023 congressional testimony, Sam Altman described the competitive landscape with unusual candor: "If we don't do this, someone else will β€” and they may not be as careful." The logic is a classic multi-player prisoner's dilemma. Each actor would prefer a world where everyone moved cautiously. But each also prefers being ahead in a world where others don't. The result is mutual acceleration despite shared misgivings.

The US-China Dimension

The race dynamic operates at the geopolitical level most visibly between the United States and China. US export controls on advanced semiconductors β€” the October 2022 export control rules and their November 2023 expansion β€” were explicitly designed to slow Chinese AI development by restricting access to NVIDIA H100 GPUs and the manufacturing equipment needed to produce competing chips. The controls were the most sweeping US technology restrictions since the Cold War.

The policy rationale was prevention of military AI applications β€” particularly autonomous weapons and mass-surveillance tools β€” not purely commercial competition. But the mechanism creates a paradox: by framing AI as a strategic competition, the US government simultaneously accelerates domestic AI investment (to stay ahead) while arguing internationally that AI safety is a shared global concern.

China's response has included accelerated domestic chip development (notably by Huawei's Ascend line), large state investments in AI through the New Generation AI Development Plan, and the development of Baidu's ERNIE Bot and Alibaba's Tongyi Qianwen as GPT competitors. China has also published its own AI governance frameworks β€” including the Generative AI Service Management Provisions (August 2023) β€” though enforcement mechanisms differ significantly from Western approaches.

The Export Control Paradox

US export controls on AI chips aim to slow Chinese military AI. But they also signal to US companies and investors that AI is a strategic national priority β€” which accelerates domestic development and the race overall. The same policy that slows one competitor may speed the broader race.

The Inter-Lab Race

The race dynamic between commercial labs is equally documented. Internal communications from the GPT-4 development period, reported by The New York Times in 2023, showed OpenAI engineers raising safety concerns about the model's tendency toward sycophancy and potential for deception β€” and being told deployment would proceed on schedule because Google's Bard launch was imminent. The specific claim about the timeline pressure was disputed by OpenAI, but the structural dynamic was not.

The Gemini launch in December 2023 was widely reported as rushed to close the gap with GPT-4. Initial benchmarks were later found to have been presented in a misleading format. Google DeepMind apologized and rereleased the benchmarks. The episode illustrated that competitive pressure can affect not just deployment timelines but how safety and capability results are communicated publicly.

The response to ChatGPT's November 2022 launch is itself a case study in race dynamics. Google declared a "code red," Microsoft accelerated its OpenAI investment, and virtually every large technology company reallocated resources toward LLM development within months. The pace was driven not by new safety or capability findings but by competitive fear of being left behind.

The Coordination Problem

Race dynamics are coordination failures. The known solution is binding agreements β€” treaties, enforceable standards, mutual verification. The international community has done this with nuclear weapons (imperfectly) and biological weapons (more imperfectly). Whether AI is analogous β€” and whether coordination at this speed and scale is possible β€” is the central open question in AI governance.

Proposed Coordination Mechanisms

Several proposals for addressing race dynamics have been advanced:

The UK AI Safety Institute (launched November 2023 at Bletchley) and its US counterpart (launched February 2024) represent attempts to build internationally shared evaluation infrastructure β€” a neutral body that can test models before deployment. The value is that safety evaluations from a government body carry legitimacy that self-reported evaluations do not.

Frontier model forums β€” convened by the labs themselves (Anthropic, Google, Microsoft/OpenAI, Meta) β€” attempt self-regulatory coordination. Critics note that self-regulatory bodies have historically been captured by the interests they regulate, citing financial services and social media as cautionary analogues.

The most structurally binding proposal β€” a compute governance regime β€” would require any training run above a certain FLOP threshold to register with an international body and submit to pre-deployment evaluation. This would require participating nations to share intelligence about compute infrastructure, which faces significant sovereignty and intelligence concerns.

The race dynamics problem may be the hardest alignment problem of all β€” not a technical challenge but a collective action failure among rational actors who all, in some sense, know better. Understanding the debate in the field means recognizing that disagreements about alignment are often disagreements about this deeper coordination problem: not whether safety matters, but whether any individual actor can afford to prioritize it unilaterally.

Lesson 4 Quiz

The Race Dynamics Problem
What game theory concept best describes the race dynamics problem among AI labs?
Correct. The race dynamic is a classic prisoner's dilemma: each actor prefers mutual caution but individually prefers being ahead, leading to mutual acceleration despite shared misgivings.
The lesson frames this explicitly as a multi-player prisoner's dilemma β€” each actor knows mutual caution is better, but each individually prefers being ahead.
What was the stated purpose of the US October 2022 export controls on advanced semiconductors?
Correct. The export controls were explicitly framed around preventing military AI applications β€” though the broader effect on commercial competition was also significant.
The stated rationale was preventing military AI applications β€” autonomous weapons and mass surveillance β€” not commercial protection or energy concerns.
What happened when Google's Gemini benchmarks were released in December 2023?
Correct. The Gemini benchmark controversy β€” where presentation format inflated apparent performance β€” was attributed by observers to competitive pressure affecting how results were communicated.
The documented outcome was Google apologizing and rereleasing benchmarks after the presentation format was found to be misleading β€” a case study in competitive pressure affecting communications.
What is a "compute governance regime" as proposed by AI governance researchers?
Correct. Compute governance would use training compute (measured in FLOPs) as a trigger threshold for mandatory international registration and evaluation β€” analogous to nuclear material accounting.
A compute governance regime would use FLOP thresholds to trigger mandatory international registration and pre-deployment evaluation of frontier training runs.

Lab 4 β€” Race Dynamics Simulation

Think through coordination mechanisms for the AI race problem

Your Task

Race dynamics are a coordination failure. In this lab, you'll stress-test proposed solutions β€” from compute governance to international treaties to self-regulatory forums. The assistant will push back on your proposals and help identify the hardest implementation challenges.

Try: "Why did nuclear arms control succeed (partially) while AI coordination seems harder?" β€” or β€” "Design a compute governance regime and explain its biggest weakness."
Race Dynamics Lab
Alignment Β· M7
Ready to explore AI race dynamics and coordination mechanisms. We can analyze the nuclear analogy, stress-test compute governance proposals, examine why self-regulatory forums tend to fail, or explore what a binding international AI agreement would require. Where would you like to start?

Module 7 Test

The Debate in the Field β€” 15 questions Β· Pass at 80%
1. The "e/acc" movement stands for what?
Correct. e/acc stands for Effective Accelerationism β€” the view that rapid AI development is morally obligatory.
e/acc stands for Effective Accelerationism β€” the philosophical counterweight to AI safety culture that emerged around 2022–2023.
2. Which AI figure published the "Techno-Optimist Manifesto" in October 2023?
Correct. Andreessen Horowitz partner Marc Andreessen published the manifesto, which was read over a million times in its first week.
The Techno-Optimist Manifesto was published by Marc Andreessen of Andreessen Horowitz in October 2023.
3. Yoshua Bengio's shift toward the cautionary camp was marked by what action in May 2023?
Correct. Bengio signed the CAIS statement warning that AI extinction risk should be treated as a global priority comparable to pandemics and nuclear weapons.
Bengio's key public shift was signing the CAIS statement on AI extinction risk in May 2023, which he described as a personal and scientific evolution.
4. What event in November 2023 revealed OpenAI's structural difficulty enforcing safety governance against commercial momentum?
Correct. The board fired Altman on November 17, but 770 employees threatened to resign, forcing his reinstatement within five days and the removal of the board members who voted against him.
The November 2023 crisis was the Altman firing and reinstatement β€” which showed that a safety-focused board could not enforce governance against employee and commercial pressure.
5. Paul Christiano left OpenAI to found which organization?
Correct. Christiano founded the Alignment Research Center (ARC) in 2021, developing the eliciting latent knowledge research agenda.
Christiano founded the Alignment Research Center. Safe Superintelligence Inc. was founded by Ilya Sutskever.
6. What was Stable Diffusion's release in August 2022 used for within days, illustrating open-weight risks?
Correct. Stable Diffusion's open weights were almost immediately used to generate non-consensual deepfakes, making it an early and documented case study in open-weight misuse.
The documented early misuse of Stable Diffusion was generating non-consensual intimate imagery (deepfakes) of real people.
7. What happened to Meta's original LLaMA 1 weights released for research in February 2023?
Correct. LLaMA 1 leaked to 4chan within a week of its research-only release, producing uncensored fine-tunes β€” demonstrating that "research-only" restrictions on weights are practically unenforceable.
LLaMA 1 leaked to 4chan within a week, enabling uncensored fine-tunes β€” a key illustration that research-only weight releases cannot be controlled once distributed.
8. Which organization published the first formal Responsible Scaling Policy (RSP) in September 2023?
Correct. Anthropic pioneered the RSP format in September 2023, defining capability thresholds at which development would pause for evaluation.
Anthropic published the first formal RSP in September 2023. OpenAI and Google DeepMind published similar frameworks afterward.
9. The US export controls on AI chips enacted in October 2022 specifically targeted access to which product?
Correct. The export controls restricted access to NVIDIA H100 GPUs and the semiconductor manufacturing equipment needed to produce competing chips.
The export controls focused on NVIDIA H100 GPUs β€” the primary compute resource for frontier AI training β€” along with manufacturing equipment.
10. What response did ChatGPT's November 2022 launch trigger at Google?
Correct. Google declared a "code red" in response to ChatGPT, and the company reallocated significant resources toward LLM development β€” a documented instance of competitive fear driving acceleration.
Google's response to ChatGPT was an internal "code red" β€” a clear example of competitive dynamics driving AI acceleration independent of capability or safety findings.
11. The "regulatory moat" argument is primarily associated with which critique of AI safety advocacy?
Correct. The regulatory moat argument is that incumbent labs have competitive incentives to support regulation they helped design β€” making entry by new competitors harder.
The regulatory moat critique is specifically about incumbents using safety regulations as competitive barriers, not about regulatory quality or jurisdiction shopping.
12. Ilya Sutskever's Safe Superintelligence Inc. (SSI) was explicitly structured to do what differently from OpenAI?
Correct. SSI's stated rationale was removing commercial product pressure from the safety-focused pursuit of superintelligence β€” a direct critique of OpenAI's structure.
SSI's founding premise was removing commercial product pressure β€” the explicit lesson Sutskever drew from the OpenAI experience.
13. The UK AI Safety Institute was launched at what event in November 2023?
Correct. The UK AI Safety Institute was launched at the Bletchley Park AI Safety Summit in November 2023 β€” the first major international government convening on frontier AI risks.
The AI Safety Institute launched at Bletchley Park β€” where the UK hosted the first AI Safety Summit and where Pause AI activists also held protests.
14. China's response to US AI export controls included accelerated development of which domestic AI chip line?
Correct. Huawei's Ascend AI chip line became a focal point of China's response to US semiconductor export controls.
Huawei's Ascend chip line was the primary domestic semiconductor response to US export controls targeting Chinese AI development.
15. According to the module, what is the deepest reason race dynamics may be the hardest alignment problem of all?
Correct. The module concludes that race dynamics are not primarily a technical or informational failure but a coordination failure β€” where every individual actor has incentives that undermine the collectively preferred outcome.
The lesson's conclusion is that race dynamics are a collective action failure β€” rational actors who understand the problem cannot solve it unilaterally, which is what makes it so hard.