GPT vs. Claude vs. Gemini · Module 4 · Lesson 1

Gemini's Origins: From LaMDA to the Multimodal Era

How Google transformed its conversational research into a competitive AI product — and why the journey was bumpier than the demos suggested.

On February 8, 2023, Google CEO Sundar Pichai unveiled Bard in a rushed blog post. The announcement came one day before Microsoft's Bing-ChatGPT event — and within hours, a promotional GIF showed Bard confidently answering a question about the James Webb Space Telescope with a factual error. Alphabet's stock dropped roughly $100 billion in market capitalization in a single day. The moment crystallized a central tension in Google's AI story: an organization with unmatched research depth, chronically anxious about disrupting its own search business.

That tension runs all the way back to 2017, when Google researchers authored the original Attention Is All You Need paper that introduced the Transformer architecture underlying every modern large language model — including the very models now threatening Google's core product.

From LaMDA to Bard to Gemini

Google's conversational AI lineage begins with LaMDA (Language Model for Dialogue Applications), first revealed at Google I/O 2021. LaMDA was trained specifically on dialogue and was notable for generating open-ended, naturalistic conversation. It became the subject of public controversy in June 2022 when engineer Blake Lemoine published internal transcripts and claimed the model exhibited signs of sentience — an assertion Google and virtually all AI researchers rejected.

LaMDA's successor, PaLM (Pathways Language Model), was announced in April 2022. PaLM was Google's first model trained on the Pathways system, allowing it to scale to 540 billion parameters across thousands of TPU chips simultaneously. PaLM demonstrated strong chain-of-thought reasoning and was followed by PaLM 2 in May 2023, which underpinned the early Bard product.

The pivot to the Gemini brand came in December 2023. Gemini 1.0 was positioned as Google DeepMind's flagship model family, merging the talent of Google Brain and DeepMind under a single organizational roof — a consolidation Pichai announced in April 2023. Gemini was architected as natively multimodal from the ground up, meaning it was trained on text, images, audio, video, and code simultaneously rather than having vision grafted on after the fact.

The Gemini 1.0 Demo Controversy

The December 2023 Gemini launch included a viral six-minute demo video showing the model responding fluidly to live voice and sketched drawings in real time. The footage was striking. It was also misleading. Within days of publication, Google acknowledged in a fine-print blog post that the demo had been heavily edited: responses were not generated in real time, prompts were still images rather than live video, and narration was added in post-production.

This episode echoed the Bard Webb telescope error almost exactly. The pattern — impressive underlying capability undermined by overpromising communications — would become a recurring theme in Gemini's early public narrative and prompted significant internal discussion about how Google's AI products should be previewed publicly.

Key Distinction

When evaluating Gemini's capabilities, it is important to distinguish between the model family (Gemini Ultra, Pro, Nano, Flash) and the product (Gemini.google.com, formerly Bard). The Ultra-class model powering the professional tier has consistently outperformed or matched GPT-4 on standard benchmarks; the free consumer tier uses lighter models with more noticeable limitations.

Google DeepMind Consolidation

Prior to April 2023, Google operated two major AI research organizations: Google Brain, focused on scalable deep learning, and DeepMind, the London-based lab acquired in 2014 and known for AlphaGo, AlphaFold, and reinforcement learning. Their merger into Google DeepMind under CEO Demis Hassabis was explicitly motivated by the need to compete more effectively with OpenAI.

The consolidation brought together complementary strengths. Brain contributed infrastructure, TPU expertise, and large-scale language model experience. DeepMind contributed reasoning research, safety work, and the scientific domain knowledge embedded in systems like AlphaFold 2 — which in 2020 solved the 50-year protein-folding problem and remains arguably the most consequential AI achievement of the decade outside of language models.

Why This Matters for Practitioners

Understanding Gemini's origins explains its architecture and integration priorities. Because it was built natively multimodal and tightly coupled with Google's infrastructure — Search, Workspace, Cloud, Android — Gemini behaves differently from GPT-4 or Claude in enterprise and mobile contexts. Knowing the history helps you predict where each model is likely to excel or struggle.

Lesson 1 Quiz

3 questions — free, untracked, retake anytime.

What factual error in a promotional GIF caused Alphabet's stock to drop approximately $100 billion in February 2023?

✓ Correct. The GIF showed Bard incorrectly answering a James Webb question, triggering immediate market and media backlash.

✗ The error was about the James Webb Space Telescope — a factual astronomy claim Bard got wrong in the promotional GIF.

Which Google model was trained on the Pathways system and scaled to 540 billion parameters?

✓ Correct. PaLM (Pathways Language Model) was announced April 2022 and trained on the Pathways infrastructure at 540B parameters.

✗ PaLM is the correct answer — it was Google's first Pathways-trained model and a direct predecessor to Gemini.

What made the Gemini 1.0 launch demo deceptive, according to Google's own subsequent disclosure?

✓ Correct. Google's own blog post acknowledged the video was edited: no live video, no real-time responses, post-produced narration.

✗ Google disclosed that the demo was edited — prompts were static images, responses were not live, and narration was post-production.

Lab 1 · Gemini's History and Architecture

Discuss Google's AI lineage, the LaMDA–Bard–Gemini transition, and multimodal design with your AI tutor.

What to Explore

In this lab you will interrogate the historical and architectural decisions that shaped Gemini. Ask about the Google DeepMind merger, the significance of the Pathways training system, or how native multimodality differs from retrofitted vision. The tutor will provide factual, documented answers grounded in public announcements and research papers.

Try asking: "Why did Google merge Brain and DeepMind instead of keeping them separate? What were the strategic reasons?" — or — "How does training natively multimodal differ from adding a vision encoder to a language-only model?"

AI Lab — Gemini OriginsTUTOR

GPT vs. Claude vs. Gemini · Module 4 · Lesson 2

Gemini's Model Tiers: Ultra, Pro, Flash, and Nano

Google's strategy of deploying a tiered model family across devices, the cloud, and consumer products — and what each tier actually does.

At Google I/O in May 2024, Pichai announced over 100 new product updates featuring Gemini, from search summaries to Gmail drafting to on-device Pixel features. The breadth was deliberate: Google needed to demonstrate that Gemini was not just a research artifact or a standalone chatbot but a platform capability woven through every product it makes. The challenge was explaining to developers and consumers which Gemini they were actually using at any given moment.

The Four-Tier Architecture

Google structured Gemini into distinct capability tiers designed for different deployment contexts. Understanding this hierarchy is essential for anyone evaluating Gemini for professional or enterprise use.

Gemini Ultra is the largest and most capable model, designed for highly complex tasks requiring deep reasoning, extended context, and advanced multimodal understanding. In the 1.0 generation, Ultra outperformed GPT-4 on the Massive Multitask Language Understanding (MMLU) benchmark, scoring 90.0% versus GPT-4's 86.4% — the first model to achieve human-expert-level performance on that benchmark. Ultra powers Gemini Advanced, the paid subscription tier available via Gemini.google.com and the Google One AI Premium plan.

Gemini Pro is the mid-tier model optimized for a wide range of tasks at scale. It powers the free Gemini consumer experience and is available via the Gemini API on Google AI Studio and Vertex AI. Pro is the primary model most developers interact with and is well suited to summarization, classification, code generation, and structured data tasks.

Gemini Flash, introduced with Gemini 1.5, is optimized for speed and cost efficiency. Flash processes requests significantly faster than Pro and at lower cost per token, making it the preferred choice for high-volume applications like real-time chatbots, document processing pipelines, and consumer-facing features where latency matters. Flash with 1.5 still retains the breakthrough one-million-token context window that defines the 1.5 generation.

Gemini Nano is designed to run entirely on-device without a network connection. Nano is embedded in Google's Pixel 8 Pro, Pixel 9 series, and Samsung Galaxy S24 series. It powers features like Summarize in the Recorder app, Smart Reply in Gboard, and on-device Magic Compose in Messages. Because Nano operates locally, it processes sensitive data without sending it to Google's servers — a meaningful privacy differentiation.

The 1.5 Generation: Long Context as a Differentiator

The most significant architectural advance in Gemini 1.5, announced in February 2024, was the dramatic expansion of context window capacity. While GPT-4 Turbo offered 128,000 tokens and Claude 3 offered 200,000, Gemini 1.5 Pro launched with one million tokens in public preview — enough to process approximately 700,000 words, one hour of video, or 30,000 lines of code in a single prompt.

This was achieved through a technique called Multi-Query Attention combined with Google's Mixture-of-Experts architecture, which allows the model to activate only the specialized subnetworks relevant to a given input rather than running the entire parameter space for every token. The result is dramatically improved efficiency at scale.

In an internal test published by Google in February 2024, Gemini 1.5 Pro successfully retrieved a specific 30-word passage hidden within a 10-million-token corpus — essentially a needle-in-a-haystack task at unprecedented scale. The model located the passage with near-perfect accuracy.

Practical Note for Developers

As of mid-2024, Gemini 1.5 Pro with a 1M-token context window was available in Google AI Studio and Vertex AI. The 2M-token context window was made available in limited preview. For most enterprise document processing tasks, Flash at 1M tokens offers the best cost-to-capability ratio in Google's ecosystem.

Gemini 1.5 vs. Gemini 1.0: What Changed

Beyond context length, Gemini 1.5 demonstrated notably improved performance on multimodal tasks. In evaluations published by Google, 1.5 Pro could watch an hour-long film and answer detailed plot questions; analyze entire codebases and suggest architectural refactors; and translate between previously unseen languages after being given a grammar reference document in-context — without any fine-tuning.

This in-context learning capability was highlighted as a distinguishing factor from competitors. While GPT-4 and Claude 3 also support in-context learning, the scale at which Gemini 1.5 operates enables qualitatively different applications — entire software projects, legal discovery corpora, or a full season of quarterly earnings transcripts all processable in one call.

Competitive Positioning

Google positions Gemini's tiered architecture as covering the full deployment spectrum — from a sub-100ms on-device reply to a million-token enterprise document analysis — under a single model brand. No competitor at launch offered both on-device and ultra-long-context capability under one product umbrella. Whether this breadth translates to developer adoption depends heavily on the quality of tooling and API ergonomics.

Lesson 2 Quiz

3 questions — free, untracked, retake anytime.

What is the primary design goal of Gemini Nano compared to other tiers in the Gemini family?

✓ Correct. Nano is embedded in Pixel and Samsung devices to run local inference with no server round-trip, enabling privacy-sensitive features.

✗ Gemini Nano is specifically designed for on-device, offline operation — that's its defining characteristic versus Pro, Flash, and Ultra.

On which benchmark did Gemini Ultra 1.0 score 90.0%, reportedly outperforming GPT-4, making it the first model to surpass human-expert-level performance?

✓ Correct. MMLU tests knowledge across 57 subjects. Gemini Ultra 1.0 scored 90.0%, the first model reported to exceed human expert performance on that benchmark.

✗ The benchmark is MMLU — Massive Multitask Language Understanding — where Gemini Ultra scored 90.0% to GPT-4's 86.4%.

What architectural technique enables Gemini 1.5 to handle one-million-token context windows efficiently?

✓ Correct. Gemini 1.5 uses a Mixture-of-Experts design combined with Multi-Query Attention, activating only relevant sub-networks per input for efficiency at scale.

✗ The answer is Mixture-of-Experts plus Multi-Query Attention — only relevant model sub-networks activate per token, enabling efficient long-context processing.

Lab 2 · Model Tiers and Context Windows

Explore Gemini's tier architecture, benchmark performance, and the one-million-token context window with your AI tutor.

What to Explore

This lab focuses on practical decision-making within the Gemini model family. Ask the tutor to help you choose the right tier for a specific use case, understand the real-world implications of a 1M-token context window, or compare Gemini Flash versus Pro for a document processing pipeline.

Try asking: "I need to process 500-page legal contracts and extract key clauses automatically. Which Gemini tier should I use and why?" — or — "What's the actual cost difference between Gemini 1.5 Pro and Flash for a high-volume text classification task?"

AI Lab — Gemini TiersTUTOR

GPT vs. Claude vs. Gemini · Module 4 · Lesson 3

Google's Ecosystem Integration: Search, Workspace, and Cloud

Gemini's most powerful advantage is not its raw model capability — it's the infrastructure it plugs into and the data infrastructure Google uniquely controls.

When Google launched AI Overviews in Search at Google I/O 2024 — replacing what had been called Search Generative Experience in beta — it became the largest single deployment of generative AI to consumers in history, reaching approximately 1 billion users. Within days, screenshots went viral: the system recommended putting glue on pizza to help cheese stick, suggested eating rocks as part of a daily diet, and provided other responses that were factually wrong or drawn from satirical forum posts like Reddit.

Google pulled back several AI Overview categories and issued updates rapidly. The episode demonstrated both the scale of Google's AI ambition and the friction between deploying generative models at search speed and ensuring factual accuracy — a challenge no competitor has faced at equivalent scale.

Search Integration: SGE to AI Overviews

Google began experimenting with AI-generated search summaries through the Search Generative Experience (SGE), available in Search Labs starting May 2023. SGE provided an AI-synthesized paragraph above traditional blue links when a query was deemed complex enough to benefit from summarization. After roughly a year of testing involving millions of users, Google graduated SGE into AI Overviews and rolled it out to all US English searches in May 2024.

The integration is technically complex. Gemini's language understanding is combined with Google's Knowledge Graph — a structured database of roughly 500 billion facts about real-world entities — and its web index of hundreds of billions of pages. The result is grounded generation: responses are supposed to cite specific sources rather than generating from model weights alone. When source grounding fails, as in the pizza-glue incident, it exposes the fundamental tension between neural generation and factual reliability.

Critically, AI Overviews created a potential threat to the publisher ecosystem. If users receive answer-style summaries at the top of search results, they may click fewer links — reducing ad revenue for third-party sites that depend on Google referral traffic. Several publishers filed complaints and the News/Media Alliance published studies suggesting organic click-throughs declined in SGE-affected query categories.

Workspace: Gemini for Business and Enterprise

Google Workspace — encompassing Gmail, Docs, Sheets, Slides, Drive, and Meet — has approximately 3 billion users. Google began embedding Gemini capabilities directly into Workspace under the brand Duet AI in 2023, then rebranded the suite as Gemini for Workspace in early 2024. This positions Gemini in direct competition with Microsoft's Copilot for Microsoft 365.

The Workspace integration includes: Help Me Write in Gmail and Docs for drafting and refining text; Help Me Organize in Sheets for structuring data and generating formulas; Help Me Create in Slides for generating presentation decks from text outlines; and real-time meeting transcription and summarization in Meet. Enterprise customers on the Gemini Business or Gemini Enterprise add-on ($20–$30 per user per month as of mid-2024) receive access to Gemini 1.5 Pro capabilities within these tools.

A meaningful distinction from Microsoft Copilot: Workspace's Gemini integration uses data isolation — customer data processed by Gemini is not used to train Google's models by default, and enterprise data never leaves the tenant's environment. Google made this commitment explicit following enterprise customer pressure after the initial Duet AI announcement.

Competitive Context

Microsoft committed approximately $13 billion to OpenAI and built Copilot around GPT-4 across Office 365, Windows, Bing, and Azure. Google's counter-strategy is not to license a third-party model but to deploy its own — giving it full control over fine-tuning, safety configuration, and pricing. The question is whether end-user quality matches enterprise buyer expectations.

Google Cloud and Vertex AI

For enterprise AI development, Google offers Gemini models through Vertex AI, its managed ML platform, and through Google AI Studio, a lighter developer sandbox. Vertex AI provides Gemini Pro and Ultra APIs alongside model fine-tuning, grounding via Google Search, Retrieval-Augmented Generation (RAG) tooling, enterprise security controls, and integration with BigQuery and Cloud Storage.

Google Cloud's AI revenue grew significantly following Gemini's launch. In Q1 2024, Google Cloud revenue hit $9.57 billion, up 28% year-over-year — its fastest growth in years — with management attributing a portion of the acceleration to enterprise AI adoption through Vertex AI and Workspace add-ons. AWS and Azure still lead in overall cloud market share, but Google Cloud's AI tooling is increasingly competitive for organizations already in the Google ecosystem.

A specific differentiator is grounding with Google Search on Vertex AI, which allows enterprise applications to connect Gemini responses to real-time web data — a capability not available in GPT-4 or Claude APIs without a third-party search plugin.

The Infrastructure Moat

Google's deepest competitive advantage with Gemini is not the model itself but the distribution. One billion Search users, three billion Workspace users, two billion Android devices, and YouTube's creator ecosystem all represent deployment surfaces no AI lab starting from scratch can replicate. The strategic question for enterprise buyers is whether Google's model quality is good enough relative to OpenAI or Anthropic — and whether the workflow integration value outweighs model performance gaps in specific tasks.

Lesson 3 Quiz

3 questions — free, untracked, retake anytime.

What was the name of Google's AI search experience in beta testing before it became AI Overviews?

✓ Correct. Search Generative Experience launched in Search Labs in May 2023 and graduated to AI Overviews in May 2024 for all US English queries.

✗ The beta product was called Search Generative Experience (SGE), launched in Google Search Labs starting May 2023.

What was Google's AI productivity suite for Workspace called before it was rebranded as Gemini for Workspace in 2024?

✓ Correct. Google's Workspace AI features launched under the name Duet AI in 2023 before being rebranded as Gemini for Workspace in early 2024.

✗ The previous brand was Duet AI — Google consolidated it under the Gemini brand in early 2024 to align with the model family name.

What specific Vertex AI capability differentiates Gemini from GPT-4 and Claude APIs in enterprise deployments?

✓ Correct. Vertex AI offers grounding with Google Search — connecting Gemini responses to live web data — which GPT-4 and Claude APIs do not offer natively without third-party plugins.

✗ The differentiator is grounding with Google Search on Vertex AI, allowing Gemini enterprise responses to reference real-time web data directly.

Lab 3 · Google Ecosystem Integration

Explore how Gemini fits into Search, Workspace, and Cloud — and what it means for your organization.

What to Explore

In this lab you'll think through real enterprise and consumer scenarios involving Gemini's ecosystem integrations. The tutor can help you evaluate AI Overviews' impact on content strategy, compare Gemini for Workspace to Microsoft Copilot for a specific team workflow, or design a RAG pipeline using Vertex AI with Google Search grounding.

Try asking: "My organization uses Google Workspace and is deciding between Gemini Enterprise add-on and Microsoft Copilot for M365. What are the key tradeoffs?" — or — "How does AI Overviews affect SEO strategy for a content-heavy website?"

AI Lab — Ecosystem IntegrationTUTOR

GPT vs. Claude vs. Gemini · Module 4 · Lesson 4

Choosing Between GPT, Claude, and Gemini: A Practical Decision Framework

No model wins on every dimension. Knowing which one to reach for — and why — is the skill that separates informed practitioners from tool collectors.

The product team at a mid-sized healthcare analytics company is 45 minutes into a meeting about which AI model to use for their new clinical documentation assistant. The engineering lead wants GPT-4 Turbo — it's what the team already prototyped with. The compliance officer wants Claude — she read that Anthropic has strong safety commitments and they already have an AWS relationship. The data science manager is pushing for Gemini — the company runs entirely on Google Workspace and she can see a path to integrating directly with the EHR data stored in BigQuery. All three are right. All three are also missing important context.

This meeting happens in thousands of organizations every week. The answer is almost never "use the best model" — it's "use the best model for your specific constraints, ecosystem, and risk tolerance." This lesson gives you the framework to navigate that conversation.

When Gemini's Ecosystem Advantage Wins

Gemini's primary competitive edge is not raw model performance — it is distribution and integration depth. Choose Gemini when the following conditions apply:

Your organization is already Google-native. If your team lives in Gmail, Docs, Sheets, and Meet, Gemini for Workspace removes all integration friction. Help Me Write in Gmail, generative summaries in Meet, and formula assistance in Sheets require zero API work. The value is immediate and requires no developer involvement.

You need real-time web grounding without a plugin. Vertex AI's grounding with Google Search is a first-party, enterprise-grade capability. For use cases like competitive intelligence, real-time regulatory lookup, or news summarization, this is a meaningful differentiator. GPT-4 and Claude require third-party integrations to achieve equivalent results.

You are processing massive documents or entire codebases. Gemini 1.5 Pro's one-million-token context window is effectively class-leading for tasks requiring a model to hold an entire corpus in working memory — legal discovery, large-scale code review, or processing an entire year of meeting transcripts in one call.

You have Android or Pixel device deployment requirements. Gemini Nano's on-device capability is the only option when network connectivity is unreliable or when regulatory constraints require local inference on mobile endpoints.

When GPT-4 or Claude Win Instead

GPT-4 (OpenAI) tends to win when: Your team is building on the most mature developer ecosystem. OpenAI's API, documentation, community tooling, LangChain integrations, and fine-tuning infrastructure are the most battle-tested in the industry. If your use case requires extensive customization, a large existing community of example code, or Azure-hosted enterprise deployment with compliance certifications already in place, GPT-4 is often the lower-friction choice.

GPT-4's function calling and structured output capabilities are also particularly mature. For agentic workflows where the model needs to reliably call tools, parse schemas, and maintain structured state, many teams find GPT-4 more predictable than alternatives at equivalent complexity levels.

Claude (Anthropic) tends to win when: The use case is long-form, nuanced writing or document synthesis — tasks where tone, coherence across thousands of words, and resistance to hallucination matter. Claude's Constitutional AI training makes it more likely to acknowledge uncertainty, express caveats, and decline clearly harmful requests without becoming uselessly restrictive.

Claude also wins in highly regulated industries where customers are sensitive to AI training data practices. Anthropic's explicit commitments around not training on enterprise API data, combined with its AWS partnership (and thus SOC 2 and HIPAA alignment through AWS infrastructure), make it a natural choice for healthcare, legal, and financial services contexts. The same healthcare documentation scenario from our opening story is, in practice, often resolved in Claude's favor for exactly these reasons.

Privacy and Data Isolation

All three providers have made enterprise data isolation commitments: customer data is not used to train production models by default. However, the mechanisms differ. Google enforces isolation at the Workspace and Vertex AI tier level — free consumer Gemini has different data practices. OpenAI offers data processing agreements and an enterprise tier with explicit opt-out of training. Anthropic does not train on API data at all by default. When advising an enterprise buyer, verify the current data processing addendum for each provider rather than relying on general marketing claims.

Google's Future AI Roadmap: What's Coming

Project Astra is Google DeepMind's research initiative toward a universal AI agent — a system that can see through a camera, remember what it observed across sessions, and take actions in real-world contexts. Demoed at Google I/O 2024, Astra showed a prototype using a phone camera to identify objects, explain code on a screen, and answer questions about a physical environment in near-real time. This represents Google's long-term vision for Gemini: not a chatbot but a persistent, ambient AI operating across all of a user's devices and contexts.

Gemini 2.0 and subsequent generations are expected to continue pushing multimodal boundaries — particularly native audio output (not text-to-speech but audio generated directly by the model), video understanding at commercial scale, and expanded agent action capabilities through the Gemini API's function-calling and code execution features.

NotebookLM deserves particular attention as a signal of Google's product direction. NotebookLM is a research and note-taking tool that uses Gemini to deeply analyze a user's uploaded documents — papers, notes, transcripts — and answer questions about them with precise citations. Its Audio Overview feature, which converts a document corpus into a realistic two-host podcast discussion, became a viral consumer hit in late 2024. NotebookLM demonstrates that Gemini's long-context strength can be packaged as consumer-accessible tools that don't require any developer sophistication.

Practical Selection Heuristics

Use this as a quick-decision scaffold: Start with Gemini if you are Google-native, need real-time web grounding, or have a 500K+ token document task. Start with GPT-4 if you need mature tooling, Azure compliance, or reliable structured-output agentic workflows. Start with Claude if your task is long-form writing, you are in a regulated industry with data sensitivity concerns, or you need a model that handles nuance and uncertainty gracefully. When in doubt, run a parallel evaluation: send the same representative prompt to all three and score outputs against your actual rubric — not a generic benchmark.

Lesson 4 Quiz

3 questions — free, untracked, retake anytime.

According to the practical decision framework in Lesson 4, which scenario most favors choosing Gemini over GPT-4 or Claude?

✓ Correct. Gemini's ecosystem integration advantage is most compelling when an organization is already Google-native — Workspace AI requires zero API work and delivers immediate value.

✗ The clearest Gemini win is for Google-native organizations where Workspace integration delivers immediate value without any development work.

What is Google's Project Astra, announced at Google I/O 2024?

✓ Correct. Project Astra is Google DeepMind's research prototype for a persistent, ambient AI agent that operates through device cameras and across contexts — Google's long-term vision for where Gemini is heading.

✗ Project Astra is a universal AI agent research initiative — demonstrated via phone camera, capable of seeing, reasoning about, and acting on a user's physical environment.

Which AI model provider does NOT train on enterprise API data by default, according to Lesson 4's discussion of data isolation?

✓ Correct. Anthropic does not train on API data by default. Google's isolation applies only to Workspace and Vertex AI tiers — consumer Gemini has different practices. OpenAI requires a data processing agreement for enterprise training opt-out.

✗ Anthropic's default policy is that API data is never used for training. Google and OpenAI have more tier-dependent or agreement-dependent policies.

Lab 4 · Choosing the Right Model for Your Use Case

Work through real selection scenarios with your AI tutor — and stress-test the decision framework from Lesson 4 against your own projects.

What to Explore

In this lab you will apply the practical decision framework from Lesson 4 to concrete use cases. The tutor can help you think through model selection for a specific project you're working on, evaluate tradeoffs between GPT-4, Claude, and Gemini for a given scenario, or pressure-test the reasoning behind a choice you've already made. Bring a real use case if you have one — the framework is most useful with specifics.

Try asking: "My company is considering using AI to summarize legal contracts before attorney review. We use Microsoft 365, not Google, and our legal team is very concerned about data privacy. Which model should we use and why?" — or — "I'm building a customer support chatbot that needs to escalate to a human when sentiment turns negative. Walk me through the model selection decision."

AI Lab — Model SelectionTUTOR

Module Test

15 questions covering all lessons — free, untracked, retake anytime.

Score: 0/15

Which Google model, announced in April 2022, was the first trained on the Pathways system and scaled to 540 billion parameters?

✓ Correct. PaLM was announced April 2022 and was Google's first Pathways-trained model at 540B parameters.

✗ PaLM — Pathways Language Model — was the first Pathways-trained model, announced April 2022 at 540 billion parameters.

What caused Alphabet's stock to drop approximately $100 billion in a single day in February 2023?

✓ Correct. The Bard GIF error about the James Webb Space Telescope triggered an immediate market selloff, erasing roughly $100 billion in market cap.

✗ A promotional GIF showing Bard giving a wrong answer about the James Webb Space Telescope caused the drop — not a delay or legal issue.

What did Google acknowledge about its December 2023 Gemini 1.0 launch demo after journalists and researchers examined it closely?

✓ Correct. Google's own blog acknowledged the demo was edited: responses were not live, video was replaced by still images, and narration was post-produced.

✗ Google admitted the demo was edited — no real-time responses, no live video (still images only), and narration was added afterward.

What two AI research organizations merged in April 2023 to form Google DeepMind?

✓ Correct. Google Brain (scalable deep learning and TPU infrastructure) and DeepMind (reasoning, safety, and scientific AI) merged under Demis Hassabis in April 2023.

✗ Google Brain and DeepMind merged — two complementary research organizations combined to compete more effectively with OpenAI.

Which Gemini tier is designed to run entirely on-device without a network connection, powering features like Summarize in Google's Recorder app?

✓ Correct. Gemini Nano runs locally on Pixel 8 Pro, Pixel 9, and Samsung Galaxy S24 devices with no server round-trip required.

✗ Gemini Nano is the on-device tier — embedded in Pixel and Galaxy devices for offline local inference.

On the MMLU benchmark, what score did Gemini Ultra 1.0 achieve — the first model reported to surpass human-expert-level performance?

✓ Correct. Gemini Ultra 1.0 scored 90.0% on MMLU, compared to GPT-4's 86.4% — the first model to exceed human-expert-level performance on that benchmark.

✗ Gemini Ultra scored 90.0% on MMLU. GPT-4 scored 86.4%. The human expert threshold is approximately 89.8%.

What is the maximum context window available in Gemini 1.5 Pro, announced in February 2024?

✓ Correct. Gemini 1.5 Pro launched with a one-million-token context window in public preview — roughly 700,000 words or one hour of video in a single prompt.

✗ Gemini 1.5 Pro launched with a one-million-token context window — far exceeding GPT-4 Turbo (128K) and Claude 3 (200K) at the time.

What architectural technique enables Gemini 1.5 to process one-million-token context windows efficiently by activating only relevant sub-networks per input?

✓ Correct. MoE activates only the specialist sub-networks relevant to a given input, while Multi-Query Attention reduces memory overhead — together enabling efficient long-context processing.

✗ The answer is Mixture-of-Experts with Multi-Query Attention — routing inputs to relevant specialist sub-networks rather than activating the full model for every token.

What viral controversy followed the May 2024 launch of AI Overviews in Google Search?

✓ Correct. Screenshots went viral showing AI Overviews recommending glue on pizza and eating rocks — responses drawn from satirical Reddit posts, exposing the grounding failure risk at scale.

✗ The pizza-glue and rocks-as-diet recommendations went viral — responses sourced from satirical forum posts, demonstrating the risks of generative search at billion-user scale.

What was the name of Google's AI productivity suite for Google Workspace before it was rebranded as Gemini for Workspace in early 2024?

✓ Correct. Duet AI was Google's 2023 brand for Workspace AI features before the suite was consolidated under the Gemini name in early 2024.

✗ Duet AI was the original brand — Google rebranded it as Gemini for Workspace in early 2024 to align with the broader model family name.

Which capability on Vertex AI differentiates Gemini from GPT-4 and Claude APIs in enterprise deployments without requiring a third-party plugin?

✓ Correct. Vertex AI's first-party grounding with Google Search connects Gemini enterprise responses to live web data — a capability GPT-4 and Claude APIs require third-party integrations to match.

✗ Grounding with Google Search on Vertex AI is the key differentiator — real-time web data access without a third-party plugin.

Approximately how many users does Google Workspace have, making it a critical distribution channel for Gemini's enterprise capabilities?

✓ Correct. Google Workspace has approximately 3 billion users — a distribution advantage no AI competitor can replicate by building a standalone product.

✗ Google Workspace has approximately 3 billion users, which is a core part of what makes Gemini's enterprise distribution so strategically significant.

According to the Lesson 4 decision framework, which scenario most clearly favors choosing Claude over GPT-4 or Gemini?

✓ Correct. Claude wins for regulated industries with data sensitivity concerns and long-form tasks requiring careful handling of nuance — the healthcare clinical documentation scenario fits all of these criteria.

✗ Claude is the framework's recommendation for regulated industries (healthcare, legal, financial services) where data privacy and nuanced, uncertainty-aware outputs matter most.

What is NotebookLM and why does it represent a strategic signal about Google's product direction?

✓ Correct. NotebookLM demonstrates that Gemini's long-context strength can be packaged as consumer-accessible products — the Audio Overview podcast feature went viral in late 2024, showing how Google plans to bring AI to non-technical users.

✗ NotebookLM is a document-analysis and research tool whose Audio Overview feature — converting documents into a two-host podcast — went viral, signaling Google's intent to make long-context AI consumer-friendly.

What does the Lesson 4 framework recommend when a practitioner is genuinely uncertain which model to choose for a new use case?

✓ Correct. The framework explicitly recommends parallel evaluation with your own rubric when uncertain — generic benchmarks do not reliably predict performance on specific tasks.

✗ The framework recommends a parallel evaluation: send the same representative prompt to all three models and score against your actual rubric — not a generic benchmark or cost table.

Gemini's Origins: From LaMDA to the Multimodal Era

Lesson 1 Quiz

Lab 1 · Gemini's History and Architecture

What to Explore

Gemini's Model Tiers: Ultra, Pro, Flash, and Nano

Lesson 2 Quiz

Lab 2 · Model Tiers and Context Windows

What to Explore

Google's Ecosystem Integration: Search, Workspace, and Cloud

Lesson 3 Quiz

Lab 3 · Google Ecosystem Integration

What to Explore

Choosing Between GPT, Claude, and Gemini: A Practical Decision Framework

Lesson 4 Quiz

Lab 4 · Choosing the Right Model for Your Use Case

What to Explore

Module Test

Module Test Result