Lesson 1 · Module 2

The Generative Image Landscape

Midjourney, Stable Diffusion, DALL·E, Firefly — mapping the tools reshaping visual production.

How do the dominant AI image tools actually differ — and why does tool choice matter for professional design work?

In August 2022, Jason Allen's AI-assisted piece Théâtre D'Opéra Spatial — created using Midjourney — won first place at the Colorado State Fair fine art competition. The judges did not initially know it was AI-generated. The event ignited a global debate about AI tools in visual art and design, and it marked the moment generative image tools crossed from developer curiosity into mainstream creative discourse.

Within eighteen months, every major platform in the design industry — Adobe, Canva, Getty Images, Shutterstock — had either launched or announced its own generative image offering. The landscape had shifted permanently.

The Major Platforms

By 2024, four platforms dominated professional and creative usage. Each uses a different underlying architecture and targets a different user context. Understanding their differences is not optional knowledge for working designers — it is the new baseline.

Discord / Web

Midjourney

Diffusion-based. Known for strong aesthetic coherence, painterly outputs, and high photorealism. Subscription-only. No open API. Fastest adoption in concept art, fashion, and editorial design.

Open Source

Stable Diffusion

Runs locally or on cloud. Full control via model weights, LoRAs, and fine-tuning. Used by studios requiring brand-consistent outputs at scale. Steeper learning curve; maximum flexibility.

OpenAI / API

DALL·E 3

Integrated into ChatGPT and the OpenAI API. Strongest at following precise text instructions. Moderate aesthetic ceiling compared to Midjourney. Well-suited to prototype mockups and content pipelines.

Adobe Ecosystem

Firefly

Trained exclusively on licensed Adobe Stock content. Outputs are commercially safe. Deeply integrated in Photoshop (Generative Fill), Illustrator, and Express. Priority for agency and enterprise workflows.

How They Work: Diffusion in Plain Language

All four tools share a common ancestor: diffusion models. The training process adds random noise to millions of images until they become indistinguishable from static. The model then learns to reverse that process — to denoise a random field into a coherent image guided by a text prompt. What varies between tools is the size of the training dataset, the quality of the text-image pairs used, and the fine-tuning applied after the base model is trained.

This matters for designers because it explains two recurring phenomena: hallucinated anatomy (hands, teeth, text characters) and prompt-drift, where the model emphasises the most statistically common meaning of an ambiguous word rather than your intended one. Knowing the mechanism helps you write prompts that work with the model's tendencies rather than against them.

Why Firefly for Commercial Work

Adobe trained Firefly exclusively on Adobe Stock images, openly licensed content, and public-domain material. This means outputs do not carry the copyright uncertainty that surrounds tools trained on scraped internet data. For client-facing deliverables, this distinction is not aesthetic — it is legal.

Choosing the Right Tool

The practical decision tree for most designers comes down to three factors: commercial licensing requirements, workflow integration, and aesthetic target. Firefly wins on commercial safety and Adobe integration. Midjourney wins on aesthetic quality and speed of iteration for concept work. Stable Diffusion wins when you need brand-specific fine-tuning or high-volume generation without per-image costs. DALL·E 3 wins when you're already inside a ChatGPT or OpenAI API workflow and need reliable prompt adherence.

Key Terms

Diffusion ModelA neural network trained to reverse a noise-adding process, generating images from text prompts by progressively denoising a random field.

LoRALow-Rank Adaptation — a lightweight fine-tuning method that customises a base Stable Diffusion model for a specific visual style without retraining the full model.

Commercially SafeGenerated content trained on licensed data, making it defensible for use in client deliverables without copyright exposure.

Lesson 1 Quiz

The Generative Image Landscape — 3 questions

Which generative image tool was trained exclusively on licensed Adobe Stock and public-domain content, making its outputs commercially safer for client work?

Correct. Adobe Firefly was deliberately trained on licensed Adobe Stock, openly licensed content, and public-domain material — specifically to remove copyright uncertainty from commercial design workflows.

Not quite. Adobe Firefly is the tool Adobe built with commercial safety as the primary design goal, training it only on licensed and public-domain content.

Diffusion models generate images by:

Correct. Diffusion models are trained by adding noise to images until they become pure static, then learning to reverse that degradation — generating new images by starting with noise and iteratively denoising toward a coherent output.

Diffusion is not retrieval or compositing. The model learns to reverse a noise-adding process, generating entirely new pixel arrangements rather than copying existing images.

A designer needs brand-specific outputs at scale without per-image costs and requires full control over model weights. Which tool is most appropriate?

Correct. Stable Diffusion's open-source nature allows full access to model weights, supports fine-tuning via LoRA, and can be run locally or on cloud infrastructure without per-image fees — ideal for brand-consistent, high-volume production.

Stable Diffusion is the right answer here. As an open-source model, it allows full weight access, supports LoRA fine-tuning for brand consistency, and eliminates per-image costs when run locally.

Lab 1 — Tool Selection Advisor

Practice choosing the right generative image tool for real design scenarios.

Your Task

Describe a real or hypothetical design project and ask which generative image tool is best suited to it. Push back, explore edge cases, and ask follow-up questions about licensing, workflow integration, or model capabilities.

Try: "I'm designing a brand campaign for a pharmaceutical client who needs images for regulated advertising. Which tool should I use and why?" — or bring your own scenario.

AI Design Advisor

Generative Tools

Welcome to the Tool Selection Lab. Describe your project — client type, output format, volume, commercial requirements — and I'll help you reason through which generative image platform fits best. What are you working on?

Lesson 2 · Module 2

Prompt Engineering for Image Generation

The vocabulary of visual instruction — how designers communicate intent to generative models.

What separates a prompt that produces something usable from one that produces something generic — and is that difference learnable?

In 2023, Heinz ran a campaign called "A.I. Ketchup" — asking consumers and an AI image tool to generate images of ketchup. Every output, regardless of prompt phrasing, returned an image that unmistakably resembled Heinz packaging. The agency Rethink documented this as proof of brand salience: Heinz had trained the cultural model such that "ketchup" and "Heinz" were nearly synonymous. The campaign was effective precisely because the AI's training data reflected cultural reality.

The lesson for working designers: AI models reflect the statistical weight of their training data. A prompt is not a command — it is a probability nudge. Understanding what the model has seen is as important as knowing what you want.

Anatomy of an Effective Prompt

Effective prompts for image generation typically contain four layers of information, each constraining the probability space the model explores:

Layer 1

Subject

The primary visual element. Be specific: not "a woman" but "a woman in her 50s, dark hair, reading glasses, seated at a desk covered in architectural drawings."

Layer 2

Style / Medium

Art direction signals: "editorial photography," "gouache illustration," "isometric vector," "Bauhaus poster." Reference known aesthetic movements, photographers, or design eras.

Layer 3

Composition & Light

Cinematographic and photographic language: "Dutch angle," "golden hour back-light," "flat lay," "shallow depth of field f/1.8," "rule of thirds."

Layer 4

Negative Space / Exclusions

Most tools support negative prompts or exclusion syntax. "no text," "no watermark," "avoid photorealism" can be as important as affirmative instructions.

Midjourney Parameters: A Working Reference

Midjourney's parameter system allows designers to specify output characteristics beyond natural language. The most used parameters in production work are:

Parameter	What It Controls	Practical Use
--ar 16:9	Aspect ratio	Match output to intended canvas before generation, not after cropping
--stylize 0–1000	Aesthetic strength vs. prompt fidelity	Lower for brand accuracy; higher for expressive concept work
--chaos 0–100	Variation between grid options	Higher chaos for exploration; lower for consistent iterations
--no [term]	Negative prompt	Exclude recurring unwanted elements (text, extra limbs, watermarks)
--seed [number]	Locks randomness	Reproduce a result exactly for iterative refinement
--sref [image URL]	Style reference image	Supply a visual style target; model adapts composition not just keywords

Prompt Failure Patterns

Three failure patterns recur consistently across all image tools. Knowing them in advance saves iteration time:

1. Semantic Collision: Prompts that combine multiple strong visual archetypes produce blended outputs. "A cyberpunk geisha in a Paris café" gives the model three competing dominant attractors. Simplify or use image references to anchor one dimension.

2. Temporal Ambiguity: "Modern," "contemporary," and "current" are nearly meaningless — the model interprets them relative to its training cutoff. Use decade references ("2010s Scandinavian interior") or material specifics ("brushed concrete, matte black steel").

3. Typography Hallucination: No diffusion model reliably renders legible text. Design any required copy in Illustrator or Photoshop post-generation. Attempting to prompt for specific words wastes significant iteration time.

The Heinz Principle

The AI model reflects the statistical weight of its training data. Culturally dominant brands, visual archetypes, and canonical styles will always overpower subtle prompt modifiers. Use this as a design tool: anchor prompts to culturally strong references when you want coherent outputs fast.

Key Terms

Negative PromptInstructions that tell the model what to exclude from the output, reducing unwanted recurring elements.

SeedA numeric value that locks the random noise field used to generate an image, enabling reproducible outputs for iterative refinement.

Style Reference (sref)A supplied image URL that the model uses as a visual style target, going beyond what keyword prompts alone can communicate.

Stylize ParameterA Midjourney parameter (0–1000) controlling the balance between strict prompt fidelity and the model's aesthetic tendencies.

Lesson 2 Quiz

Prompt Engineering for Image Generation — 3 questions

A designer wants to reproduce a Midjourney result exactly for iterative refinement across multiple sessions. Which parameter enables this?

Correct. The --seed parameter locks the random noise field used to initialise generation, making results reproducible with the same prompt and model version.

The --seed parameter is what you need. It locks the random noise field so that the same prompt, same seed, and same model version reproduce an identical or near-identical output.

Which prompt failure pattern explains why writing "modern design" rarely produces useful results in AI image generation?

Correct. "Modern" is interpreted relative to the model's training cutoff, producing inconsistent or generic results. Specific decade or material references ("2020s matte concrete and black steel") give the model a meaningful anchor.

This is Temporal Ambiguity. Words like "modern" or "contemporary" have no stable meaning for a model — it interprets them relative to its training data cutoff, producing inconsistent outputs. Use decade or material-specific references instead.

Based on the Heinz "A.I. Ketchup" campaign, what does the case demonstrate about how AI image models respond to prompts?

Correct. Because Heinz ketchup was so dominant in training data imagery associated with the word "ketchup," the model consistently returned Heinz-style outputs. This reflects the fundamental principle that prompts are probability nudges, not commands.

The Heinz case illustrates that models reflect statistical reality in their training data. Heinz was so culturally dominant in "ketchup" imagery that the model associated the category term almost exclusively with that brand's visual identity.

Lab 2 — Prompt Architect

Build and refine image generation prompts for real design briefs.

Your Task

Bring a design brief or visual concept. The advisor will help you build a structured prompt using the four-layer framework (subject, style/medium, composition/light, exclusions) and suggest relevant Midjourney parameters. Iterate, push back, and test edge cases.

Try: "I need a hero image for a fintech app landing page — modern, trustworthy, not clichéd stock-photo-looking. Help me write a Midjourney prompt." — or bring your own brief.

AI Prompt Architect

Prompt Engineering

Ready to build prompts. Tell me what you're designing — the intended platform, mood, subject, and any references you have in mind. I'll help you construct a layered prompt and add Midjourney parameters that match your brief. What's the project?

Lesson 3 · Module 2

Generative Fill, Inpainting & Workflow Integration

From standalone generation to embedded design tools — how AI image features live inside Photoshop, Illustrator, and production pipelines.

When does using Generative Fill actually save time versus create new problems — and what are the failure modes designers need to know?

Adobe shipped Generative Fill in Photoshop's beta in May 2023, backed by Firefly. Within weeks, designers at agencies including BBDO and independent studios were documenting workflows where background extension — historically a task requiring hours of careful cloning and compositing — was completed in under two minutes. The trade publication Photoshop Cafe published side-by-side benchmarks showing a 4-hour editorial image extension reduced to 8 minutes including review cycles.

The capability was not without controversy: early versions struggled with consistent lighting direction and edge artifact suppression, requiring skilled human review. Adobe acknowledged these limitations in its release notes, framing Generative Fill as a starting point, not a final output tool.

Core Capabilities: What These Features Do

The terminology around AI-assisted image editing has proliferated faster than standardised definitions. For working designers, three capabilities are most relevant:

Photoshop Firefly

Generative Fill

Select any region, type a text prompt, and Firefly generates new pixel content matching the surrounding context. Used for: background extension, object removal, object replacement, adding elements to compositions.

Technical Term

Inpainting

The underlying technique: the model sees the full image including the masked region, infers what should occupy that space, and generates accordingly. The surrounding pixels act as a conditioning signal.

Photoshop Firefly

Generative Expand

Extends an image beyond its existing canvas boundaries. Critical for adapting a 4:3 photograph to a 16:9 banner without sourcing new photography. One of the highest-ROI use cases in production design.

Stable Diffusion

img2img + ControlNet

Transforms an existing image with a text prompt while preserving structure via ControlNet (edge, pose, depth maps). Preferred for style transfer on existing photography without losing composition.

Where Generative Fill Excels

In documented production usage, Generative Fill delivers the most reliable time savings in four scenarios:

Background Extension: Adapting photography to new aspect ratios. A 1:1 social post image extended to 16:9 for a YouTube banner. The model reads edge pixels and continues the scene convincingly — particularly for natural environments, architecture, and neutral backgrounds.

Object Removal: Removing unwanted elements (a logo, a sign, a stray cable) and having the model fill the region with contextually plausible content. More reliable than clone-stamping for large irregular regions.

Prop Addition: Adding an object to a composed product shot ("a coffee cup on the desk to the left of the laptop") without reshooting. Saves reshoots for minor art direction changes discovered post-production.

Sky Replacement with Context: Beyond simple sky swap filters, Generative Fill can repopulate an entire background scene including reflections and ambient light changes, though this requires careful masking and human review.

Known Failure Modes

Designers who skip review cycles get burned by consistent failure patterns. The three most critical:

Lighting Inconsistency: Generated content frequently does not match the light direction or colour temperature of the original image. Always check generated fill edges against the original lighting schema. A shadow falling the wrong direction is immediately visible to trained eyes.

Texture Discontinuity at Edges: The transition between generated and original pixels can show as a subtle halo or texture shift at mask boundaries. Use the Remove Tool or clone stamp at low opacity to blend seams before final output.

Semantic Drift in Complex Scenes: When filling large regions in complex images, the model may generate content that is technically plausible but contextually wrong — a window that doesn't match the building's architecture, or a floor pattern inconsistent with the room. Always zoom to 100% before approving.

Commercial Safety Note

Because Adobe Generative Fill runs on Firefly (trained on licensed content), fills generated in Photoshop carry Adobe's commercial indemnification policy as of their 2023 commercial release — meaning Adobe accepts legal responsibility for IP claims on Firefly outputs used in commercial work. Verify current policy terms before relying on this for regulated industries.

Key Terms

InpaintingGenerating new pixel content within a masked region, conditioned on the surrounding image context.

Outpainting / Generative ExpandExtending an image beyond its existing canvas by generating plausible new content matching the edge context.

ControlNetA Stable Diffusion extension that conditions generation on structural maps (edges, poses, depth) to preserve composition while changing style or content.

Conditioning SignalThe surrounding pixels or structural map that guides an inpainting or img2img model toward contextually appropriate outputs.

Lesson 3 Quiz

Generative Fill, Inpainting & Workflow Integration — 3 questions

What is inpainting, as it applies to AI-assisted image editing?

Correct. Inpainting fills a selected (masked) region with generated content, using the surrounding pixels as a conditioning signal to ensure the fill is contextually appropriate.

Inpainting specifically refers to filling a masked region with generated content, using the surrounding pixels as context. Extending beyond the canvas is outpainting/Generative Expand.

A designer uses Generative Fill to extend a product photograph from 4:3 to 16:9. What documented failure mode should they specifically check for before approving the output?

Correct. Lighting inconsistency is the most common failure mode in Generative Expand and Generative Fill. Generated regions frequently have light sources or colour temperatures that don't match the original photography, which is immediately visible on close inspection.

The key thing to check is lighting inconsistency — the direction and colour temperature of light in the generated fill may not match the original image, making the seam visible to any trained eye.

Which Stable Diffusion tool allows a designer to change the style of an existing photograph while preserving its original composition and structure?

Correct. img2img transforms an existing image using a text prompt, and ControlNet conditions the generation on structural maps (edges, poses, depth) — preserving composition while changing style or rendering.

img2img with ControlNet is the correct answer. ControlNet uses structural maps derived from the original image (edge maps, depth maps, pose maps) to preserve composition while the img2img process applies a new style.

Lab 3 — Generative Fill Workflow Advisor

Plan and troubleshoot Generative Fill and inpainting workflows.

Your Task

Describe a specific image editing challenge — background extension, object removal, prop addition, style transfer — and work through the optimal workflow, masking strategy, and review steps. Ask about failure modes, ControlNet applications, or commercial safety.

Try: "I have a product photo shot on white at 1200×1200px. The client now needs it at 16:9 for a website banner with a lifestyle background. Walk me through how to approach this with Generative Fill." — or bring your own image challenge.

AI Workflow Advisor

Generative Fill

Welcome to the Generative Fill Workflow Lab. Describe your image editing challenge — the source image, the output requirement, and any constraints — and I'll help you plan the workflow, select the right tool, and anticipate failure modes. What are you working with?

Lesson 4 · Module 2

Ethics, Copyright & Professional Practice

The legal and ethical terrain around generative image tools — what designers are responsible for knowing.

Who owns an AI-generated image — and what obligations do designers take on when they use these tools professionally?

In August 2023, US District Court Judge Beryl Howell ruled in Thaler v. Vidal / Perlmutter that images generated entirely by AI — without meaningful human creative control — cannot receive copyright protection under US law. The ruling affirmed the US Copyright Office's earlier guidance: copyright requires human authorship. The plaintiff, Stephen Thaler, had sought to register an image created by an AI system he built and claimed to own, listing the AI as the author.

The ruling's practical implication: images generated by AI with minimal human creative input enter the public domain immediately. Images where a human designer makes meaningful creative choices — selecting, arranging, editing outputs — may retain copyright in those human contributions. The line is not fixed and is currently being litigated in multiple jurisdictions.

The Copyright Landscape in 2024

As of 2024, three legal questions remain unresolved and directly affect designers using generative tools professionally:

1. Outputs and Ownership: The US Copyright Office has registered AI-assisted works where humans made significant creative decisions about which outputs to select and how to arrange or modify them. Pure AI generation with no human curation cannot be registered. This means: document your creative decisions.

2. Training Data Lawsuits: Multiple class-action suits were filed against Stability AI, Midjourney, and DeviantArt in 2023 (Andersen v. Stability AI, Getty Images v. Stability AI). These cases argue that training on scraped internet data without consent infringes the rights of original artists. None were fully resolved as of late 2024 — outcomes will affect which tools remain viable for commercial use.

3. Style vs. Expression: Copyright does not protect style, only specific expression. Generating an image "in the style of" a living artist is not per se copyright infringement under current US law — but it may constitute trademark dilution, unfair competition, or violate platform terms of service. The ethical question and the legal question are not the same question.

Getty Images v. Stability AI

In 2023, Getty Images filed suit in both the US and UK against Stability AI, alleging mass infringement of over 12 million images. Getty's UK case proceeded to the High Court. This is the highest-profile pending case for designers using stock-adjacent tools: if Getty prevails, it would establish precedent that training on commercially licensed stock without permission is infringement — and potentially open liability for users of those tools' outputs.

Platform Terms of Service: What Designers Actually Agree To

Beyond copyright law, platform ToS governs commercial use rights. Key points designers miss:

Platform	Commercial Use	Ownership	Key Caveat
Midjourney	Paid plans: yes. Free plan: no commercial use	User retains ownership of outputs	Midjourney retains broad license to use your outputs for training and promotion
DALL·E 3 / OpenAI	Yes, per ToS	OpenAI assigns rights to user	Subject to usage policies; prohibited content list applies to commercial use
Adobe Firefly	Yes, commercially safe per Adobe indemnification policy	User retains ownership	Must use through official Adobe products to qualify for indemnification
Stable Diffusion	Varies by model license (CreativeML, RAIL-M, others)	Generally user owns outputs	Some fine-tuned models have restrictive licenses — always check the specific model card

Professional Ethical Standards

The legal questions are floors, not ceilings. Professional design practice involves ethical obligations that go beyond what's currently enforceable:

Disclosure to Clients: Major design associations including the AIGA have issued guidance recommending designers disclose AI tool use in the production process. Some clients contractually prohibit AI-generated content. Verify before delivery.

Artist Style Use: Using a living artist's distinctive style to generate commercial work without permission or compensation is legal but ethically contested in the design community. The practical test: would this displace income that would otherwise go to the artist whose style you've replicated?

Authentic Representation: Using AI-generated images of real people, places, or events in ways that imply factual documentation — without disclosure — crosses into disinformation. This is not a hypothetical: several 2024 advertising campaigns drew regulatory attention for undisclosed AI-generated "documentary" images.

Key Terms

Thaler v. Perlmutter2023 US case establishing that AI-generated works without meaningful human creative input cannot receive copyright protection.

RAIL LicenseResponsible AI License — a model license type used by some Stable Diffusion variants that restricts certain harmful uses while permitting commercial use. Always check the specific model card.

IndemnificationAdobe's policy under which they accept legal responsibility for IP claims made against Firefly outputs used commercially through official Adobe products.

Human Authorship RequirementThe US Copyright Office's standard requiring meaningful human creative decisions for copyright registration — the basis for the Thaler ruling.

Lesson 4 Quiz

Ethics, Copyright & Professional Practice — 3 questions

What did the 2023 ruling in Thaler v. Perlmutter establish about AI-generated images in the US?

Correct. Judge Beryl Howell affirmed the Copyright Office's position: copyright requires human authorship. AI-only generation produces work that enters the public domain. Human creative choices in selection and arrangement may still be protectable.

The Thaler ruling established that copyright requires human authorship — images generated entirely by AI without meaningful human creative input cannot be registered or protected under US copyright law.

A designer wants to use Stable Diffusion with a fine-tuned community model for a commercial client project. What must they check before proceeding?

Correct. Stable Diffusion's base model uses CreativeML, but community fine-tuned models often use different licenses — some of which restrict commercial use. Always check the specific model card on Hugging Face or the model's repository.

The critical check is the specific model's license. Community fine-tuned Stable Diffusion models often carry different licensing terms than the base model — some prohibit commercial use entirely. Check the model card.

Why does using a living artist's style to generate commercial imagery raise ethical concerns even when it may be legal?

Correct. Copyright does not protect style — only specific expression. But professional ethics extend beyond legality. The practical test is whether using that style generates commercial value that would otherwise compensate the artist whose aesthetic you have replicated.

Style is not copyright-protected, but professional ethics go beyond legal minimums. The question is whether generating commercial work in a living artist's style displaces income they would otherwise earn — a question of professional responsibility, not just legality.

Lab 4 — Ethics & Copyright Advisor

Work through real ethical and legal dilemmas in AI image use.

Your Task

Bring a real or hypothetical professional dilemma involving AI-generated images. The advisor will help you reason through the legal landscape, platform ToS implications, and professional ethics. Push edge cases, ask about disclosure obligations, and explore where the law and ethics diverge.

Try: "My client wants me to generate images 'in the style of' a well-known illustrator they admire for a marketing campaign. They say it's fine legally. How should I think about this?" — or bring your own scenario.

AI Ethics Advisor

Welcome to the Ethics and Copyright Lab. Describe your professional dilemma — client situation, tool involved, intended use — and I'll help you reason through the legal landscape, platform terms, and professional ethics. What's the situation?

Module 2 Test

Generative Image Tools for Designers — 15 questions — pass at 80%

1. Which generative image platform operates as open-source software, allowing full access to model weights and local execution?

Correct. Stable Diffusion is open-source with publicly available model weights, enabling local execution, fine-tuning, and LoRA adaptation.

Stable Diffusion is the open-source option — its weights are publicly available and it can be run locally without per-image fees.

2. What fundamental process do Midjourney, Stable Diffusion, DALL·E 3, and Firefly all share?

Correct. All four tools share diffusion model architecture at their core — trained by adding noise to images and learning to reverse the process.

All four tools use diffusion model architecture — the process of learning to reverse noise addition to generate images from text prompts.

3. The --chaos parameter in Midjourney controls:

Correct. --chaos (0–100) controls how varied the four grid options are from each other. High chaos for exploratory concept work; low chaos for tight iteration.

--chaos controls variation between grid outputs, not rendering style or colour. Higher values produce more diverse options; lower values produce more similar iterations.

4. Why did the Heinz "A.I. Ketchup" campaign consistently produce Heinz-style packaging images regardless of how the prompt was worded?

Correct. The model's training data so heavily associated "ketchup" with Heinz packaging that the statistical relationship overrode other prompt modifiers — a demonstration of how cultural dominance shapes model behaviour.

The training data explanation is correct: Heinz was so overwhelmingly represented in "ketchup" training imagery that the model treated the category term and the brand as nearly identical.

5. Which prompt failure pattern is illustrated when a designer writes "a modern, contemporary office" and receives inconsistent results across tools and sessions?

Correct. "Modern" and "contemporary" have no stable temporal reference for a model — they're interpreted relative to a training cutoff that varies by tool. Use decade or material-specific language instead.

This is Temporal Ambiguity. Vague temporal terms like "modern" produce inconsistent results because the model interprets them relative to its training data, which varies by tool and version.

6. A designer prompts for "a luxury Swiss watch advertisement with the brand name CHRONEX in bold gold letters." The text in the generated image is illegible and distorted. Why?

Correct. Typography hallucination is a fundamental limitation of all current diffusion models. Text should always be added in Photoshop, Illustrator, or InDesign post-generation.

Typography hallucination is a known limitation of all diffusion models. They cannot reliably render specific legible characters — all required text must be added post-generation in traditional design tools.

7. Generative Expand in Photoshop is most accurately described as:

Correct. Generative Expand is outpainting — it reads the edge pixels and extends the image beyond the original canvas with generated content that matches the existing scene.

Generative Expand is outpainting — it extends the canvas and fills the new area with generated content conditioned on the original image's edges. Inpainting fills regions within the existing image boundaries.

8. In a Stable Diffusion img2img workflow, what does ControlNet contribute that standard img2img alone does not?

Correct. ControlNet extracts structural maps (edges, depth, pose) from the input image and uses them to condition generation — so you can radically change style without losing the original composition.

ControlNet's key contribution is structural conditioning via maps (Canny edges, depth, human pose) — it allows style to change while composition is preserved, which standard img2img cannot reliably do.

9. Which is the most critical failure mode to check after using Generative Fill for background extension?

Correct. Lighting inconsistency is the most visible and common failure — shadow directions, colour temperature, and ambient light often don't match the original photography in generated extensions.

Lighting inconsistency is the key thing to check. Generated regions frequently have mismatched light direction or colour temperature relative to the original image — always inspect at 100% zoom.

10. What did Judge Beryl Howell's 2023 ruling in Thaler v. Perlmutter establish?

Correct. The ruling affirmed the Copyright Office's position: copyright requires human authorship, and purely AI-generated works enter the public domain. Human creative choices in curation and arrangement remain protectable.

Thaler v. Perlmutter established the human authorship requirement: AI-only generation cannot be copyrighted. Human creative choices in selecting and arranging outputs may still qualify for protection.

11. Which AI image platform offers commercial indemnification — meaning the platform accepts legal responsibility for IP claims on outputs used commercially?

Correct. Adobe's indemnification policy covers Firefly outputs generated through official Adobe products — they accept legal responsibility for IP claims. This must be verified for current policy terms and doesn't extend to third-party Firefly integrations.

Adobe Firefly is the correct answer. Adobe's indemnification policy means they accept legal liability for IP claims on outputs generated through official Adobe products — a unique commercial safety guarantee as of 2023–2024.

12. What is a LoRA in the context of Stable Diffusion?

Correct. LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning approach that adds small adapter layers to a base model, enabling brand-specific or style-specific outputs without the cost of full model retraining.

LoRA stands for Low-Rank Adaptation — it's a fine-tuning technique that adapts a base model to a specific style or subject using a relatively small dataset, without retraining the full model.

13. Getty Images filed suit against Stability AI in 2023. What was the core allegation?

Correct. Getty alleged Stability AI trained on their licensed image library without permission, constituting mass copyright infringement. The UK case proceeded to the High Court as of late 2024.

The core allegation was unauthorised training on Getty's licensed image library at scale — over 12 million images. This is the highest-profile pending training data infringement case in the industry.

14. A designer uses Midjourney on a paid subscription to generate images for a commercial client campaign. The client demands full copyright ownership of the outputs. What issue must the designer flag?

Correct. Two issues intersect: the Thaler ruling means purely AI-generated outputs lack copyright protection (so "full copyright ownership" may be legally meaningless), and Midjourney's ToS retains their own license to use outputs regardless of user assignment to clients.

The designer must flag both the Thaler copyright limitation (purely AI outputs may not be copyrightable) and Midjourney's retained license to use outputs — meaning the client cannot achieve full exclusive ownership in the conventional sense.

15. What is the practical design recommendation for producing legible brand name text in a Midjourney-generated advertisement image?

Correct. No current diffusion model reliably renders legible specific text. The professional workflow is to generate image content only and handle all typography in Photoshop, Illustrator, or a dedicated layout application.

The only reliable solution is to generate the image without text and add all typography post-generation in Photoshop or Illustrator. No current diffusion model reliably renders specific legible characters regardless of prompt technique.