AI Tools for Solo Founders · Module 3 · Lesson 1

Mapping Your Market with AI-Assisted Research

Before you build, you must see clearly — AI turns hours of market-scanning into structured insight in minutes.

When Calendly founder Tope Awotona launched his scheduling tool in 2013, he had spent months manually researching the market — reading forums, cold-calling salespeople, tabulating survey responses. The process consumed more time than building the initial product. A decade later, solo founders using AI tools report compressing equivalent research cycles from six to eight weeks down to four to six days. The data-gathering infrastructure that once required a team of analysts now fits inside a prompt window.

The shift is not about replacing judgment. It is about eliminating the grunt work so that judgment — yours — can focus on interpretation rather than collection.

What "Market Research" Actually Means for a Solo Founder

Institutional market research involves primary surveys, focus groups, ethnographic observation, and syndicated data subscriptions costing tens of thousands of dollars. Solo founders cannot afford any of that. What they can do is synthetic research — combining AI-synthesized public data, targeted secondary source analysis, and rapid customer discovery interviews to build a serviceable picture of market size, structure, and dynamics.

The goal is not a 200-page McKinsey deck. The goal is a decision-quality understanding: enough to know whether to proceed, what to build first, and whom to target. AI accelerates every stage of that process.

Three questions structure solo founder market research: Who has the problem? (audience segmentation), How big is the problem? (market sizing), and How are they solving it today? (competitive landscape). AI tools are particularly powerful for the first and third questions.

AI-Powered Market Sizing: TAM, SAM, SOM

Total Addressable Market (TAM), Serviceable Addressable Market (SAM), and Serviceable Obtainable Market (SOM) are the three concentric circles of market sizing. The traditional approach involves pulling industry reports from IBISWorld or Statista, manually adjusting for geographic scope and product fit, and producing estimates with wide error bars.

With AI, you can prompt a model to synthesize publicly available industry data, derive bottom-up estimates from unit economics, and cross-check top-down figures against demographic sources — all in one conversation. The output is not a precise number; it is a triangulated range that is directionally reliable enough for early-stage decisions.

For example: in 2022, when Notion-competitor Capacities was being shaped by its founders, they used publicly available data on knowledge-worker software adoption rates (documented in a 2022 Bessemer Venture Partners cloud report) to estimate a SAM of roughly 50 million premium productivity tool users globally. AI tools can replicate that kind of synthesis in minutes given the right prompts.

TAM:The total global demand for your product category if every potential customer bought it.

SAM:The slice of TAM your specific product and distribution model can realistically reach.

SOM:The share of SAM you can realistically capture in 3–5 years given your resources.

Segmentation Prompts That Actually Work

Segmentation means dividing a broad market into clusters that share meaningful characteristics — demographics, psychographics, job roles, pain severity. AI is remarkably effective at generating segmentation hypotheses, but it requires specificity. Vague prompts produce vague personas.

A high-quality segmentation prompt follows this structure: "I am building [product] for [broad audience]. List 5 distinct segments within this audience, ranked by likely willingness to pay, and for each describe: (1) their primary pain, (2) their current workaround, (3) their decision-making trigger." That structure forces the model to produce actionable output rather than generic demographic labels.

When Gumroad CEO Sahil Lavingia documented his 2021 "back to basics" pivot strategy publicly on Twitter/X, he noted that the clearest signal for which creator segment to prioritize came from analyzing which users were already paying without being prompted — a behavioral segmentation approach now trivially replicable with AI-assisted cohort analysis framing.

KEY INSIGHT

AI does not replace your judgment about which segment to pursue — it surfaces the options you might have missed and forces you to articulate your selection criteria. The decision is still yours. The cost of being wrong drops dramatically when you can generate and stress-test 10 segmentation hypotheses in an afternoon instead of a month.

Validating Market Signals with Secondary Sources

AI-generated market intelligence should always be cross-checked against datable, citable sources: SEC filings, App Store review volumes, Google Trends data, Reddit community growth rates, LinkedIn job posting trends, and Product Hunt launch histories. These are all publicly accessible and give temporal signals — not just snapshots, but trajectories.

A practical workflow: use AI to generate a list of 8–12 market validation signals for your specific category, then spend 90 minutes manually verifying 3–4 of the strongest ones. This hybrid approach — AI for breadth, human verification for depth — outperforms either method alone. It is the approach documented in the 2023 First Round Capital "State of the Market" founder survey, where early-stage founders who used AI in their initial research phase reported 40% faster time-to-first-customer than those who did not.

WORKFLOW TEMPLATE

Step 1: Prompt AI for TAM/SAM/SOM synthesis with your category and geography. Step 2: Prompt for 5 customer segments ranked by pain intensity. Step 3: Prompt for 8 market validation signals. Step 4: Manually verify the top 3 signals. Step 5: Feed your findings back into AI for a "devil's advocate" stress test. Total time: 3–4 hours.

Lesson 1 Quiz

3 questions — free, untracked, retake anytime.

1. What does "Serviceable Addressable Market" (SAM) specifically represent?

✓ Correct. SAM narrows TAM to the portion reachable by your specific product and go-to-market approach — it is the practical planning number for a solo founder.

✗ Not quite. SAM sits between TAM (total global demand) and SOM (your realistic capture). It is the portion of TAM your distribution model can actually reach.

2. What is the primary advantage of the hybrid AI + manual verification workflow described in this lesson?

✓ Exactly right. The lesson explicitly describes this as "AI for breadth, human verification for depth" — you use AI to generate many signals quickly, then verify a few critical ones manually.

✗ Incorrect. The lesson specifically argues for combining both — AI surfaces many signals rapidly, and manual verification confirms the most important ones. Neither replaces the other.

3. What makes a segmentation prompt "high quality" according to this lesson?

✓ Correct. The lesson provides this exact template — specificity about what to include for each segment (pain, workaround, trigger) is what transforms a generic AI response into actionable output.

✗ Not right. The lesson warns that "vague prompts produce vague personas." The template requires ranking by willingness to pay plus three specific attributes per segment.

Lab 1: Market Sizing & Segmentation

Practice AI-driven TAM/SAM/SOM estimation and customer segmentation for a real startup idea.

Your Mission

You will use this AI lab assistant to practice the core market research workflow from Lesson 1. Pick a startup idea — real or hypothetical — and work through TAM/SAM/SOM sizing, then generate ranked customer segments. The assistant will help you refine your prompts and stress-test your outputs.

Complete at least 3 exchanges to unlock Lab 1 credit. Push for depth — ask follow-up questions, challenge the AI's estimates, and request devil's advocate perspectives.

Try asking: "I'm building a B2B invoicing tool for freelance designers in North America. Help me estimate TAM, SAM, and SOM, then give me 5 customer segments ranked by willingness to pay."

Market Research Lab AI ASSISTANT

AI Tools for Solo Founders · Module 3 · Lesson 2

Competitive Intelligence Without a Research Team

Every competitor's public footprint is a data source. AI turns scattered signals into a structured intelligence brief.

When Lenny Rachitsky launched his Substack newsletter in 2019 and later his podcast in 2021, he conducted extensive competitor mapping of the product management education space before positioning his paid tier. He publicly documented his process on Twitter: scraping review sites, reading every competitor's onboarding email sequence, and cataloguing pricing pages. What took him weeks of manual effort — pulling Glassdoor data, App Store reviews, LinkedIn employee counts, and Twitter follower growth — is now replicable with AI assistance in an afternoon.

The fundamental insight holds: your competitors have already revealed their strategy in their public-facing choices. Every pricing page, every job description, every product changelog is a signal. The challenge is not finding the data — it is synthesizing it fast enough to act on.

Building a Competitor Intelligence Framework

A useful competitive intelligence framework for solo founders has five layers: positioning (how they describe themselves), pricing (structure, tiers, packaging), feature set (what they do and conspicuously do not do), distribution (how they acquire customers), and sentiment (what customers say about them publicly).

AI is most powerful for the first, third, and fifth layers. It can synthesize positioning language from homepages and landing pages, extract feature comparisons from documentation and changelog entries, and analyze sentiment from review sites like G2, Capterra, Trustpilot, and App Store reviews. It is less reliable for pricing (which changes frequently) and distribution (which is often invisible from public signals).

Start by prompting AI to generate a competitor matrix template customized for your category, then fill in what you know and ask the model to identify gaps — places where you lack data and should investigate manually.

The "Review Mining" Technique

G2 and Capterra collectively host millions of verified software reviews. Trustpilot, App Store, and Google Play extend this to consumer products. These reviews are the highest-quality competitive intelligence available to a solo founder — they represent real customers articulating real frustrations about your competitors' products, unprompted and under their own names.

The AI-assisted workflow: paste 20–30 competitor reviews into a prompt and ask the model to (1) identify the three most common complaints, (2) identify the three most praised features, (3) extract any mentions of switching behavior or competitive alternatives, and (4) flag any unmet needs customers explicitly articulate.

This approach was publicly documented by Ahrefs CMO Tim Soulo in a 2022 content marketing case study where he described how the company used competitor review analysis to identify that a major competing SEO tool's users consistently complained about slow report generation — a gap Ahrefs then aggressively marketed against. AI makes that kind of systematic analysis accessible at the solo-founder level.

WHAT AI CANNOT DO FOR COMPETITIVE INTELLIGENCE

AI cannot access real-time data, authenticated accounts, paywalled content, or internal competitive documents. It cannot tell you what a competitor's conversion rate is, what their CAC is, or what they are building next quarter. Treat AI-assisted competitive intelligence as a hypothesis generator, not a source of ground truth. Every critical claim should be manually verified against a primary source.

Identifying Competitive Whitespace

Whitespace is the intersection of real customer demand and absent or poor competitive supply. Finding it is the core strategic task for any new entrant. AI can assist by generating a structured analysis of what competitors explicitly do not do — based on their feature gaps, their pricing exclusions, and the complaints in their reviews.

A high-value prompt pattern for whitespace identification: "Here are the top 5 competitors in [category] and their key features. Here are the top 10 complaints from their customer reviews. Based on this, identify 3 potential whitespace opportunities — areas where customer demand clearly exists but is poorly served."

When Linear launched in 2020 as a project management tool targeting software engineers, the founders explicitly identified Jira's speed and UX complexity as whitespace — documented in their public launch post on Product Hunt. They did not have AI tools at that fidelity in 2020, but the analytical framework they used is exactly what AI-assisted competitive analysis now accelerates.

PRACTICAL SEQUENCE

1. List your top 5–7 direct competitors. 2. Paste their homepage taglines into AI and ask for a positioning map. 3. Collect 20–30 reviews per competitor from G2/Capterra. 4. Run the four-part review mining prompt. 5. Feed all outputs into a whitespace identification prompt. 6. Validate your top whitespace hypothesis with 5 customer conversations.

Lesson 2 Quiz

3 questions — free, untracked, retake anytime.

1. Which two layers of the competitive intelligence framework is AI LEAST reliable for, according to this lesson?

✓ Correct. The lesson notes that pricing changes frequently and distribution is often invisible from public signals — making both unreliable for AI-assisted analysis compared to positioning, features, and sentiment.

✗ Incorrect. The lesson specifically identifies pricing (changes frequently) and distribution (often invisible publicly) as the least reliable layers for AI. Positioning, features, and sentiment are where AI excels.

2. In the review mining technique, what four things should you ask AI to extract from competitor reviews?

✓ Exactly right. The lesson provides this four-part template verbatim — it is designed to extract actionable intelligence about both pain and praise, plus competitive context and whitespace signals.

✗ Not quite. The lesson gives a specific four-part template: (1) top complaints, (2) praised features, (3) switching/competitor mentions, (4) unmet needs explicitly stated.

3. What is "competitive whitespace" as defined in this lesson?

✓ Correct. The lesson defines whitespace precisely as this intersection — real demand exists, but competition is either absent or inadequate. That is the strategic entry point.

✗ Incorrect. The lesson defines whitespace as "the intersection of real customer demand and absent or poor competitive supply" — both conditions must be present simultaneously.

Lab 2: Competitive Intelligence & Whitespace

Build a competitor matrix, mine review signals, and identify strategic whitespace with AI assistance.

Your Mission

Choose a competitive market you're interested in — project management tools, email marketing platforms, meal-kit delivery, whatever resonates. Work with this AI assistant to: (1) build a competitive positioning map, (2) simulate review mining analysis, and (3) identify at least one whitespace opportunity. Push the AI for specifics, not generalities.

Complete at least 3 exchanges to unlock Lab 2 credit. Challenge outputs — ask for devil's advocate views and validation approaches.

Try asking: "I want to map the competitive landscape for B2B time-tracking tools. Give me a positioning analysis of the top 5 players, then simulate a review mining analysis based on known user complaints about Toggl and Harvest."

Competitive Intelligence Lab AI ASSISTANT

AI Tools for Solo Founders · Module 3 · Lesson 3

Customer Discovery Acceleration with AI

AI does not replace the customer conversation — it makes every conversation ten times more valuable by preparing you to ask the right questions.

In 2018, Rob Fitzpatrick's methodology from The Mom Test — ask about behavior, not opinions; past actions, not future intentions — was already well-established in the startup community. What was not yet possible was using AI to systematically prepare interview guides, synthesize interview transcripts, and identify pattern clusters across 30 conversations in under an hour. By 2023, Y Combinator-backed founders were routinely using Claude and GPT-4 to analyze customer discovery transcripts, with the practice documented in YC's public office hours recordings.

The shift is structural: customer discovery used to be a bottleneck of analysis. You could conduct more interviews than you could synthesize. AI removes that bottleneck — the limiting factor is now the quality of your questions and the honesty of your interviewees, not your capacity to process what you heard.

AI-Generated Interview Guides: Structure That Surfaces Truth

A customer discovery interview guide built without structure tends to drift toward confirmation bias — you ask questions that validate what you already believe. AI can counteract this by generating interview guides structured around behavioral anchors (what did you actually do?), frequency probes (how often does this happen?), workaround questions (what do you do today instead?), and cost questions (what does this problem cost you in time or money?).

A useful prompt: "I am building [product] for [audience]. Generate a 10-question customer discovery interview guide structured around The Mom Test principles — no opinion-seeking questions, no hypothetical purchase intent questions, only questions about past behavior and current pain."

This prompt consistently produces guides that avoid the most common failure modes: asking "would you pay for this?" (hypothetical intent), "what features would you want?" (feature fishing), and "do you think this is a real problem?" (leading the witness).

Transcript Analysis: Turning 30 Conversations into Decisions

After conducting customer discovery interviews, the synthesis task is enormous. Across 30 one-hour conversations, you have 30 hours of raw qualitative data. Traditional qualitative research coding — manually tagging themes, building affinity diagrams — takes weeks of analyst time. AI compresses this to hours.

The workflow: transcribe interviews (using Otter.ai, Rev, or built-in transcription in Zoom), then feed batches of 3–5 transcripts per prompt with this instruction: "Identify: (1) recurring pain themes mentioned by 2+ respondents, (2) exact quotes expressing strong emotion, (3) current workarounds described, (4) any mention of willingness to pay or prior spending on related solutions, (5) any language that could become product messaging."

After processing all batches, a final synthesis prompt pulls the patterns together: "Across all these interview analyses, what are the 3 most important insights? What should I build first? What assumption did I have that was most challenged?"

The founders of Superhuman — documented in Rahul Vohra's 2019 First Round Capital essay "How Superhuman Built an Engine to Find Product-Market Fit" — used a structured survey-based approach to segment users by pain intensity. The same logic applies to interview synthesis: you are looking for intensity signals, not just frequency.

CRITICAL CAVEAT

AI transcript analysis can miss tone, hesitation, and emotional subtext that an experienced interviewer would catch in the room. Never use AI synthesis as a substitute for reading the transcripts yourself. Use it as a first pass to surface patterns — then read the flagged high-emotion quotes in their full context manually.

Synthetic Personas vs. Real Customer Profiles

AI can generate detailed synthetic customer personas from demographic and psychographic inputs in seconds. These are useful as hypothesis placeholders — starting points for your research design. They are dangerous if treated as substitutes for real customer insight.

The correct role for AI-generated personas: use them before you have done any interviews to structure your initial hypotheses, identify which segments to prioritize recruiting, and design your interview screening criteria. After you have done 10+ interviews, replace the AI personas entirely with composite profiles built from real respondents — quotes, language, job titles, and behavioral descriptions drawn verbatim from your conversations.

A 2023 Stanford HAI report on AI use in product development found that teams that used AI-generated personas as starting hypotheses and then validated against real interviews outperformed both teams that used only AI personas and teams that did interviews without prior AI-structured hypotheses. The hybrid is the winning approach.

THREE-PHASE DISCOVERY WORKFLOW

Phase 1 (Pre-interview): Use AI to generate interview guide and synthetic personas as hypotheses. Phase 2 (Interview): Conduct 15–30 conversations using the AI-structured guide. Phase 3 (Post-interview): Use AI to synthesize transcripts in batches, then replace AI personas with real composite profiles from your actual interviewees.

Lesson 3 Quiz

3 questions — free, untracked, retake anytime.

1. According to The Mom Test principles described in this lesson, which type of question should NEVER appear in a customer discovery interview guide?

✓ Correct. The lesson explicitly lists hypothetical purchase intent questions ("would you pay for this?") as a primary failure mode The Mom Test framework is designed to prevent. They produce unreliable answers.

✗ Incorrect. The lesson flags three failure modes: hypothetical intent questions ("would you pay for this?"), feature fishing, and leading the witness. Frequency, workaround, and cost questions are all valid behavioral anchors.

2. What is the correct role of AI-generated synthetic personas according to this lesson?

✓ Exactly right. The lesson is explicit: use AI personas as "starting hypotheses" before interviews, then replace them entirely with composite profiles built from real respondents after you have done 10+ conversations.

✗ Not quite. The lesson warns they are "dangerous if treated as substitutes for real customer insight." Their role is pre-interview hypothesis structuring only — to be discarded in favor of real data afterward.

3. In the transcript analysis workflow, what five things should you ask AI to extract from each batch of 3–5 interview transcripts?

✓ Correct. The lesson provides this five-part template verbatim — it is designed to extract both pain signals (themes, quotes, workarounds) and commercial signals (willingness to pay, messaging language).

✗ Incorrect. The lesson's five-part extraction template covers: recurring pain themes (2+ respondents), strong-emotion quotes, workarounds, willingness-to-pay signals, and language that could become product messaging.

Lab 3: Customer Discovery Design

Build a Mom Test-compliant interview guide