Back to Blog

How We Analyze Every AI Response: Our Multi-Layer Extraction Pipeline

Brightwill Team·2026-03-22

When ChatGPT responds to “What's the best sushi restaurant in Miami?”, it produces a paragraph (or several) of natural language text. That text might mention your business, recommend a competitor, cite a review source, or say nothing useful at all. To turn that raw text into quantifiable data, every response goes through a structured extraction pipeline.

This article explains exactly what we extract from each AI response, how we measure mention quality on a continuous scale, and why we use a separate AI model for parsing rather than having the queried engine evaluate itself.

brightwill.ai/report
Brightwill source influence analysis showing extracted data from AI responses
Structured extraction turns raw AI text into actionable data - sources, citations, and influence scores across all platforms.

The Parsing Pipeline

Each of the 100+ responses in a comprehensive audit is parsed independently by GPT-4.1-mini. We chose this model specifically for extraction because it runs at temperature 0 (fully deterministic) in JSON mode, producing structured output that we validate against a strict schema.

The parser receives two inputs: the raw AI response text and your business name. It doesn't see other responses, previous results, or any context that might bias its extraction. Each response is evaluated in isolation.

Pipeline steps
1.AI engine generates natural language response to query
2.Raw response text sent to GPT-4.1-mini parser (temperature 0, JSON mode)
3.Parser extracts structured fields: mentions, sentiment, competitors, sources
4.Output validated against Zod schema - malformed responses are rejected and retried
5.Validated data stored as a QueryExecution record in the database

What We Extract From Every Response

The parser extracts 11 structured fields from each response. Together, these fields capture not just whether you were mentioned, but exactly how you were characterized, where you ranked, and what sources influenced the response.

Business Mentionedboolean
The fundamental yes/no: did the AI mention your business at all? This single field drives the recommendation probability metric.
Mention Typeenum
Three levels: primary_recommendation (you were the featured or first-named option), passing_mention (you were listed among others), or not_mentioned.
Mention Strength0.0 - 1.0
A continuous score capturing how prominently you were featured. More granular than mention type alone - a dedicated paragraph (0.7) is different from a brief list entry (0.3).
Rank Positioninteger
When AI lists multiple businesses, what position were you? First position in an AI-generated list gets disproportionate user attention.
Sentimentenum or null
Positive, neutral, or negative - how the AI characterized your business. Null when you weren't mentioned, because absence of mention is distinct from negative sentiment.
Sentiment Phrasesarray
Exact text excerpts from the response that convey sentiment, with source attribution showing which platform the AI drew from (e.g., 'via Yelp'). These appear in your report as evidence.
Competitors Mentionedarray
Every other business named in the response, with context and source attribution. Each competitor entry now tracks which platform cited them, powering accurate competitive intelligence.
Topics Associatedarray
What topics the AI links to your business - cuisine types, service categories, specialties. Each topic now includes source attribution when the AI cites a specific platform.
Sources Citedarray
Source name, type (review platform, directory, news, social media, official site), and URL when available. These are the citations shaping your visibility.
Factual Claimsarray
Specific facts the AI stated about your business - address, phone, hours, services, pricing - with source attribution showing where the AI found each fact (e.g., 'Google Maps', 'Yelp'). Reveals both what information AI has and where it comes from.
Category Inferredstring
What business category the AI thinks you belong to. Mismatches here indicate a positioning problem.

Mention Strength: A Continuous Signal

The mention type field (primary/passing/not mentioned) is useful but coarse. A business that gets a dedicated paragraph of praise is fundamentally different from one that appears as the seventh name in a list - even though both count as “mentioned.” The mention strength score (0.0 to 1.0) captures this nuance.

0.9 - 1.0
Primary Focus
AI dedicates significant discussion to your business. You're the featured recommendation or the main subject of the response.
0.6 - 0.8
Dedicated Detail
A full paragraph or detailed description. You're clearly highlighted, though not necessarily the sole focus.
0.3 - 0.5
Listed Among Options
You're one of several businesses named, with some context but not a deep dive. Common in "top 5" style responses.
0.1 - 0.2
Passing Reference
A brief, tangential mention without meaningful detail. You were named but not recommended.

Mention strength feeds directly into the “Mention Quality” component of your AI Visibility Score, which accounts for 15% of your total score. A business that's mentioned in every response but only as a passing reference will score lower than one that's featured prominently in fewer responses.

Source Citation Tracking

One of the most actionable things we extract is which sources AI engines cite when making recommendations. When ChatGPT says “According to Yelp reviews...” or “Based on their Google Business Profile...”, that tells you which information sources are shaping your AI visibility.

We classify sources into six types:

Review Platforms
Yelp, Google Reviews, TripAdvisor, Trustpilot
Directories
Google Business Profile, Apple Maps, BBB
News & Media
Local news, industry publications, press coverage
Social Media
Reddit, Instagram, X, Facebook
Official Sites
Your own website, blog, about pages
Other
Wikipedia, forums, niche databases

These citations are aggregated across all queries and all providers to produce the Source Influence analysis in your report. If Yelp is cited 15 times across your audit but your Yelp profile is thin, that's a clear optimization opportunity. If your website is never cited, it may mean AI crawlers can't access your content.

Source Attribution: Tracing Every Data Point

Beyond tracking which sources an AI response cites, we now trace individual data points back to their source. When the parser extracts a fact, a competitor mention, a sentiment phrase, or a topic association, it also extracts which platform or site the AI attributed that information to.

Example: When ChatGPT says “According to Google Maps, Joe's Sushi is located at 123 Main St and is open 11am-10pm. Yelp reviewers praise their omakase...”, the parser extracts:
address: “123 Main St” → via Google Maps
hours: “11am-10pm” → via Google Maps
sentiment: “praise their omakase” → via Yelp

This source attribution flows through the entire pipeline. In your report, the “What AI Knows About You” card shows not just what each engine said, but where it got the information:

ChatGPT (910) 791-0280 via Google Maps
Claude  Not mentioned
Gemini  (910) 791-0280 via Yelp

Source attribution also improves the action plan. Instead of guessing which platforms your competitors appear on, the system uses real data - if the AI cited a competitor “via TripAdvisor,” that competitor's TripAdvisor presence is a confirmed fact, not a heuristic assumption.

From Individual Responses to Your Score

The extracted data from all responses is aggregated into your GEO Score (0-100), a weighted composite of seven factors:

Mention Rate
How often AI mentions your business across organic queries
30%
Mention Quality
Weighted by mention strength - primary recommendations count more
15%
Sentiment
The positivity of how AI describes you when mentioned
10%
Topic Breadth
How many query types trigger a mention (max 8 types)
10%
Source Diversity
Distinct sources citing your business (max 6 sources)
5%
Information Completeness
How many factual fields AI knows about you
10%
Technical AI Readiness
Website scan score - crawler access, schema, meta directives
20%

The Technical AI Readiness component (20%) comes from the website scan. If no website URL is provided, that 20% is redistributed proportionally across the other six components. For the full breakdown of how the score works, see AI Visibility Score explained.

Why We Use a Separate Parser

A natural question: why not ask ChatGPT to evaluate its own responses? The answer is objectivity. The AI that generates the recommendation is not the same AI that analyzes it.

We query three different AI engines (ChatGPT, Claude, Gemini) with real questions, collecting their natural responses. Then a separate model (GPT-4.1-mini, optimized for extraction) parses every response using the same consistent criteria. This avoids self-evaluation bias - a model might be generous in assessing its own outputs. By keeping generation and evaluation separate, we get a more honest picture.

The parser also runs at temperature 0, meaning the same response text always produces the same extracted data. There's no randomness in the analysis step. If you ran the same audit twice with the same AI responses, you'd get identical structured data.

See It in Action

Every data point in your Brightwill report traces back to a real AI conversation that went through this extraction pipeline. The sentiment phrases, competitor names, source citations, and mention types in your report are all extracted directly from what the AI actually said - not estimated or inferred.

Run a free audit to see how ChatGPT responds to 5 real queries about your business. To see how the extracted data becomes a prioritized optimization plan, read how we build your action plan.

Continue Reading

Methodology10 min read

Inside Our Query Engine: How We Test 100+ AI Conversations About Your Business

How we generate and test 100+ tailored AI conversations per audit — from business profiling to 10 query types across 16 business categories.

Read more
Methodology6 min read

AI Visibility Score: How We Measure If AI Recommends Your Business

A transparent look at our methodology — how we track visibility, position, and sentiment across 150+ real AI conversations to calculate your AI Visibility Score (0-100), using organic-only scoring and source attribution.

Read more
Methodology9 min read

From Data to Action: How We Build Your Personalized GEO Action Plan

How we turn audit data into 15-30 specific optimization actions — from pre-computed gap analysis to opportunity scoring and the earned vs owned media split.

Read more