When ChatGPT responds to “What's the best sushi restaurant in Miami?”, it produces a paragraph (or several) of natural language text. That text might mention your business, recommend a competitor, cite a review source, or say nothing useful at all. To turn that raw text into quantifiable data, every response goes through a structured extraction pipeline.

This article explains exactly what we extract from each AI response, how we measure mention quality on a continuous scale, and why we use a separate AI model for parsing rather than having the queried engine evaluate itself.

Brightwill source influence analysis showing extracted data from AI responses — Structured extraction turns raw AI text into actionable data - sources, citations, and influence scores across all platforms.

The Parsing Pipeline

Each of the 100+ responses in a comprehensive audit is parsed independently by GPT-4.1-mini. We chose this model specifically for extraction because it runs at temperature 0 (fully deterministic) in JSON mode, producing structured output that we validate against a strict schema.

The parser receives two inputs: the raw AI response text and your business name. It doesn't see other responses, previous results, or any context that might bias its extraction. Each response is evaluated in isolation.

Pipeline steps

1.AI engine generates natural language response to query

2.Raw response text sent to GPT-4.1-mini parser (temperature 0, JSON mode)

3.Parser extracts structured fields: mentions, sentiment, competitors, sources

4.Output validated against Zod schema - malformed responses are rejected and retried

5.Validated data stored as a QueryExecution record in the database

What We Extract From Every Response

The parser extracts 11 structured fields from each response. Together, these fields capture not just whether you were mentioned, but exactly how you were characterized, where you ranked, and what sources influenced the response.

Business Mentionedboolean

The fundamental yes/no: did the AI mention your business at all? This single field drives the recommendation probability metric.

Mention Typeenum

Three levels: primary_recommendation (you were the featured or first-named option), passing_mention (you were listed among others), or not_mentioned.

Mention Strength0.0 - 1.0

A continuous score capturing how prominently you were featured. More granular than mention type alone - a dedicated paragraph (0.7) is different from a brief list entry (0.3).

Rank Positioninteger

When AI lists multiple businesses, what position were you? First position in an AI-generated list gets disproportionate user attention.

Sentimentenum or null

Positive, neutral, or negative - how the AI characterized your business. Null when you weren't mentioned, because absence of mention is distinct from negative sentiment.

Sentiment Phrasesarray

Exact text excerpts from the response that convey sentiment, with source attribution showing which platform the AI drew from (e.g., 'via Yelp'). These appear in your report as evidence.

Competitors Mentionedarray

Every other business named in the response, with context and source attribution. Each competitor entry now tracks which platform cited them, powering accurate competitive intelligence.

Topics Associatedarray

What topics the AI links to your business - cuisine types, service categories, specialties. Each topic now includes source attribution when the AI cites a specific platform.

Sources Citedarray

Source name, type (review platform, directory, news, social media, official site), and URL when available. These are the citations shaping your visibility.

Factual Claimsarray

Specific facts the AI stated about your business - address, phone, hours, services, pricing - with source attribution showing where the AI found each fact (e.g., 'Google Maps', 'Yelp'). Reveals both what information AI has and where it comes from.

Category Inferredstring

What business category the AI thinks you belong to. Mismatches here indicate a positioning problem.

Mention Strength: A Continuous Signal

The mention type field (primary/passing/not mentioned) is useful but coarse. A business that gets a dedicated paragraph of praise is fundamentally different from one that appears as the seventh name in a list - even though both count as “mentioned.” The mention strength score (0.0 to 1.0) captures this nuance.

0.9 - 1.0

Primary Focus

AI dedicates significant discussion to your business. You're the featured recommendation or the main subject of the response.

0.6 - 0.8

Dedicated Detail

A full paragraph or detailed description. You're clearly highlighted, though not necessarily the sole focus.

0.3 - 0.5

Listed Among Options

You're one of several businesses named, with some context but not a deep dive. Common in "top 5" style responses.

0.1 - 0.2

Passing Reference

A brief, tangential mention without meaningful detail. You were named but not recommended.

Mention strength feeds directly into the “Mention Quality” component of your AI Visibility Score, which accounts for 15% of your total score. A business that's mentioned in every response but only as a passing reference will score lower than one that's featured prominently in fewer responses.

Source Citation Tracking

One of the most actionable things we extract is which sources AI engines cite when making recommendations. When ChatGPT says “According to Yelp reviews...” or “Based on their Google Business Profile...”, that tells you which information sources are shaping your AI visibility.

We classify sources into six types:

Review Platforms

Yelp, Google Reviews, TripAdvisor, Trustpilot

Directories

Google Business Profile, Apple Maps, BBB

News & Media

Local news, industry publications, press coverage

Social Media

Reddit, Instagram, X, Facebook

Official Sites

Your own website, blog, about pages

Other

Wikipedia, forums, niche databases

These citations are aggregated across all queries and all providers to produce the Source Influence analysis in your report. If Yelp is cited 15 times across your audit but your Yelp profile is thin, that's a clear optimization opportunity. If your website is never cited, it may mean AI crawlers can't access your content.

Source Attribution: Tracing Every Data Point

Beyond tracking which sources an AI response cites, we now trace individual data points back to their source. When the parser extracts a fact, a competitor mention, a sentiment phrase, or a topic association, it also extracts which platform or site the AI attributed that information to.

Example: When ChatGPT says “According to Google Maps, Joe's Sushi is located at 123 Main St and is open 11am-10pm. Yelp reviewers praise their omakase...”, the parser extracts:

address: “123 Main St” → via Google Maps
hours: “11am-10pm” → via Google Maps
sentiment: “praise their omakase” → via Yelp

This source attribution flows through the entire pipeline. In your report, the “What AI Knows About You” card shows not just what each engine said, but where it got the information:

ChatGPT (910) 791-0280 via Google Maps
Claude  Not mentioned
Gemini  (910) 791-0280 via Yelp

Source attribution also improves the action plan. Instead of guessing which platforms your competitors appear on, the system uses real data - if the AI cited a competitor “via TripAdvisor,” that competitor's TripAdvisor presence is a confirmed fact, not a heuristic assumption.

From Individual Responses to Your Score

The extracted data from all responses is aggregated into your GEO Score (0-100), a weighted composite of seven factors:

Mention Rate

How often AI mentions your business across organic queries

30%

Mention Quality

Weighted by mention strength - primary recommendations count more

15%

Sentiment

The positivity of how AI describes you when mentioned

10%

Topic Breadth

How many query types trigger a mention (max 8 types)

10%

Source Diversity

Distinct sources citing your business (max 6 sources)

Information Completeness

How many factual fields AI knows about you

10%

Technical AI Readiness

Website scan score - crawler access, schema, meta directives

20%

The Technical AI Readiness component (20%) comes from the website scan. If no website URL is provided, that 20% is redistributed proportionally across the other six components. For the full breakdown of how the score works, see AI Visibility Score explained.

Why We Use a Separate Parser

A natural question: why not ask ChatGPT to evaluate its own responses? The answer is objectivity. The AI that generates the recommendation is not the same AI that analyzes it.

We query three different AI engines (ChatGPT, Claude, Gemini) with real questions, collecting their natural responses. Then a separate model (GPT-4.1-mini, optimized for extraction) parses every response using the same consistent criteria. This avoids self-evaluation bias - a model might be generous in assessing its own outputs. By keeping generation and evaluation separate, we get a more honest picture.

The parser also runs at temperature 0, meaning the same response text always produces the same extracted data. There's no randomness in the analysis step. If you ran the same audit twice with the same AI responses, you'd get identical structured data.

See It in Action

Every data point in your Brightwill report traces back to a real AI conversation that went through this extraction pipeline. The sentiment phrases, competitor names, source citations, and mention types in your report are all extracted directly from what the AI actually said - not estimated or inferred.

Run a free audit to see how ChatGPT responds to 5 real queries about your business. To see how the extracted data becomes a prioritized optimization plan, read how we build your action plan.

How We Analyze Every AI Response: Our Multi-Layer Extraction Pipeline

The Parsing Pipeline

What We Extract From Every Response

Mention Strength: A Continuous Signal

Source Citation Tracking

Source Attribution: Tracing Every Data Point

From Individual Responses to Your Score

Why We Use a Separate Parser

See It in Action

Continue Reading

Inside Our Query Engine: How We Test 100+ AI Conversations About Your Business

AI Visibility Score: How We Measure If AI Recommends Your Business

From Data to Action: How We Build Your Personalized GEO Action Plan