Hallucinated URLs: When AI Invents Pages on Your Website

What Are Hallucinated URLs?

A hallucinated URL is a web address fabricated entirely by a language model (LLM) when generating a response. The model, unable to verify whether web pages actually exist, constructs URLs that appear plausible but do not exist on your site. This phenomenon, known as AI hallucination, affects all major language models without exception.

In practice, when a user asks ChatGPT, Claude, or Perplexity for information about your brand, the model may invent URLs like yoursite.com/services/strategic-consulting or yoursite.com/enterprise-pricing even though these pages simply do not exist. Users who click these links land on a 404 error, damaging your brand image and representing a missed opportunity.

Real-World Impact on Brands

The consequences of hallucinated URLs go far beyond a simple technical error:

  • 404 errors from AI traffic: users arrive at non-existent pages, creating a negative experience and immediate trust loss. Our data shows that 18% of AI referral traffic leads to 404 pages on average.
  • Reputation damage: when an LLM cites an invented URL, users may perceive your site as poorly structured, outdated, or unreliable. This can deter high-intent prospects.
  • Missed opportunities: each hallucinated URL represents a real user need your site fails to meet. If an LLM invents /enterprise-pricing, it is likely because users are searching for that information.
  • Impact on native score: AI bots following these links and encountering 404s negatively adjust their evaluation of your site.

How Different Models Hallucinate

Each language model has its own URL hallucination patterns:

GPT-4 and ChatGPT

OpenAI's models tend to generate logically structured URLs, often replicating your site's navigation patterns. They frequently invent sub-pages for services or products that do not exist but would be consistent with your business. Observed hallucination rate: approximately 12% of cited URLs.

Claude (Anthropic)

Claude is generally more cautious with URL generation, often preferring to link to the homepage or generic pages. When it hallucinates, it tends to invent blog-type or resource URLs. Observed hallucination rate: approximately 8% of cited URLs.

Gemini (Google)

Gemini benefits from access to Google's index, which reduces its URL hallucinations. However, it may cite indexed but since-deleted pages, or obsolete URL versions. Observed hallucination rate: approximately 6%.

Perplexity

Perplexity performs real-time searches, considerably reducing URL hallucinations. Nevertheless, it may confuse URLs between similar sites or cite partially correct URLs. Observed hallucination rate: approximately 4%.

How AI Labs Audit Detects Hallucinated URLs

Hallucinated URL detection is a key platform feature, integrated directly into the audit process:

Per-Model Tracking

During each audit, the platform queries major LLMs with structured prompts. Each response is analyzed to extract all mentioned URLs. These URLs are then verified against your sitemap and crawled to confirm their actual existence.

Response Analysis

The system does not merely verify URL existence. It also analyzes the context in which they are cited: what question triggered the hallucination? What type of page was invented? Is there a recurring pattern? This analysis reveals unmet user expectations.

Extraction and Categorization

Hallucinated URLs are automatically categorized: service pages, product pages, blog articles, pricing pages, contact pages, etc. This categorization helps prioritize corrective actions.

Dashboard Visualization

The AI tracking displays hallucinated URLs in a dedicated dashboard. You will find:

  • A complete list of detected hallucinated URLs, sorted by frequency
  • The model(s) that generated each URL
  • The context (prompt or question) that triggered the hallucination
  • Time evolution (new hallucinations, recurring hallucinations)
  • Processing status (untreated, redirect in place, page created)

What to Do with Hallucinated URLs

Once detected, several strategies are available:

Implementing 301 Redirects

The fastest solution: redirect the most frequent hallucinated URLs to the most relevant existing pages. If an LLM invents /services/complete-audit, redirect it to your actual services page. This immediately transforms lost traffic into useful traffic.

Creating Missing Pages

If a hallucinated URL recurs regularly, it signals genuine demand for that content. Create the corresponding page with rich, GEO-optimized content. This approach transforms hallucinations into content opportunities.

Monitoring Patterns

Analyze hallucination trends to anticipate needs. If LLMs systematically invent comparison pages, FAQs, or pricing pages, this is a strong signal for content creation. Integrate these insights into your editorial strategy.

Prevention Strategies

Beyond reactive correction, several proactive strategies reduce hallucination rates:

Strengthen Real URL Structure

Clear, logical, descriptive URLs are less likely to be hallucinated because LLMs memorize them better. Adopt a coherent and predictable URL architecture.

Optimize the Sitemap

A complete, up-to-date XML sitemap helps LLMs learn your site's real structure. Ensure all important pages are included with rich metadata.

Schema.org and Structured Data

Structured data provides LLMs with an accurate map of your site. The richer your structured data (SiteNavigationElement, WebPage, BreadcrumbList), the more reliable information models have about your real URLs.

llms.txt File

The llms.txt can include a section listing your main URLs, explicitly guiding LLMs toward your real pages. See our complete llms.txt guide for more.

Hallucination Rate Statistics

Based on data collected on our platform across hundreds of sites:

  • 74% of sites have at least one hallucinated URL detected during the first audit
  • 3.2 URLs hallucinated on average per brand per model
  • E-commerce sites are most affected (5.7 hallucinated URLs on average) due to catalog complexity
  • After correction (redirects + pages created), the hallucination rate drops by 61% on average over 3 months
  • Recovered traffic via redirects represents on average 8% of total AI referral traffic

Case Study Examples

A management consulting firm discovered that ChatGPT was systematically inventing a page at /expertise/digital-transformation. By creating this page with rich content and a dedicated showcase page, the firm recovered over 200 qualified monthly visits within 8 weeks.
A B2B SaaS brand identified via complete AI tracking that 4 different models were hallucinating a competitor comparison page. Creating this objective comparison page generated a 23% boost in overall mention rate.

Integration with the Audit Ecosystem

Hallucinated URL detection integrates into a global GEO strategy. Combine it with the 26-point GEO checklist for complete coverage, and use scheduled audits to monitor hallucination evolution over time. The competitive analysis also allows you to compare your hallucination rate against competitors.

Measure your AI visibility now

Discover how ChatGPT, Claude, Gemini and Perplexity talk about your brand. Free audit across 300+ AI models.

Start my free audit
About the author

Davy Abderrahman

Founder & CEO at

Specialist in AI visibility (AEO/GEO/LLMO), I help agencies and consultants measure and optimize their clients' presence on ChatGPT, Claude, Gemini, Perplexity and other AI answer engines. Pioneer in AI visibility auditing since 2024.

AEO GEO LLMO AI Visibility AI Audits

Was this article helpful?

- (0 votes)