Which sources do AI models actually cite?

Track Reddit, Wikipedia, G2, LinkedIn, and 180+ domains across ChatGPT, Claude, Perplexity, Gemini, Grok, and DeepSeek.

Sources by AI model

Last seen: source activity across all models

When was each source last cited? 15 sources tracked across 3 AI models.

All tracked sources by category

How to see what sources ChatGPT uses

ChatGPT, Claude, Perplexity, and other LLMs pull from different source pools. Some cite live web pages. Others draw from training data that includes Reddit, Wikipedia, and news sites.

The sources an LLM cites directly shape how it describes your brand, recommends your product, and positions you against competitors. If G2 reviews say you're the best CRM, ChatGPT will repeat that claim.

xSeek tracks citation patterns across all major AI models. You see exactly which domains get cited, how often, and whether your brand appears. That's the difference between hoping LLMs say the right thing and knowing they do.

Best tools to check ChatGPT sources

Native UIs like Perplexity and SearchGPT show inline citations. Browser dev tools reveal the actual URLs fetched during retrieval. Enterprise teams use observability layers that log every source for every query.

For ongoing monitoring, tracking whether your brand gets cited more or less over time, you need an AI visibility tool. xSeek monitors citation frequency, share of voice, and source patterns across ChatGPT, Claude, Perplexity, Gemini, Grok, and DeepSeek.

Frequently asked questions

How to see what sources ChatGPT uses?

You can check ChatGPT's sources by enabling browsing mode (SearchGPT) which shows inline citations, inspecting network requests in browser dev tools, or using an AI visibility tool like xSeek that tracks citation patterns across ChatGPT responses at scale.

Is Reddit still used to train LLMs?

Yes. OpenAI has a $60M/year licensing deal with Reddit. Reddit content appears in ChatGPT training data and in live browsing citations. Other LLMs also reference Reddit threads through web search retrieval, though citation frequency has shifted since mid-2025.

What are the best tools to check ChatGPT sources?

Native citation UIs (Perplexity, SearchGPT), browser network inspection for live URL retrieval, retrieval observability layers for custom LLM stacks, and AI visibility platforms like xSeek that monitor citation frequency and share of voice across all major LLMs.

How often does ChatGPT cite Reddit in 2025-2026?

Reddit URLs appear in approximately 8-15% of ChatGPT responses that include citations, depending on query type. Consumer queries and product recommendations cite Reddit most frequently. The rate has fluctuated but Reddit remains a significant source.

What is generative engine optimization (GEO)?

GEO is the practice of optimizing your content and online presence to be cited by AI answer engines like ChatGPT, Perplexity, and Claude. It involves monitoring which sources LLMs cite, ensuring your brand appears on high-citation domains, and structuring content for AI retrieval.

Do different LLMs use different sources?

Yes. Each LLM has distinct citation patterns. Perplexity cites the widest variety of sources in every response. Grok has unique access to X/Twitter. Gemini leverages Google Search. ChatGPT varies between training data and live browsing. Claude favors documentation and academic sources.

Track which sources LLMs cite for your brand

xSeek monitors 180+ domains across ChatGPT, Claude, Perplexity, Gemini, Grok, and DeepSeek. See exactly when and where your brand gets cited.