AI models lean on Reddit: The internet’s ultimate knowledge base

AI systems rely heavily on a select group of websites for information. Reddit leads by a wide margin, accounting for about 40% of domain citations, followed by Wikipedia (26%), YouTube (24%), Google (23%), Yelp (21%), Facebook (20%), Amazon (19%), Tripadvisor (12%), Instagram (11%), and Quora (5%).
This dominance of specific platforms highlights how large language models (LLMs) base their answers on a concentrated slice of the web.
A Semrush analysis supports and elaborates on these findings. As noted in a recent Voronoi summary, Semrush reports that LLMs like ChatGPT lean most heavily on Reddit and Wikipedia when sourcing facts.
Moreover, Semrush’s AI Search study reveals that Quora and Reddit are the most cited domains in Google's AI Overviews, quick answer boxes that appear at the top of many search results, owing largely to their rich, user-generated content.
As AI-powered interfaces like Google’s AI Overviews grow more prevalent, now appearing in over 13% of all searches according to Semrush data, your visibility within them can make or break search discoverability.
Ranking well is no longer sufficient; being cited directly in AI-generated responses is increasingly the most valuable position to hold.
Sites like Quora and Reddit excel in visibility because they answer hyper-specific questions that broader sites don’t address. If you can contribute well-crafted content to these platforms or similar high-authority domains, you may significantly boost your exposure across AI channels.
This story is written and edited by the Global South World team, you can contact us here.