Related papers: Unexpected Knowledge: Auditing Wikipedia and Grokipedia Search Recommendations

Unexpected Knowledge: Auditing Wikipedia and Grokipedia Search Recommendations

URL: http://arxiv.org/abs/2512.17027v1
Date: Thu, 18 Dec 2025 19:41:58 GMT
Title: Unexpected Knowledge: Auditing Wikipedia and Grokipedia Search Recommendations
Authors: Erica Coppolillo, Simone Mungari,
Abstract summary: We provide the first comparative analysis of search engine in Wikipedia and Grokipedia.<n>We collect over 70,000 search engine results and examine their semantic alignment, overlap, and topical structure.<n>Our findings show that unexpected search engine outcomes are a common feature of both the platforms.
Score: 1.4323566945483497
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Encyclopedic knowledge platforms are key gateways through which users explore information online. The recent release of Grokipedia, a fully AI-generated encyclopedia, introduces a new alternative to traditional, well-established platforms like Wikipedia. In this context, search engine mechanisms play an important role in guiding users exploratory paths, yet their behavior across different encyclopedic systems remains underexplored. In this work, we address this gap by providing the first comparative analysis of search engine in Wikipedia and Grokipedia. Using nearly 10,000 neutral English words and their substrings as queries, we collect over 70,000 search engine results and examine their semantic alignment, overlap, and topical structure. We find that both platforms frequently generate results that are weakly related to the original query and, in many cases, surface unexpected content starting from innocuous queries. Despite these shared properties, the two systems often produce substantially different recommendation sets for the same query. Through topical annotation and trajectory analysis, we further identify systematic differences in how content categories are surfaced and how search engine results evolve over multiple stages of exploration. Overall, our findings show that unexpected search engine outcomes are a common feature of both the platforms, even though they exhibit discrepancies in terms of topical distribution and query suggestions.

Related papers

Wikipedia and Grokipedia: A Comparison of Human and Generative Encyclopedias [1.2109519547057517]
We examine how generative mediation alters content selection, textual rewriting, narrative structure, and evaluative framing in encyclopedic content.<n>We model page inclusion in Grokipedia as a function of Wikipedia page popularity, density of reference, and recent editorial activity.<n>Rewriting is more frequent for pages with higher reference density and recent controversy, while highly popular pages are more often reproduced without modification.
arXiv Detail & Related papers (2026-02-05T10:24:21Z)
How Similar Are Grokipedia and Wikipedia? A Multi-Dimensional Textual and Structural Comparison [0.0]
Grokipedia, an AI-generated encyclopedia developed by Elon Musk's xAI, was presented as a response to perceived ideological and structural biases in Wikipedia.<n>This study undertakes a large-scale computational comparison of 1,800 matched article pairs between Grokipedia and Wikipedia.<n>Using metrics across lexical richness, readability, structural organization, reference density, and semantic similarity, we assess how closely the two platforms align in form and substance.
arXiv Detail & Related papers (2025-10-30T18:04:46Z)
A Survey of Generative Search and Recommendation in the Era of Large Language Models [125.26354486027408]
generative search (retrieval) and recommendation aims to address the matching problem in a generative manner. Superintelligent generative large language models have sparked a new paradigm in search and recommendation.
arXiv Detail & Related papers (2024-04-25T17:58:17Z)
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition [94.90258603217008]
The MultiCoNER RNum2 shared task aims to tackle multilingual named entity recognition (NER) in fine-grained and noisy scenarios. Previous top systems in the MultiCoNER RNum1 either incorporate the knowledge bases or gazetteers. We propose a unified retrieval-augmented system (U-RaNER) for fine-grained multilingual NER.
arXiv Detail & Related papers (2023-05-05T16:59:26Z)
Evaluating Verifiability in Generative Search Engines [70.59477647085387]
Generative search engines directly generate responses to user queries, along with in-line citations. We conduct human evaluation to audit four popular generative search engines. We find that responses from existing generative search engines are fluent and appear informative, but frequently contain unsupported statements and inaccurate citations.
arXiv Detail & Related papers (2023-04-19T17:56:12Z)
NeuralSearchX: Serving a Multi-billion-parameter Reranker for Multilingual Metasearch at a Low Cost [4.186775801993103]
We describe NeuralSearchX, a metasearch engine based on a multi-purpose large reranking model to merge results and highlight sentences. We show that our design choices led to a much cost-effective system with competitive QPS while having close to state-of-the-art results on a wide range of public benchmarks.
arXiv Detail & Related papers (2022-10-26T16:36:53Z)
A Large-Scale Characterization of How Readers Browse Wikipedia [13.106604261718381]
We present the first systematic large-scale analysis of how readers browse Wikipedia. Using billions of page requests from Wikipedia's server logs, we measure how readers reach articles. We find that navigation behavior is characterized by highly diverse structures.
arXiv Detail & Related papers (2021-12-22T12:54:44Z)
Exposing Query Identification for Search Transparency [69.06545074617685]
We explore the feasibility of approximate exposing query identification (EQI) as a retrieval task by reversing the role of queries and documents in two classes of search systems. We derive an evaluation metric to measure the quality of a ranking of exposing queries, as well as conducting an empirical analysis focusing on various practical aspects of approximate EQI.
arXiv Detail & Related papers (2021-10-14T20:19:27Z)
Search Engine Similarity Analysis: A Combined Content and Rankings Approach [6.69087470775851]
We present an analysis of the affinity of the two major search engines, Google and Bing, along with DuckDuckGo. We developed a new similarity metric that leverages both the content and the ranking of search responses. We found that Google stands apart, but Bing and DuckDuckGo are largely indistinguishable from each other.
arXiv Detail & Related papers (2020-11-01T23:57:24Z)
A New Neural Search and Insights Platform for Navigating and Organizing AI Research [56.65232007953311]
We introduce a new platform, AI Research Navigator, that combines classical keyword search with neural retrieval to discover and organize relevant literature. We give an overview of the overall architecture of the system and of the components for document analysis, question answering, search, analytics, expert search, and recommendations.
arXiv Detail & Related papers (2020-10-30T19:12:25Z)
On the Social and Technical Challenges of Web Search Autosuggestion Moderation [118.47867428272878]
Autosuggestions are typically generated by machine learning (ML) systems trained on a corpus of search logs and document representations. While current search engines have become increasingly proficient at suppressing such problematic suggestions, there are still persistent issues that remain. We discuss several dimensions of problematic suggestions, difficult issues along the pipeline, and why our discussion applies to the increasing number of applications beyond web search.
arXiv Detail & Related papers (2020-07-09T19:22:00Z)
A Deeper Investigation of the Importance of Wikipedia Links to the Success of Search Engines [7.433327915285967]
We report the results of an investigation into the incidence of Wikipedia links in search engine results pages (SERPs) We find that Wikipedia links are extremely common in important search contexts, appearing in 67-84% of all SERPs for common and trending queries, but less often for medical queries. Our findings reinforce the complementary notions that (1) Wikipedia content and research has major impact outside of the Wikipedia domain and (2) powerful technologies like search engines are highly reliant on free content created by volunteers.
arXiv Detail & Related papers (2020-04-21T19:58:28Z)

This list is automatically generated from the titles and abstracts of the papers in this site.