Large Language Models as Search Engines: Societal Challenges
- URL: http://arxiv.org/abs/2512.08946v1
- Date: Mon, 24 Nov 2025 12:59:42 GMT
- Title: Large Language Models as Search Engines: Societal Challenges
- Authors: Zacchary Sadeddine, Winston Maxwell, Gaƫl Varoquaux, Fabian M. Suchanek,
- Abstract summary: Large Language Models (LLMs) may one day replace search engines as the primary portal to information on the Web.<n>In this article, we investigate the societal challenges that such a change could bring.
- Score: 21.113599077463018
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Large Language Models (LLMs) may one day replace search engines as the primary portal to information on the Web. In this article, we investigate the societal challenges that such a change could bring. We focus on the roles of LLM Providers, Content Creators, and End Users, and identify 15 types of challenges. With each, we show current mitigation strategies -- both from the technical perspective and the legal perspective. We also discuss the impact of each challenge and point out future research opportunities.
Related papers
- From Questions to Insights: Exploring XAI Challenges Reported on Stack Overflow Questions [1.8049331600471712]
Lack of interpretability is a major barrier that limits the practical usage of AI models.<n>XAI techniques (e.g., SHAP, LIME) have been employed to interpret these models' performance.<n>We conducted an exploratory study to expose these challenges, their severity, and features that can make XAI techniques more accessible.
arXiv Detail & Related papers (2025-04-03T23:33:46Z) - A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training [0.0]
This review identifies key challenges in this domain, including challenges such as noise (irrelevant or misleading information), duplication of content, the presence of low-quality or incorrect information, biases, and the inclusion of sensitive or personal information in web-mined corpora.
Through an examination of current methodologies for data cleaning, pre-processing, bias detection and mitigation, we highlight the gaps in existing approaches and suggest directions for future research.
arXiv Detail & Related papers (2024-07-10T13:09:23Z) - Foundational Challenges in Assuring Alignment and Safety of Large Language Models [171.01569693871676]
This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs)
Based on the identified challenges, we pose $200+$ concrete research questions.
arXiv Detail & Related papers (2024-04-15T16:58:28Z) - A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges [29.74898680986507]
We review the current landscape of LM usage in regards to complex game playing scenarios and the challenges still open.
We present our perspective on promising future research avenues for the advancement of LMs in games.
arXiv Detail & Related papers (2024-03-15T12:37:12Z) - Large Language Models for Generative Information Extraction: A Survey [89.71273968283616]
Large Language Models (LLMs) have demonstrated remarkable capabilities in text understanding and generation.
We present an extensive overview by categorizing these works in terms of various IE subtasks and techniques.
We empirically analyze the most advanced methods and discover the emerging trend of IE tasks with LLMs.
arXiv Detail & Related papers (2023-12-29T14:25:22Z) - Online Advertisements with LLMs: Opportunities and Challenges [51.96140910798771]
This paper explores the potential for leveraging Large Language Models (LLM) in the realm of online advertising systems.
We introduce a general framework for LLM advertisement, consisting of modification, bidding, prediction, and auction modules.
arXiv Detail & Related papers (2023-11-11T02:13:32Z) - ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life
Videos [53.92440577914417]
ACQUIRED consists of 3.9K annotated videos, encompassing a wide range of event types and incorporating both first and third-person viewpoints.
Each video is annotated with questions that span three distinct dimensions of reasoning, including physical, social, and temporal.
We benchmark our dataset against several state-of-the-art language-only and multimodal models and experimental results demonstrate a significant performance gap.
arXiv Detail & Related papers (2023-11-02T22:17:03Z) - A Survey on Detection of LLMs-Generated Content [97.87912800179531]
The ability to detect LLMs-generated content has become of paramount importance.
We aim to provide a detailed overview of existing detection strategies and benchmarks.
We also posit the necessity for a multi-faceted approach to defend against various attacks.
arXiv Detail & Related papers (2023-10-24T09:10:26Z) - Language Models as a Service: Overview of a New Paradigm and its
Challenges [47.75762014254756]
Some of the most powerful language models currently are proprietary systems, accessible only via (typically restrictive) web or programming.
This paper has two goals: on the one hand, we delineate how the aforementioned challenges act as impediments to the accessibility, replicability, reliability, and trustworthiness of LM interfaces.
On the other hand, it serves as a comprehensive resource for existing knowledge on current, major LM, offering a synthesized overview of the licences and capabilities their interfaces offer.
arXiv Detail & Related papers (2023-09-28T16:29:52Z) - Retrospectives on the Embodied AI Workshop [238.302290980995]
We focus on 13 challenges presented at the Embodied AI Workshop at CVPR.
These challenges are grouped into three themes: (1) visual navigation, (2) rearrangement, and (3) embodied vision-and-language.
We discuss the dominant datasets within each theme, evaluation metrics for the challenges, and the performance of state-of-the-art models.
arXiv Detail & Related papers (2022-10-13T09:00:52Z) - Machine Learning Towards Intelligent Systems: Applications, Challenges,
and Opportunities [8.68311678910946]
Machine learning (ML) provides a mechanism for humans to process large amounts of data.
This review focuses on some of the fields and applications such as education, healthcare, network security, banking and finance, and social media.
arXiv Detail & Related papers (2021-01-11T01:32:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.