Fugu-MT 論文翻訳(概要): MillStone: How Open-Minded Are LLMs?

論文の概要: MillStone: How Open-Minded Are LLMs?

arxiv url: http://arxiv.org/abs/2509.11967v1
Date: Mon, 15 Sep 2025 14:18:51 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-16 17:26:23.334896
Title: MillStone: How Open-Minded Are LLMs?
Title（参考訳）: MillStone: LLMはどの程度オープンなのでしょうか?
Authors: Harold Triedman, Vitaly Shmatikov,
Abstract要約: Web検索や情報検索,その他のエージェント機能を備えた大規模言語モデルが,従来の検索エンジンに取って代わり始めている。我々は、LLMが議論の的になっている問題に対して外部の議論が与える影響を体系的に測定することを目的とした最初のベンチマークであるMillStoneを紹介する。
参考スコア（独自算出の注目度）: 8.349679378026027
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models equipped with Web search, information retrieval tools, and other agentic capabilities are beginning to supplant traditional search engines. As users start to rely on LLMs for information on many topics, including controversial and debatable issues, it is important to understand how the stances and opinions expressed in LLM outputs are influenced by the documents they use as their information sources. In this paper, we present MillStone, the first benchmark that aims to systematically measure the effect of external arguments on the stances that LLMs take on controversial issues (not all of them political). We apply MillStone to nine leading LLMs and measure how ``open-minded'' they are to arguments supporting opposite sides of these issues, whether different LLMs agree with each other, which arguments LLMs find most persuasive, and whether these arguments are the same for different LLMs. In general, we find that LLMs are open-minded on most issues. An authoritative source of information can easily sway an LLM's stance, highlighting the importance of source selection and the risk that LLM-based information retrieval and search systems can be manipulated.
Abstract（参考訳）: Web検索や情報検索,その他のエージェント機能を備えた大規模言語モデルが,従来の検索エンジンに取って代わり始めている。議論を呼んだり、議論を呼んだり、議論を呼んだりする問題など、多くの話題の情報をLCMに頼り始める中で、LCM出力で表現される姿勢や意見が、情報ソースとして使用する文書にどのように影響されるかを理解することが重要である。本稿では、LLMが議論を呼んでいる問題(全て政治的ではない)に対する姿勢に対する外部論証の効果を体系的に測定することを目的とした最初のベンチマークであるMillStoneについて述べる。我々は、MillStone を9つの主要な LLM に適用し、これらの問題と異なる LLM が互いに一致しているかどうか、どの LLM が最も説得力があるか、そしてこれらの議論が異なる LLM に対して同じであるかどうかを測る。一般的に、LLMはほとんどの問題に対してオープンな意識を持っている。信頼性のある情報ソースは、LLMの姿勢を容易に妨げ、ソース選択の重要性とLLMベースの情報検索と検索システムを操作するリスクを強調している。

論文の概要: MillStone: How Open-Minded Are LLMs?

関連論文リスト