Fugu-MT 論文翻訳(概要): The Ranking Blind Spot: Decision Hijacking in LLM-based Text Ranking

論文の概要: The Ranking Blind Spot: Decision Hijacking in LLM-based Text Ranking

arxiv url: http://arxiv.org/abs/2509.18575v1
Date: Tue, 23 Sep 2025 02:56:38 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-24 20:41:27.668979
Title: The Ranking Blind Spot: Decision Hijacking in LLM-based Text Ranking
Title（参考訳）: ランキング・ブラインド・スポット:LCMによるテキスト・ランキングにおける決定的ハイジャック
Authors: Yaoyao Qian, Yifan Zeng, Yuchao Jiang, Chelsi Jain, Huazheng Wang,
Abstract要約: 大規模言語モデル (LLM) は, 通過ランキングなどの情報検索タスクにおいて, 高い性能を示した。本研究では,LLMにおける命令追従能力がマルチドキュメント比較タスクとどのように相互作用するかを検討する。 2つのアプローチにより、このランキングの盲点がLLM評価システムにどのように影響するかを分析する。
参考スコア（独自算出の注目度）: 17.328293277532
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) have demonstrated strong performance in information retrieval tasks like passage ranking. Our research examines how instruction-following capabilities in LLMs interact with multi-document comparison tasks, identifying what we term the "Ranking Blind Spot", a characteristic of LLM decision processes during comparative evaluation. We analyze how this ranking blind spot affects LLM evaluation systems through two approaches: Decision Objective Hijacking, which alters the evaluation goal in pairwise ranking systems, and Decision Criteria Hijacking, which modifies relevance standards across ranking schemes. These approaches demonstrate how content providers could potentially influence LLM-based ranking systems to affect document positioning. These attacks aim to force the LLM ranker to prefer a specific passage and rank it at the top. Malicious content providers can exploit this weakness, which helps them gain additional exposure by attacking the ranker. In our experiment, We empirically show that the proposed attacks are effective in various LLMs and can be generalized to multiple ranking schemes. We apply these attack to realistic examples to show their effectiveness. We also found stronger LLMs are more vulnerable to these attacks. Our code is available at: https://github.com/blindspotorg/RankingBlindSpot
Abstract（参考訳）: 大規模言語モデル (LLM) は, 通過ランキングなどの情報検索タスクにおいて, 高い性能を示した。本研究では,LLMにおける命令追従能力がマルチドキュメント比較タスクとどのように相互作用するかを考察し,LLM決定過程の特徴である「ランキング・ブラインド・スポット」と呼ばれるものを特定する。本研究は,2つの手法を用いてLLM評価システムにどのように影響するかを解析する。2つの手法は,ペアランキングシステムにおける評価目標を変更する決定対象ハイジャックと,ランキング方式間の関連基準を変更する決定基準ハイジャックである。これらのアプローチは、コンテンツプロバイダがLCMベースのランキングシステムにどのように影響し、文書の位置決めに影響を及ぼすかを示す。これらの攻撃は、LLMローダに特定のパスを優先させ、トップにランク付けすることを目的としている。悪意のあるコンテンツプロバイダは、この弱点を悪用することができる。実験では,提案した攻撃は様々なLSMにおいて有効であり,複数のランキング方式に一般化可能であることを実証的に示す。これらの攻撃を実例に適用し,その有効性を示す。また、より強力なLSMはこれらの攻撃に対してより脆弱であることもわかりました。私たちのコードは、https://github.com/blindspotorg/RankingBlindSpotで利用可能です。

論文の概要: The Ranking Blind Spot: Decision Hijacking in LLM-based Text Ranking

関連論文リスト