Fugu-MT Paper Translation (Summary): SearchAttack: Red-Teaming LLMs against Real-World Threats via Framing Unsafe Web Information-Seeking Tasks

Paper summary: SearchAttack: Red-Teaming LLMs against Real-World Threats via Framing Unsafe Web Information-Seeking Tasks

arxiv url: http://arxiv.org/abs/2601.04093v1
Date: Wed, 07 Jan 2026 16:59:34 GMT
Status: Translation complete
Last updated in system: 2026-01-09 02:15:23.696252
Title: SearchAttack: Red-Teaming LLMs against Real-World Threats via Framing Unsafe Web Information-Seeking Tasks
Authors: Yu Yan, Sheng Sun, Mingfeng Li, Zheming Yang, Chiwei Zhu, Fei Ma, Benfeng Xu, Min Liu
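Abstract summary: Motivated by the dilemma that search results fall outside the LLM's control, the authors identify web search as a critical attack surface and propose SearchAttack for red-teaming. SearchAttack outsources the harmful semantics to web search, retaining only the query's skeleton and fragmented clues.
Reference score (independently computed attention score): 19.28321072381512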
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Recently, people have suffered and become increasingly aware of the unreliability gap in LLMs for open and knowledge-intensive tasks, and thus turn to search-augmented LLMs to mitigate this issue. However, when the search engine is triggered for harmful tasks, the outcome is no longer under the LLM's control. Once the returned content directly contains targeted, ready-to-use harmful takeaways, the LLM's safeguards cannot withdraw that exposure. Motivated by this dilemma, we identify web search as a critical attack surface and propose SearchAttack for red-teaming. SearchAttack outsources the harmful semantics to web search, retaining only the query's skeleton and fragmented clues, and further steers LLMs to reconstruct the retrieved content via structural rubrics to achieve malicious goals. Extensive experiments are conducted to red-team the search-augmented LLMs for responsible vulnerability assessment. Empirically, SearchAttack demonstrates strong effectiveness in attacking these systems.
