Fugu-MT 論文翻訳(概要): The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration

論文の概要: The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration

arxiv url: http://arxiv.org/abs/2509.14284v1
Date: Tue, 16 Sep 2025 16:57:25 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-19 17:26:52.916898
Title: The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration
Title（参考訳）: 構成的プライバシリスクとマルチエージェントコラボレーションの軽減
Authors: Vaidehi Patil, Elias Stengel-Eskin, Mohit Bansal,
Abstract要約: 大規模言語モデル(LLM)はマルチエージェントシステムに不可欠なものである。プライバシーリスクは、暗記、直接推論、シングルターン評価を超えて現れる。特に、相互作用によって構成される一見無害な反応は、敵が機密情報の回復を累積的に行うことができる。
参考スコア（独自算出の注目度）: 72.33801123508145
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As large language models (LLMs) become integral to multi-agent systems, new privacy risks emerge that extend beyond memorization, direct inference, or single-turn evaluations. In particular, seemingly innocuous responses, when composed across interactions, can cumulatively enable adversaries to recover sensitive information, a phenomenon we term compositional privacy leakage. We present the first systematic study of such compositional privacy leaks and possible mitigation methods in multi-agent LLM systems. First, we develop a framework that models how auxiliary knowledge and agent interactions jointly amplify privacy risks, even when each response is benign in isolation. Next, to mitigate this, we propose and evaluate two defense strategies: (1) Theory-of-Mind defense (ToM), where defender agents infer a questioner's intent by anticipating how their outputs may be exploited by adversaries, and (2) Collaborative Consensus Defense (CoDef), where responder agents collaborate with peers who vote based on a shared aggregated state to restrict sensitive information spread. Crucially, we balance our evaluation across compositions that expose sensitive information and compositions that yield benign inferences. Our experiments quantify how these defense strategies differ in balancing the privacy-utility trade-off. We find that while chain-of-thought alone offers limited protection to leakage (~39% sensitive blocking rate), our ToM defense substantially improves sensitive query blocking (up to 97%) but can reduce benign task success. CoDef achieves the best balance, yielding the highest Balanced Outcome (79.8%), highlighting the benefit of combining explicit reasoning with defender collaboration. Together, our results expose a new class of risks in collaborative LLM deployments and provide actionable insights for designing safeguards against compositional, context-driven privacy leakage.
Abstract（参考訳）: 大規模言語モデル(LLM)がマルチエージェントシステムに不可欠なものになると、暗記、直接推論、シングルターン評価を超えて、新たなプライバシリスクが出現する。特に、相互作用によって構成される一見無害な反応は、敵が機密情報を累積的に回収することができる。マルチエージェントLLMシステムにおいて,このような構成プライバシリークと緩和手法に関する最初の体系的研究を行う。まず、補助的な知識とエージェントの相互作用が、個別に応答するたびにプライバシーリスクを増幅する方法をモデル化するフレームワークを開発する。次に、これを緩和するために、(1)防衛エージェントが相手にどのようにアウトプットを活用できるかを予測して質問者の意図を推測する「人民防衛理論」(ToM)と、(2)協力的合意防衛(CoDef)の2つの防衛戦略を提案し、評価する。重要なことは、私たちは、機密情報を公開する構成と、良心的な推論をもたらす構成とで評価のバランスをとります。われわれの実験は、プライバシーとユーティリティのトレードオフのバランスをとる上で、これらの防衛戦略がどう違うのかを定量化している。チェーン・オブ・シークレットだけではリークに対する保護が制限されている(約39%の機密性の高いブロッキングレート)のに対して、ToMのディフェンスはセンシティブなクエリ・ブロッキング(最大97%)を大幅に改善します。 CoDefは最高のバランスを達成し、最も高いバランスの取れたアウトカム(79.8%)を獲得し、明確な推論とディフェンダーの協力を組み合わせる利点を強調している。この結果から,共同LLMデプロイメントにおける新たなリスクのクラスを明らかにし,構成的かつコンテキスト駆動型プライバシリークに対する保護設計のための実用的な洞察を提供する。

論文の概要: The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration

関連論文リスト