Fugu-MT Paper Translation (Abstract): MosaicLeaks: Privacy Risks in Querying-in-the-Open for Deep Research Agents

Paper summary: MosaicLeaks: Privacy Risks in Querying-in-the-Open for Deep Research Agents

arxiv url: http://arxiv.org/abs/2605.30727v1
Date: Fri, 29 May 2026 01:44:35 GMT
Status: Translation complete
Last updated in system: 2026-06-01 20:56:50.330141
Title: MosaicLeaks: Privacy Risks in Querying-in-the-Open for Deep Research Agents
Title (reference translation): MosaicLeaks: Risks of Querying-in-the-Open for Deep Research Agents
Authors: Alexander Gurung, Spandana Gella, Alexandre Drouin, Issam H. Laradji, Perouz Taslakian, Rafael Pardinas
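Abstract summary: Deep research agents combine private local documents with external tools such as web retrieval, which creates a privacy risk. We introduce MosaicLeaks, a benchmark of 1,001 deep research tasks that chain private enterprise documents and a public web corpus. We find that models across families and sizes frequently leak at all three levels.
Reference score (independently computed attention score): 76.60965179634664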
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information from its local context. This risk is amplified by the mosaic effect, where individual queries may appear harmless but become revealing in aggregate. We introduce MosaicLeaks, a benchmark of 1,001 multi-hop deep research tasks that chain private enterprise documents and a public web corpus, forcing agents to make external queries that depend on local information. We evaluate leakage with an adversary LLM that observes only the agent's external queries and attempts to infer private information at three levels: the agent's research intent, answers to specific private questions and verifiable claims about the enterprise documents. We find that models across families and sizes frequently leak at all three levels, that zero-shot privacy prompting reduces but does not eliminate leakage and that reinforcement learning for task performance alone worsens leakage. To address this, we propose Privacy-Aware Deep Research (PA-DR), an RL framework that combines situational rewards for task success with a learned privacy classifier to provide dense credit assignment over both per-query and mosaic-level leakage. Training Qwen3-4B-Instruct with PA-DR improves accuracy from 48.7% to 58.7% and reduces answer and full-information leakage from 34.0% to 9.9%.
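Abstract (reference translation): Deep research agents create a privacy risk by combining private local documents with external tools such as web retrieval. This risk is amplified by the mosaic effect: individual queries appear harmless but become revealing in aggregate. We introduce MosaicLeaks, a benchmark of 1,001 multi-hop deep research tasks that chain private enterprise documents and a public web corpus, forcing agents to issue external queries that depend on local information. We evaluate leakage with an adversary LLM that observes only the agent's external queries and attempts to infer private information at three levels: the agent's research intent, answers to specific private questions, and verifiable claims about the enterprise documents. Zero-shot privacy prompting reduces but does not eliminate leakage, and reinforcement learning for task performance alone worsens leakage. We therefore propose Privacy-Aware Deep Research (PA-DR), an RL framework that combines situational rewards for task success with a learned privacy classifier. Training Qwen3-4B-Instruct with PA-DR improves accuracy from 48.7% to 58.7% and reduces answer and full-information leakage from 34.0% to 9.9%.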
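
The mosaic effect described in the abstract can be illustrated with a toy sketch. This is not the paper's adversary LLM or PA-DR's learned privacy classifier: keyword overlap stands in for both, and every name here (`PRIVATE_FACTS`, `per_query_leak`, `mosaic_leak`, `shaped_reward`, the `weight` parameter, the example queries) is a hypothetical chosen for illustration only.

```python
from typing import List, Set

# Hypothetical private facts an adversary would like to recover.
PRIVATE_FACTS: Set[str] = {"acme", "merger", "q3", "berlin"}


def revealed(query: str, private_facts: Set[str]) -> Set[str]:
    """Private facts that appear verbatim as tokens in one query."""
    return set(query.lower().split()) & private_facts


def per_query_leak(queries: List[str], private_facts: Set[str]) -> float:
    """Largest fraction of private facts exposed by any single query."""
    worst = max((len(revealed(q, private_facts)) for q in queries), default=0)
    return worst / len(private_facts)


def mosaic_leak(queries: List[str], private_facts: Set[str]) -> float:
    """Fraction of private facts exposed by the union of all queries."""
    seen: Set[str] = set()
    for q in queries:
        seen |= revealed(q, private_facts)
    return len(seen) / len(private_facts)


def shaped_reward(task_reward: float, queries: List[str],
                  private_facts: Set[str], weight: float = 1.0) -> float:
    """Task reward minus a penalty on per-query and mosaic-level leakage.

    Assumption: a simple additive penalty; the paper's actual reward
    design is not specified in the abstract.
    """
    penalty = per_query_leak(queries, private_facts) + mosaic_leak(queries, private_facts)
    return task_reward - weight * penalty


queries = [
    "acme annual report",
    "merger filing rules",
    "q3 earnings calendar",
    "berlin office rentals",
]

print(per_query_leak(queries, PRIVATE_FACTS))  # each query alone exposes 1 of 4 facts
print(mosaic_leak(queries, PRIVATE_FACTS))     # together they expose all 4
print(shaped_reward(1.0, queries, PRIVATE_FACTS, weight=0.5))
```

Each query on its own exposes a quarter of the private facts, while the set of queries exposes all of them, which is why a penalty that looks only at individual queries would miss the aggregate leak.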

Related papers