Fugu-MT 論文翻訳(概要): When Recovery Matters: The Blind Spot of Surrogate Privacy in MLLM Editing

論文の概要: When Recovery Matters: The Blind Spot of Surrogate Privacy in MLLM Editing

arxiv url: http://arxiv.org/abs/2606.07171v1
Date: Fri, 05 Jun 2026 11:40:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-08 14:33:29.714007
Title: When Recovery Matters: The Blind Spot of Surrogate Privacy in MLLM Editing
Title（参考訳）: MLLM編集におけるプライバシー保護の盲点
Authors: Siyuan Xu, Yibing Liu, Peilin Chen, Yung-Hui LI, Shiqi Wang, Sam Kwong,
Abstract要約: 我々は、36のきめ細かいプライバシーカテゴリと65の編集手順をカバーする最初のリカバリ指向ベンチマークであるSPPE(Surrogate-based Privacy-Preserving Editing)を紹介する。編集性評価のために、ERMAはSRCCで13.9%、PLCCで12.3%向上した。 SPPEとInstructPix2Pixの実験では、両方のタスクが一貫した改善が見られた。
参考スコア（独自算出の注目度）: 61.80513991207956
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Multimodal Large Language Models (MLLMs) enable flexible instruction-driven image editing, but privacy risks arise when user images expose diverse and user-specific private content. Canonical privacy protection strategies typically substitute sensitive regions with surrogate content before cloud editing. Yet, the resulting output is often an edited surrogate rather than the desired edited source image, neglecting the local recovery in both design and evaluation scope. To this end, we introduce SPPE (Surrogate-based Privacy-Preserving Editing), the first recovery-oriented benchmark covering 36 fine-grained privacy categories and 65 editing instructions. It defines two complementary tasks: 1) editability assessment, which estimates before cloud interaction whether a surrogate can induce an edit consistent with the original image; and 2) surrogate-to-source edit recovery, which evaluates whether the edited surrogate can be transferred back to the private source with the edit effect preserved. We address each task with a dedicated method: ERMA predicts surrogate editability through instruction-aware multimodal relation modeling, while \method performs cycle-consistent recovery by using the surrogate editing pair as visual edit evidence and the source image as a source-preserving anchor. Experiments on SPPE and InstructPix2Pix show consistent improvements on both tasks. For editability assessment, ERMA improves over the best-performing baselines by 13.9% in SRCC and 12.3% in PLCC. For surrogate-to-source edit recovery, C2E-S2SER outperforms SOER across all 8 source integrity and edit consistency metrics on SPPE.
Abstract（参考訳）: MLLM(Multimodal Large Language Models)は、フレキシブルな命令駆動型画像編集を可能にするが、ユーザイメージが多様なユーザ固有のプライベートコンテンツを公開すると、プライバシのリスクが生じる。標準的なプライバシ保護戦略は、クラウド編集の前に、機密性の高い領域を代理コンテンツに置き換えるのが一般的である。しかし、結果の出力は、望まれるソースイメージよりもむしろ編集されたサロゲートであり、設計と評価の両方における局所的なリカバリを無視していることが多い。この目的のために、36のきめ細かいプライバシーカテゴリと65の編集命令をカバーする最初のリカバリ指向のベンチマークであるSPPE(Surrogate-based Privacy-Preserving Editing)を紹介する。それは2つの補完的なタスクを定義します。 1)サロゲートが原画像と整合した編集を誘導できるか否かをクラウド間相互作用前に推定する編集可能性評価 2) 編集効果を保存して, 編集したサロゲートをプライベートソースに戻すことができるかどうかを評価するサロゲートからソースへの編集回復を行う。 ERMAは命令対応マルチモーダル関係モデリングによりサロゲート編集可能性を予測する一方、Shamethodはサロゲート編集ペアを視覚的編集エビデンスとして、ソースイメージをソース保存アンカーとして、サイクル一貫性回復を行う。 SPPEとInstructPix2Pixの実験では、両方のタスクが一貫した改善が見られた。編集性評価のために、ERMAはSRCCで13.9%、PLCCで12.3%向上した。 C2E-S2SERは、サロゲートからソースへの編集リカバリのために、SPPE上の8つのソース整合性と編集整合性メトリクスでSOERを上回っている。

論文の概要: When Recovery Matters: The Blind Spot of Surrogate Privacy in MLLM Editing

関連論文リスト