Fugu-MT 論文翻訳(概要): Semantic Manipulation Localization

論文の概要: Semantic Manipulation Localization

arxiv url: http://arxiv.org/abs/2604.10132v1
Date: Sat, 11 Apr 2026 09:53:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:15.864398
Title: Semantic Manipulation Localization
Title（参考訳）: セマンティックマニピュレーションの局在化
Authors: Zhenshan Tan, Chenhan Lu, Yuxiang Huang, Ziwen He, Xiang Zhang, Yuzhe Sha, Xianyi Chen, Tianrun Chen, Zhangjie Fu,
Abstract要約: 画像の解釈を著しく変化させる微妙な意味的編集の局所化に焦点を当てた新しいタスクである意味的操作を導入する。本課題に基づいて,意味的アンカー,摂動知覚,意味論的制約のある推論という3つのコンポーネントを通して意味的感受性をモデル化する,エンドツーエンドのフレームワークであるTRACEを提案する。包括的実験により、TRACE は我々のベンチマークで既存の IML メソッドを一貫して上回り、より完全でコンパクトでセマンティックに整合したローカライゼーション結果を生成することが示された。
参考スコア（独自算出の注目度）: 18.942761820082705
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Image Manipulation Localization (IML) aims to identify edited regions in an image. However, with the increasing use of modern image editing and generative models, many manipulations no longer exhibit obvious low-level artifacts. Instead, they often involve subtle but meaning-altering edits to an object's attributes, state, or relationships while remaining highly consistent with the surrounding content. This makes conventional IML methods less effective because they mainly rely on artifact detection rather than semantic sensitivity. To address this issue, we introduce Semantic Manipulation Localization (SML), a new task that focuses on localizing subtle semantic edits that significantly change image interpretation. We further construct a dedicated fine-grained benchmark for SML using a semantics-driven manipulation pipeline with pixel-level annotations. Based on this task, we propose TRACE (Targeted Reasoning of Attributed Cognitive Edits), an end-to-end framework that models semantic sensitivity through three progressively coupled components: semantic anchoring, semantic perturbation sensing, and semantic-constrained reasoning. Specifically, TRACE first identifies semantically meaningful regions that support image understanding, then injects perturbation-sensitive frequency cues to capture subtle edits under strong visual consistency, and finally verifies candidate regions through joint reasoning over semantic content and semantic scope. Extensive experiments show that TRACE consistently outperforms existing IML methods on our benchmark and produces more complete, compact, and semantically coherent localization results. These results demonstrate the necessity of moving beyond artifact-based localization and provide a new direction for image forensics in complex semantic editing scenarios.
Abstract（参考訳）: 画像操作局所化(IML)は、画像内の編集された領域を特定することを目的としている。しかし、現代の画像編集と生成モデルの利用が増加し、多くの操作は明らかに低レベルのアーティファクトを示さない。代わりに、しばしば、オブジェクトの属性、状態、関係に対する微妙だが意味を変える編集を伴いながら、周囲のコンテンツと高度に整合性を保つ。これにより、従来のIMLメソッドは、セマンティックな感度よりもアーティファクト検出に大きく依存するため、効率が低下する。この問題に対処するために、画像の解釈を著しく変える微妙な意味編集の局所化に焦点を当てた、セマンティック・マニピュレーション・ローカライゼーション(SML)を導入する。さらに,画素レベルのアノテーションを持つセマンティックス駆動の操作パイプラインを用いて,SML用の詳細なベンチマークを構築する。この課題に基づいて,意味的アンカー,意味的摂動センシング,意味的制約のある推論という3つの段階的に結合されたコンポーネントを通して意味的感受性をモデル化する,エンドツーエンドのフレームワークであるTRACE(Targeted Reasoning of Attributed Cognitive Edits)を提案する。具体的には、TRACEはまず、画像理解をサポートする意味論的意味のある領域を特定し、次に摂動に敏感な周波数キューを注入し、強い視覚的一貫性の下で微妙な編集をキャプチャし、最後にセマンティックコンテンツとセマンティックスコープに関する共同推論を通して候補領域を検証する。包括的実験により、TRACE は我々のベンチマークで既存の IML メソッドを一貫して上回り、より完全でコンパクトでセマンティックに整合したローカライゼーション結果を生成することが示された。これらの結果は、アーティファクトベースのローカライゼーションを超えて、複雑なセマンティック編集シナリオにおける画像鑑定のための新しい方向を提供する必要があることを示す。

論文の概要: Semantic Manipulation Localization

関連論文リスト