Fugu-MT 論文翻訳(概要): CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection

論文の概要: CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection

arxiv url: http://arxiv.org/abs/2606.03066v1
Date: Tue, 02 Jun 2026 02:53:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:04.708724
Title: CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection
Title（参考訳）: CORE: 汎用マルチモーダルマニピュレーション検出のための競合指向推論
Authors: Jinjie Shen, Yaxiong Wang, Yujiao Wu, Lechao Cheng, Tianrui Hui, Nan Pu, Zhihui Li, Zhun Zhong,
Abstract要約: ジェネレーティブAIは、マルチモーダルなフェイクニュースをますます現実的で広範にし、公共の信頼と社会的安定に深刻な脅威を与えている。 textbfConflict-textbfOriented textbfREasoning (textbfCORE) フレームワークを提案する。 COREは堅牢で一般化可能なコンフリクト検出を実現し、いくつかのサンプルやゼロショット設定で、目に見えない操作タイプに効果的かつ迅速に適応する。
参考スコア（独自算出の注目度）: 56.64398465636452
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The rapid rise of generative AI has made multimodal fake news increasingly realistic and pervasive, posing severe threats to public trust and social stability. Existing detection methods rely heavily on manipulation-specific models and large-scale labeled data, resulting in poor generalization to emerging manipulation types. We observed that the essence of manipulated misinformation lies in its intrinsic conflicts, \textbf{i.e.,} semantic or physical inconsistencies either across modalities or with common world knowledge. Inspired by this observation, we propose \textbf{C}onflict-\textbf{O}riented \textbf{RE}asoning (\textbf{CORE}) framework, an effective paradigm that learns to endows multimodal large language models (MLLMs) with explicit conflict-capturing capability. To this end, CORE first constructs the Conflict Attribution Corpus (CAC) with fine-grained annotations of conflict factors and sources, providing essential data support for subsequent conflict perception training. By performing conflict-oriented representation enhancement and reasoning based on CAC, CORE achieves robust and generalizable conflict detection, effectively and rapidly adapting to unseen manipulation types with a few samples or in even zero-shot settings. Extensive experiments demonstrate that CORE surpasses state-of-the-art models. The dataset and code are publicly available at https://github.com/shen8424/CORE.
Abstract（参考訳）: 生成AIの急速な普及により、マルチモーダルなフェイクニュースはますます現実的で広まり、公衆の信頼と社会的安定に深刻な脅威をもたらしている。既存の検出方法は操作固有のモデルと大規模ラベル付きデータに大きく依存しているため、新たな操作タイプへの一般化は不十分である。操作された誤情報の本質は、その本質的な紛争、つまり、意味的または物理的不整合が、モダリティを越えても、あるいは共通の世界知識と共にもたらされることを観察した。本研究は,マルチモーダルな大言語モデル(MLLM)を明示的なコンフリクトキャプチャー能力で実現するための,効果的なパラダイムであるフレームワークである,‘textbf{C}onflict-\textbf{O}riented \textbf{RE}asoning(\textbf{CORE})を提案する。この目的のために、COREはコンフリクト・アトリビューション・コーパス(CAC)をコンフリクト・ファクターとソースの細かいアノテーションで構築し、その後のコンフリクト・アトリビューション・トレーニングに不可欠なデータサポートを提供する。 CACに基づくコンフリクト指向表現の強化と推論を行うことで、COREは堅牢で一般化可能なコンフリクト検出を実現し、いくつかのサンプルやゼロショット設定で、目に見えない操作タイプに効果的かつ迅速に適応する。大規模な実験では、COREが最先端のモデルを上回ることが示されている。データセットとコードはhttps://github.com/shen8424/COREで公開されている。

論文の概要: CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection

関連論文リスト