Fugu-MT 論文翻訳(概要): Multi-axis Analysis of Image Manipulation Localization

論文の概要: Multi-axis Analysis of Image Manipulation Localization

arxiv url: http://arxiv.org/abs/2605.20174v1
Date: Tue, 19 May 2026 17:54:33 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-20 15:03:09.576251
Title: Multi-axis Analysis of Image Manipulation Localization
Title（参考訳）: 画像操作位置の多軸解析
Authors: Keanu Nichols, Divya Appapogu, Giscard Biamby, Dina Bashkirova, Anna Rohrbach, Bryan A. Plummer,
Abstract要約: 操作された画像は誤報を広め、虚偽の物語を作り、重要な問題について人々の意見に影響を与える可能性がある。領域シフト, qualIty, Type, Size (AUDITS) に基づく解析を導入し, 画像検出における解析の軸について検討する。 AUDITSは2つの異なるソース(ユーザーとニュース写真)から530万以上の画像で構成されている
参考スコア（独自算出の注目度）: 29.967054752383607
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Advanced image editing software enables easy creation of highly convincing image manipulations, which has been made even more accessible in recent years due to advances in generative AI. Manipulated images, while often harmless, could spread misinformation, create false narratives, and influence people's opinions on important issues. Despite this growing threat, there is limited research on detecting advanced manipulations across different visual domains. Thus, we introduce Analysis Under Domain-shifts, qualIty, Type, and Size (AUDITS), a comprehensive benchmark designed for studying axes of analysis in image manipulation detection. AUDITS comprises over 530K images from two distinct sources (user and news photos). We curate our dataset to support analysis across multiple axes using recent diffusion-based inpaintings, spanning a diverse range of manipulation types and sizes. We conduct experiments under different types of domain shift to evaluate robustness of existing image manipulation detection methods. Our goal is to drive further research in this area by offering new insights that would help develop more reliable and generalizable image manipulation detection methods.
Abstract（参考訳）: 高度な画像編集ソフトウェアは、非常に説得力のある画像操作を容易に作成できる。操作された画像は、しばしば無害であるが、誤報を広め、虚偽の物語を作り、重要な問題について人々の意見に影響を与える可能性がある。この脅威が増大しているにもかかわらず、様々な視覚領域にわたる高度な操作を検出する研究は限られている。そこで我々は、画像操作検出において解析の軸を研究するために設計された総合ベンチマークであるAUDITS(Analytic Under Domain-shifts, qualIty, Type, and Size)を紹介する。 AUDITSは、2つの異なるソース(ユーザーとニュース写真)から530K以上の画像で構成されている。我々は、最近の拡散ベースのインペインティングを用いて、複数の軸にわたる解析をサポートするためにデータセットをキュレートし、様々な種類の操作タイプとサイズにまたがる。我々は、既存の画像操作検出手法の堅牢性を評価するために、異なるタイプの領域シフトで実験を行う。私たちのゴールは、より信頼性が高く、より一般化可能な画像検出方法の開発に役立つ新しい洞察を提供することによって、この分野のさらなる研究を進めることです。

関連論文リスト

FRAME: Forensic Routing and Adaptive Multi-path Evidence Fusion for Image Manipulation Detection [3.9783598254756023]
textbfForensic textbfRouting と textbfAdaptive textbfMulti-path textbfEvidence fusion の手法である textbfFRAME を提案する。 FRAMEは多様な法医学的アルゴリズムを多経路解析空間に整理し、各入力画像に対する情報的法医学的パスを適応的に選択し、補完的な証拠を融合して検出とローカライゼーション性能を向上させる。
論文参考訳（メタデータ） (2026-05-12T23:48:26Z)
ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation [81.52606410224136]
画像操作検出と局所化のための大規模ベンチマークである textbfManipBench を提案する。また,マルチモーダル大言語モデル(MLLM)に基づくオールインワンモデルであるtextbfManipShieldを提案する。
論文参考訳（メタデータ） (2025-11-18T08:50:17Z)
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection [107.86009509291581]
我々はForgerySleuthを提案し、包括的ヒント融合を行い、改ざんされた領域を示すセグメンテーション出力を生成する。 ForgeryAnalysisの有効性を実証し,ForgerySleuthが既存手法の堅牢性,一般化性,説明可能性において著しく優れていたことを示す。
論文参考訳（メタデータ） (2024-11-29T04:35:18Z)
Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors [62.63467652611788]
実画像27,600枚、223,400枚、AI拡張画像1,472,700枚を含むSEMI-TRUTHSを紹介する。それぞれの画像には、検出器のロバスト性の標準化と目標評価のためのメタデータが添付されている。以上の結果から,現状の検出器は摂動の種類や程度,データ分布,拡張方法に様々な感度を示すことが示唆された。
論文参考訳（メタデータ） (2024-11-12T01:17:27Z)
DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention [12.36906630199689]
DA-HFNet鍛造画像データセットをテキストまたは画像支援GANおよび拡散モデルで作成する。我々のゴールは、階層的なプログレッシブネットワークを使用して、異なるスケールの偽造物を検出およびローカライゼーションするために捕獲することである。
論文参考訳（メタデータ） (2024-06-03T16:13:33Z)
On the exploitation of DCT statistics for cropping detectors [5.039808715733204]
本研究では,DCT統計を用いた画像分解能分類器について検討し,画像の本来の分解能を検出することを目的とした。その結果, 収穫画像と収穫画像の区別における分類器の信頼性が示され, 元の分解能の信頼性が評価された。この研究は、複数の領域にわたる画像解析と使用法を変換する可能性を持つ、この分野における新しい視点を開放する。
論文参考訳（メタデータ） (2024-03-21T19:05:31Z)
ObjectFormer for Image Manipulation Detection and Localization [118.89882740099137]
画像操作の検出とローカライズを行うObjectFormerを提案する。画像の高周波特徴を抽出し,マルチモーダルパッチの埋め込みとしてRGB特徴と組み合わせる。各種データセットについて広範な実験を行い,提案手法の有効性を検証した。
論文参考訳（メタデータ） (2022-03-28T12:27:34Z)
Image Manipulation Detection by Multi-View Multi-Scale Supervision [11.319080833880307]
画像操作検出の主な課題は、新しいデータの操作に敏感な一般化可能な特徴の学習方法である。本稿では,多視点特徴学習とマルチスケール監視による両面の考察を行う。我々の思考はMVSS-Netと呼ばれる新しいネットワークによって実現される。
論文参考訳（メタデータ） (2021-04-14T13:05:58Z)
Translate to Adapt: RGB-D Scene Recognition across Domains [18.40373730109694]
本研究では,マルチモーダルシーン認識データセット内に,潜在的に深刻なドメインシフト問題が存在することを注目する。本稿では,異なるカメラ領域にまたがって適応可能な自己教師付きモダリティ間翻訳法を提案する。
論文参考訳（メタデータ） (2021-03-26T18:20:29Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。