Fugu-MT 論文翻訳(概要): FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection

論文の概要: FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection

arxiv url: http://arxiv.org/abs/2603.14220v1
Date: Sun, 15 Mar 2026 04:49:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.680106
Title: FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection
Title（参考訳）: FIND:拡散生成画像検出のためのシンプルで効果的なベースライン
Authors: Jie Li, Yingying Feng, Chi Xie, Jie Hu, Lei Tan, Jiayi Ji,
Abstract要約: 実際の画像は合成画像よりもガウス分布に適合することが難しい。本稿では,単純なバイナリ分類器のみを必要とする新しい手法であるFIND(Forgery Identification via Noise disturbance)を提案する。 FINDはGenImageベンチマークで11.7%向上し、既存のメソッドよりも126倍高速である。
参考スコア（独自算出の注目度）: 32.11320589253777
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The remarkable realism of images generated by diffusion models poses critical detection challenges. Current methods utilize reconstruction error as a discriminative feature, exploiting the observation that real images exhibit higher reconstruction errors when processed through diffusion models. However, these approaches require costly reconstruction computations and depend on specific diffusion models, making their performance highly model-dependent. We identify a fundamental difference: real images are more difficult to fit with Gaussian distributions compared to synthetic ones. In this paper, we propose Forgery Identification via Noise Disturbance (FIND), a novel method that requires only a simple binary classifier. It eliminates reconstruction by directly targeting the core distributional difference between real and synthetic images. Our key operation is to add Gaussian noise to real images during training and label these noisy versions as synthetic. This step allows the classifier to focus on the statistical patterns that distinguish real from synthetic images. We theoretically prove that the noise-augmented real images resemble diffusion-generated images in their ease of Gaussian fitting. Furthermore, simply by adding noise, they still retain visual similarity to the original images, highlighting the most discriminative distribution-related features. The proposed FIND improves performance by 11.7% on the GenImage benchmark while running 126x faster than existing methods. By removing the need for auxiliary diffusion models and reconstruction, it offers a practical, efficient, and generalizable way to detect diffusion-generated content.
Abstract（参考訳）: 拡散モデルによって生成された画像の顕著なリアリズムは、重要な検出課題を生じさせる。現在の手法では, 拡散モデルを用いて処理した場合に, 実画像がより高い再構成誤差を示すという観察を活かして, 再構成誤差を識別的特徴として利用している。しかし、これらの手法はコストのかかる再構成計算を必要とし、特定の拡散モデルに依存するため、その性能はモデルに依存している。実際の画像は合成画像に比べてガウス分布に適合することが困難である。本稿では,単純なバイナリ分類器のみを必要とする新しい手法であるFIND(Forgery Identification via Noise disturbance)を提案する。実画像と合成画像のコア分布差を直接ターゲットにすることで、再構成をなくす。私たちの重要な操作は、トレーニング中にガウスノイズを実画像に追加し、これらのノイズバージョンを合成としてラベル付けすることです。このステップにより、分類器は合成画像と現実を区別する統計パターンに焦点を合わせることができる。理論的には、ノイズ増大実画像はガウスフィッティングの容易さで拡散生成画像に類似していることを証明する。さらに、単にノイズを加えるだけで、元の画像と視覚的類似性が保たれ、最も識別性の高い分布関連の特徴が強調される。提案されたFINDは、GenImageベンチマークで11.7%向上し、既存のメソッドよりも126倍高速である。補助拡散モデルや再構成の必要性を取り除くことで、拡散生成コンテンツを検出する実用的で効率的で一般化可能な方法を提供する。

論文の概要: FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection

関連論文リスト