Fugu-MT 論文翻訳(概要): $\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

論文の概要: $\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

arxiv url: http://arxiv.org/abs/2510.05891v1
Date: Tue, 07 Oct 2025 13:02:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-08 17:57:08.260055
Title: $\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
Title（参考訳）: $\bf{D^3}$QE: 自己回帰画像検出のための離散分布離散型量子化誤差の学習
Authors: Yanran Zhang, Bingyao Yu, Yu Zheng, Wenzhao Zheng, Yueqi Duan, Lei Chen, Jie Zhou, Jiwen Lu,
Abstract要約: 視覚的自己回帰(AR)モデルは、離散トークン予測を通じて画像を生成する。本稿では,離散分布離散性を考慮した量子化誤差(D$3$QE)を自己回帰画像検出に活用することを提案する。
参考スコア（独自算出の注目度）: 85.9202830503973
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The emergence of visual autoregressive (AR) models has revolutionized image generation while presenting new challenges for synthetic image detection. Unlike previous GAN or diffusion-based methods, AR models generate images through discrete token prediction, exhibiting both marked improvements in image synthesis quality and unique characteristics in their vector-quantized representations. In this paper, we propose to leverage Discrete Distribution Discrepancy-aware Quantization Error (D$^3$QE) for autoregressive-generated image detection that exploits the distinctive patterns and the frequency distribution bias of the codebook existing in real and fake images. We introduce a discrete distribution discrepancy-aware transformer that integrates dynamic codebook frequency statistics into its attention mechanism, fusing semantic features and quantization error latent. To evaluate our method, we construct a comprehensive dataset termed ARForensics covering 7 mainstream visual AR models. Experiments demonstrate superior detection accuracy and strong generalization of D$^3$QE across different AR models, with robustness to real-world perturbations. Code is available at \href{https://github.com/Zhangyr2022/D3QE}{https://github.com/Zhangyr2022/D3QE}.
Abstract（参考訳）: 視覚自己回帰モデル(AR)の出現は、合成画像検出の新しい課題を提示しながら、画像生成に革命をもたらした。従来のGANや拡散法とは異なり、ARモデルは離散トークン予測によって画像を生成し、画像合成の品質とベクトル量子化表現における特徴の両方を顕著に改善した。本稿では、離散分布離散性を考慮した量子化誤差(D$^3$QE)を用いて、実画像および偽画像に存在するコードブックの固有パターンと周波数分布バイアスを利用する自動回帰画像検出手法を提案する。本稿では、動的コードブックの周波数統計をその注意機構に統合し、意味的特徴と量子化誤りを解消する離散分布離散化対応変換器を提案する。提案手法を評価するために,7つの主要な視覚的ARモデルをカバーするARForensicsと呼ばれる包括的データセットを構築した。実験では、実世界の摂動に頑健な異なるARモデルに対して、より優れた検出精度とD$^3$QEの強力な一般化を示す。コードは \href{https://github.com/Zhangyr2022/D3QE}{https://github.com/Zhangyr2022/D3QE} で公開されている。

論文の概要: $\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

関連論文リスト