Fugu-MT 論文翻訳(概要): LOTA: Bit-Planes Guided AI-Generated Image Detection

論文の概要: LOTA: Bit-Planes Guided AI-Generated Image Detection

arxiv url: http://arxiv.org/abs/2510.14230v1
Date: Thu, 16 Oct 2025 02:12:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-17 21:15:14.678393
Title: LOTA: Bit-Planes Guided AI-Generated Image Detection
Title（参考訳）: LOTA:bit-PlanesガイドによるAI生成画像検出
Authors: Hongsong Wang, Renxi Cheng, Yang Zhang, Chaolei Han, Jie Gui,
Abstract要約: GANとDiffusionのモデルでは、AI生成画像と実際の画像との区別がより困難になる。ノイズの多い画像生成を誘導する有効なビットプレーンを導入し、様々な画像正規化戦略を活用する。本手法は, GAN から Diffusion への98.2% 以上, Diffusion から GAN への99.2% 以上の精度を実現する。
参考スコア（独自算出の注目度）: 25.736743931612565
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The rapid advancement of GAN and Diffusion models makes it more difficult to distinguish AI-generated images from real ones. Recent studies often use image-based reconstruction errors as an important feature for determining whether an image is AI-generated. However, these approaches typically incur high computational costs and also fail to capture intrinsic noisy features present in the raw images. To solve these problems, we innovatively refine error extraction by using bit-plane-based image processing, as lower bit planes indeed represent noise patterns in images. We introduce an effective bit-planes guided noisy image generation and exploit various image normalization strategies, including scaling and thresholding. Then, to amplify the noise signal for easier AI-generated image detection, we design a maximum gradient patch selection that applies multi-directional gradients to compute the noise score and selects the region with the highest score. Finally, we propose a lightweight and effective classification head and explore two different structures: noise-based classifier and noise-guided classifier. Extensive experiments on the GenImage benchmark demonstrate the outstanding performance of our method, which achieves an average accuracy of \textbf{98.9\%} (\textbf{11.9}\%~$\uparrow$) and shows excellent cross-generator generalization capability. Particularly, our method achieves an accuracy of over 98.2\% from GAN to Diffusion and over 99.2\% from Diffusion to GAN. Moreover, it performs error extraction at the millisecond level, nearly a hundred times faster than existing methods. The code is at https://github.com/hongsong-wang/LOTA.
Abstract（参考訳）: GANと拡散モデルの急速な進歩により、AI生成画像と実際の画像との区別がより困難になる。近年の研究では、画像がAI生成されているかどうかを判断する重要な特徴として、画像ベースの再構成エラーを用いることが多い。しかし、これらの手法は一般的に高い計算コストを伴い、原画像に存在する本質的なノイズの特徴を捉えることができない。これらの問題を解決するために,ビット平面に基づく画像処理を用いて誤り抽出を革新的に洗練し,低ビット平面は画像のノイズパターンを実際に表現する。ノイズの多い画像生成を誘導する有効なビットプレーンを導入し,スケーリングやしきい値設定など,さまざまな画像正規化戦略を活用する。そして,AIによる画像検出を容易にするため,雑音信号の増幅を行うため,ノイズスコアの計算に多方向勾配を適用した最大勾配パッチ選択を設計し,最も高いスコアの領域を選択する。最後に、軽量で効果的な分類ヘッドを提案し、ノイズベース分類器とノイズ誘導分類器の2つの異なる構造を探索する。 GenImage ベンチマークの大規模な実験により,提案手法の優れた性能を示し,その精度は \textbf{98.9\%} (\textbf{11.9}\%~$\uparrow$) であり,優れたクロスジェネレータ一般化能力を示す。特に,本手法は, GAN から Diffusion への98.2\% および Diffusion から GAN への99.2\% 以上の精度を実現する。さらに、既存の手法の100倍近く高速なミリ秒レベルでエラー抽出を行う。コードはhttps://github.com/hongsong-wang/LOTAにある。

論文の概要: LOTA: Bit-Planes Guided AI-Generated Image Detection

関連論文リスト