Fugu-MT 論文翻訳(概要): Bi-Level Optimization for Self-Supervised AI-Generated Face Detection

論文の概要: Bi-Level Optimization for Self-Supervised AI-Generated Face Detection

arxiv url: http://arxiv.org/abs/2507.22824v1
Date: Wed, 30 Jul 2025 16:38:29 GMT
ステータス: 翻訳完了
システム内更新日: 2025-07-31 16:14:18.337937
Title: Bi-Level Optimization for Self-Supervised AI-Generated Face Detection
Title（参考訳）: 自己監督型AI生成顔検出のためのバイレベル最適化
Authors: Mian Zou, Nan Zhong, Baosheng Yu, Yibing Zhan, Kede Ma,
Abstract要約: 両レベル最適化に基づくAI生成顔検出器の自己教師方式を提案する。我々の検出器は、一級・二級の分類設定において、既存のアプローチよりも大幅に優れています。
参考スコア（独自算出の注目度）: 56.57881725223548
License: http://creativecommons.org/licenses/by/4.0/
Abstract: AI-generated face detectors trained via supervised learning typically rely on synthesized images from specific generators, limiting their generalization to emerging generative techniques. To overcome this limitation, we introduce a self-supervised method based on bi-level optimization. In the inner loop, we pretrain a vision encoder only on photographic face images using a set of linearly weighted pretext tasks: classification of categorical exchangeable image file format (EXIF) tags, ranking of ordinal EXIF tags, and detection of artificial face manipulations. The outer loop then optimizes the relative weights of these pretext tasks to enhance the coarse-grained detection of manipulated faces, serving as a proxy task for identifying AI-generated faces. In doing so, it aligns self-supervised learning more closely with the ultimate goal of AI-generated face detection. Once pretrained, the encoder remains fixed, and AI-generated faces are detected either as anomalies under a Gaussian mixture model fitted to photographic face features or by a lightweight two-layer perceptron serving as a binary classifier. Extensive experiments demonstrate that our detectors significantly outperform existing approaches in both one-class and binary classification settings, exhibiting strong generalization to unseen generators.
Abstract（参考訳）: 教師付き学習によって訓練されたAI生成顔検出器は通常、特定のジェネレータから合成された画像に依存し、その一般化を新しい生成技術に制限する。この制限を克服するため,二段階最適化に基づく自己監督手法を提案する。 In the inner loop, we training a vision encoder only on photoic face images using a set of linearly weighted pretext tasks: classification of categorical exchangeable image file format (EXIF) tag, ranking of Ordinal EXIF tags, and detection of artificial face manipulates。外ループは、これらのプリテキストタスクの相対重みを最適化し、操作された顔の粗いきめ細かな検出を強化し、AI生成された顔を特定するためのプロキシタスクとして機能する。そうすることで、自己教師型学習をAI生成顔検出の最終的な目標とより緊密に整合させる。事前訓練後、エンコーダは固定され、AI生成顔は、写真顔の特徴に適合したガウス混合モデルの下で異常として検出されるか、バイナリ分類器として機能する軽量の2層パーセプトロンによって検出される。大規模な実験により、我々の検出器は、一級と二級の両方の分類設定において既存のアプローチを著しく上回り、目に見えないジェネレータに強い一般化を示すことが示された。

関連論文リスト

Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning [30.415427474641813]
本稿では,多様なテスト画像の処理の柔軟性を向上する,画像適応型プロンプト学習(IAPL)という新しいフレームワークを提案する。これは2つの適応モジュール、すなわち条件情報学習器と信頼駆動適応予測からなる。実験の結果、IAPLは最先端のパフォーマンスを達成しており、95.61%と96.7%は広く使われているUniversalFakeDetectとGenImageの2つのデータセットの精度を示している。
論文参考訳（メタデータ） (2025-08-03T05:41:24Z)
Quality Assessment and Distortion-aware Saliency Prediction for AI-Generated Omnidirectional Images [70.49595920462579]
本研究は,AIGODIの品質評価と歪みを考慮したサリエンシ予測問題について検討する。 BLIP-2モデルに基づく共有エンコーダを用いた2つのモデルを提案する。
論文参考訳（メタデータ） (2025-06-27T05:36:04Z)
Self-Supervised Learning for Detecting AI-Generated Faces as Anomalies [58.11545090128854]
本稿では、写真顔画像から純粋にカメラ固有の特徴と顔特有の特徴の自己教師付き学習を活用することで、AI生成顔の異常検出手法について述べる。提案手法の成功は,特徴抽出器を訓練して4つの通常交換可能な画像ファイルフォーマット(EXIF)をランク付けし,人工的に操作された顔画像の分類を行うプリテキストタスクを設計することにある。
論文参考訳（メタデータ） (2025-01-04T06:23:24Z)
Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models [68.90917438865078]
顔合成と編集のためのディープフェイク技術は、生成モデルに重大なリスクをもたらす。本稿では,モデルバックボーン,タイプ,データセット間で検出性能がどう変化するかを検討する。本稿では、顔画像のパフォーマンスを向上させるContrastive Blurと、ノイズタイプのバイアスに対処し、ドメイン間のパフォーマンスのバランスをとるMINDERを紹介する。
論文参考訳（メタデータ） (2024-11-28T13:04:45Z)
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection [58.87142367781417]
航法的に訓練された検出器は、限定的で単調な偽のパターンに過度に適合する傾向にあり、特徴空間は高度に制約され、低ランクになる。潜在的な治療法の1つは、ビジョンファウンデーションモデルに事前訓練された知識を取り入れて、機能領域を広げることである。主要なコンポーネントを凍結し、残ったコンポーネントのみを適用することで、フェイクパターンを学習しながら、トレーニング済みの知識を保存します。
論文参考訳（メタデータ） (2024-11-23T19:10:32Z)
UniForensics: Face Forgery Detection via General Facial Representation [60.5421627990707]
高レベルの意味的特徴は摂動の影響を受けにくく、フォージェリー固有の人工物に限らないため、より強い一般化がある。我々は、トランスフォーマーベースのビデオネットワークを活用する新しいディープフェイク検出フレームワークUniForensicsを導入し、顔の豊かな表現のためのメタファンクショナルな顔分類を行う。
論文参考訳（メタデータ） (2024-07-26T20:51:54Z)
RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection [60.960988614701414]
RIGIDは、堅牢なAI生成画像検出のためのトレーニング不要でモデルに依存しない方法である。 RIGIDは、既存のトレーニングベースおよびトレーニング不要な検出器を著しく上回っている。
論文参考訳（メタデータ） (2024-05-30T14:49:54Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。