Fugu-MT 論文翻訳(概要): Adversarial Attacks on Audio Deepfake Detection: A Benchmark and Comparative Study

論文の概要: Adversarial Attacks on Audio Deepfake Detection: A Benchmark and Comparative Study

arxiv url: http://arxiv.org/abs/2509.07132v1
Date: Mon, 08 Sep 2025 18:33:24 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-10 14:38:27.08219
Title: Adversarial Attacks on Audio Deepfake Detection: A Benchmark and Comparative Study
Title（参考訳）: オーディオディープフェイク検出における敵対的攻撃 : ベンチマークと比較研究
Authors: Kutub Uddin, Muhammad Umar Farooq, Awais Khan, Khalid Mahmood Malik,
Abstract要約: State-of-the-art (SoTA) Audio Deepfake Detection (ADD) 法が提案されている。これらの方法の有効性は、生成的シグネチャを隠蔽する反法医学的(AF)攻撃によって著しく損なわれている。我々は,5つのディープフェイク・ベンチマーク・データセットに対して,生とスペクトログラムに基づくアプローチの2つのカテゴリを用いて,ADD法を広範囲に評価する。
参考スコア（独自算出の注目度）: 11.300081537458757
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The widespread use of generative AI has shown remarkable success in producing highly realistic deepfakes, posing a serious threat to various voice biometric applications, including speaker verification, voice biometrics, audio conferencing, and criminal investigations. To counteract this, several state-of-the-art (SoTA) audio deepfake detection (ADD) methods have been proposed to identify generative AI signatures to distinguish between real and deepfake audio. However, the effectiveness of these methods is severely undermined by anti-forensic (AF) attacks that conceal generative signatures. These AF attacks span a wide range of techniques, including statistical modifications (e.g., pitch shifting, filtering, noise addition, and quantization) and optimization-based attacks (e.g., FGSM, PGD, C \& W, and DeepFool). In this paper, we investigate the SoTA ADD methods and provide a comparative analysis to highlight their effectiveness in exposing deepfake signatures, as well as their vulnerabilities under adversarial conditions. We conducted an extensive evaluation of ADD methods on five deepfake benchmark datasets using two categories: raw and spectrogram-based approaches. This comparative analysis enables a deeper understanding of the strengths and limitations of SoTA ADD methods against diverse AF attacks. It does not only highlight vulnerabilities of ADD methods, but also informs the design of more robust and generalized detectors for real-world voice biometrics. It will further guide future research in developing adaptive defense strategies that can effectively counter evolving AF techniques.
Abstract（参考訳）: 生成AIの普及は、話者検証、音声バイオメトリックス、オーディオ会議、刑事捜査など、さまざまな音声バイオメトリック応用に深刻な脅威を呈する、非常に現実的なディープフェイクの製作に顕著な成功を収めている。これに対抗するために、複数の最先端(SoTA)オーディオディープフェイク検出(ADD)手法が提案され、生成的AIシグネチャを識別し、実音声とディープフェイク音声を区別する。しかし、これらの方法の有効性は、遺伝子的シグネチャを隠蔽する反法医学的(AF)攻撃によって著しく損なわれている。これらのAF攻撃は、統計的修正(ピッチシフト、フィルタリング、ノイズ追加、量子化など)や最適化ベースの攻撃(例えば、FGSM、PGD、C \&W、DeepFool)を含む幅広い技術にまたがっている。本稿では,SoTA ADD法について検討し,その効果と,敵条件下での脆弱性を明らかにするための比較分析を行った。我々は,5つのディープフェイク・ベンチマーク・データセットに対して,生と分光の2つのカテゴリを用いたADD手法の広範囲な評価を行った。この比較分析により、多様なAF攻撃に対するSoTA ADD法の強度と限界をより深く理解することができる。 ADD法の脆弱性を浮き彫りにするだけでなく、実世界の音声バイオメトリックスのためのより堅牢で一般化された検出器の設計を知らせる。進化するAF技術に効果的に対抗できる適応型防衛戦略の開発における今後の研究の指針となる。

論文の概要: Adversarial Attacks on Audio Deepfake Detection: A Benchmark and Comparative Study

関連論文リスト