Fugu-MT 論文翻訳(概要): DecepGPT: Schema-Driven Deception Detection with Multicultural Datasets and Robust Multimodal Learning

論文の概要: DecepGPT: Schema-Driven Deception Detection with Multicultural Datasets and Robust Multimodal Learning

arxiv url: http://arxiv.org/abs/2603.23916v1
Date: Wed, 25 Mar 2026 04:06:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.121855
Title: DecepGPT: Schema-Driven Deception Detection with Multicultural Datasets and Robust Multimodal Learning
Title（参考訳）: DecepGPT:マルチカルチャーデータセットとロバストなマルチモーダル学習を用いたスキーマ駆動型誤認検出
Authors: Jiajian Huang, Dongliang Zhu, Zitong YU, Hui Ma, Jiayu Zhang, Chunmei Zhu, Xiaochun Cao,
Abstract要約: マルチモーダル偽装検出は、法医学とセキュリティのための聴覚的手がかりを解析することにより、偽装行動を特定することを目的としている。既存のベンチマークでは、中間的な推論手段を使わずにバイナリラベルのみを提供する。構造的キューレベルの記述と推論チェーンを用いた推論データセットを構築した。 1695年のサンプルでは、非実験的偽装検出データセットとしては最大である。
参考スコア（独自算出の注目度）: 64.33887406863899
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multimodal deception detection aims to identify deceptive behavior by analyzing audiovisual cues for forensics and security. In these high-stakes settings, investigators need verifiable evidence connecting audiovisual cues to final decisions, along with reliable generalization across domains and cultural contexts. However, existing benchmarks provide only binary labels without intermediate reasoning cues. Datasets are also small with limited scenario coverage, leading to shortcut learning. We address these issues through three contributions. First, we construct reasoning datasets by augmenting existing benchmarks with structured cue-level descriptions and reasoning chains, enabling model output auditable reports. Second, we release T4-Deception, a multicultural dataset based on the unified ``To Tell The Truth'' television format implemented across four countries. With 1695 samples, it is the largest non-laboratory deception detection dataset. Third, we propose two modules for robust learning under small-data conditions. Stabilized Individuality-Commonality Synergy (SICS) refines multimodal representations by synergizing learnable global priors with sample-adaptive residuals, followed by a polarity-aware adjustment that bi-directionally recalibrates representations. Distilled Modality Consistency (DMC) aligns modality-specific predictions with the fused multimodal predictions via knowledge distillation to prevent unimodal shortcut learning. Experiments on three established benchmarks and our novel dataset demonstrate that our method achieves state-of-the-art performance in both in-domain and cross-domain scenarios, while exhibiting superior transferability across diverse cultural contexts. The datasets and codes will be released.
Abstract（参考訳）: マルチモーダル偽装検出は、法医学とセキュリティのための聴覚的手がかりを解析することにより、偽装行動を特定することを目的としている。これらの高精細な環境では、調査員は、オーディオ視覚的手がかりと最終決定を結び、ドメインと文化の文脈をまたいだ信頼性の高い一般化と、検証可能な証拠が必要である。しかし、既存のベンチマークは中間的推論手段なしでバイナリラベルのみを提供する。データセットも小さく、シナリオカバレッジが制限されているため、ショートカット学習につながります。これらの問題に3つのコントリビューションを通じて対処する。まず、構造化キューレベルの記述と推論チェーンで既存のベンチマークを拡張して推論データセットを構築し、モデル出力の監査可能なレポートを可能にする。第2に,4カ国にまたがる「真実を伝える」テレビフォーマットに基づく多文化データセットであるT4-Deceptionをリリースする。 1695年のサンプルでは、非実験的偽装検出データセットとしては最大である。第3に、小規模データ条件下での堅牢な学習のための2つのモジュールを提案する。安定化されたパーソナリティ・コモンナリティ・シナジー(SICS)は、学習可能なグローバル事前とサンプル適応残差を相乗化することでマルチモーダル表現を洗練し、続いて、双方向で表現を再分類する極性対応調整を行う。 DMC (Distilled Modality Consistency) は、モダリティ固有の予測を知識蒸留による融合マルチモーダル予測と整合させ、一助的ショートカット学習を防ぐ。 3つの確立されたベンチマークと新しいデータセットによる実験により,本手法はドメイン内シナリオとドメイン間シナリオの両方において最先端のパフォーマンスを実現し,多様な文化的文脈において優れた転送性を示す。データセットとコードはリリースされる。

論文の概要: DecepGPT: Schema-Driven Deception Detection with Multicultural Datasets and Robust Multimodal Learning

関連論文リスト