Fugu-MT 論文翻訳(概要): Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection

論文の概要: Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection

arxiv url: http://arxiv.org/abs/2508.06318v1
Date: Fri, 08 Aug 2025 13:48:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-11 20:39:06.243121
Title: Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection
Title（参考訳）: ガウスプラッターが指導するエキスパートの混在 : 微弱に監督されたビデオ異常検出のための新しいアプローチ
Authors: Giacomo D'Amicantonio, Snehashis Majhi, Quan Kong, Lorenzo Garattoni, Gianpiero Francesca, François Bremond, Egor Bondarev,
Abstract要約: ビデオ異常検出(VAD)は、異常事象のばらつきとラベル付きデータの限られた可用性のために難しい課題である。本稿では,一組のエキスパートモデルを用いて,それぞれが特定の異常型をキャプチャする新しいフレームワークを提案する。提案手法は,UCF-Crimeデータセット上で91.58%のAUCを達成し,XD-ViolenceデータセットとMSADデータセットにおいて優れた結果を示す。
参考スコア（独自算出の注目度）: 7.435598538875321
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Video Anomaly Detection (VAD) is a challenging task due to the variability of anomalous events and the limited availability of labeled data. Under the Weakly-Supervised VAD (WSVAD) paradigm, only video-level labels are provided during training, while predictions are made at the frame level. Although state-of-the-art models perform well on simple anomalies (e.g., explosions), they struggle with complex real-world events (e.g., shoplifting). This difficulty stems from two key issues: (1) the inability of current models to address the diversity of anomaly types, as they process all categories with a shared model, overlooking category-specific features; and (2) the weak supervision signal, which lacks precise temporal information, limiting the ability to capture nuanced anomalous patterns blended with normal events. To address these challenges, we propose Gaussian Splatting-guided Mixture of Experts (GS-MoE), a novel framework that employs a set of expert models, each specialized in capturing specific anomaly types. These experts are guided by a temporal Gaussian splatting loss, enabling the model to leverage temporal consistency and enhance weak supervision. The Gaussian splatting approach encourages a more precise and comprehensive representation of anomalies by focusing on temporal segments most likely to contain abnormal events. The predictions from these specialized experts are integrated through a mixture-of-experts mechanism to model complex relationships across diverse anomaly patterns. Our approach achieves state-of-the-art performance, with a 91.58% AUC on the UCF-Crime dataset, and demonstrates superior results on XD-Violence and MSAD datasets. By leveraging category-specific expertise and temporal guidance, GS-MoE sets a new benchmark for VAD under weak supervision.
Abstract（参考訳）: ビデオ異常検出(VAD)は、異常事象のばらつきとラベル付きデータの限られた可用性のために難しい課題である。 Weakly-Supervised VAD(WSVAD)パラダイムでは、トレーニング中にビデオレベルのラベルのみを提供し、フレームレベルで予測を行う。最先端のモデルは単純な異常(例:爆発)でうまく機能するが、それらは複雑な現実世界のイベント(例:万引き)と競合する。この難しさは,(1) カテゴリー固有の特徴を網羅して,すべてのカテゴリを共有モデルで処理する現在のモデルが異常な種類の多様性に対処できないこと,(2) 正確な時間的情報を欠いている弱い監視信号が,通常の事象と混在する異常なパターンを捉える能力を制限すること,の2つに起因している。これらの課題に対処するため,我々は,特定の異常型を捉えることに特化した,一連の専門家モデルを用いた新しいフレームワークであるGaussian Splatting-Guided Mixture of Experts (GS-MoE)を提案する。これらの専門家は、時間的ガウスのスプラッティング損失によって導かれ、モデルが時間的一貫性を活用し、監督の弱さを高めることができる。ガウスのスプレイティングアプローチは、異常事象を含む可能性が最も高い時間セグメントに焦点を当てることで、より正確で包括的な異常表現を促進する。これらの専門専門家の予測は、様々な異常パターンをまたいだ複雑な関係をモデル化する、Mix-of-expertsメカニズムを通じて統合される。提案手法は,UCF-Crimeデータセット上で91.58%のAUCを達成し,XD-ViolenceデータセットとMSADデータセットにおいて優れた結果を示す。カテゴリ固有の専門知識と時間的ガイダンスを活用することで、GS-MoEは、弱い監督の下でVADの新しいベンチマークを設定できる。

論文の概要: Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection

関連論文リスト