Fugu-MT 論文翻訳(概要): Micro-Expression Recognition via Fine-Grained Dynamic Perception

論文の概要: Micro-Expression Recognition via Fine-Grained Dynamic Perception

arxiv url: http://arxiv.org/abs/2509.06015v1
Date: Sun, 07 Sep 2025 11:13:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-09 14:07:03.818292
Title: Micro-Expression Recognition via Fine-Grained Dynamic Perception
Title（参考訳）: 微細粒度動的知覚による微小表現認識
Authors: Zhiwen Shao, Yifan Cheng, Fan Zhang, Xuehuai Shi, Canlin Li, Lizhuang Ma, Dit-yan Yeung,
Abstract要約: 顔マイクロ圧縮認識(MER)のためのFDPフレームワークを開発した。時系列の原フレーム列のフレームレベルの特徴をランク付けし、ランク付けプロセスはMEの出現と動きの両方の動的情報をエンコードする。提案手法は最先端のMER法よりも優れており,動的画像構築に有効である。
参考スコア（独自算出の注目度）: 64.26947471761916
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Facial micro-expression recognition (MER) is a challenging task, due to the transience, subtlety, and dynamics of micro-expressions (MEs). Most existing methods resort to hand-crafted features or deep networks, in which the former often additionally requires key frames, and the latter suffers from small-scale and low-diversity training data. In this paper, we develop a novel fine-grained dynamic perception (FDP) framework for MER. We propose to rank frame-level features of a sequence of raw frames in chronological order, in which the rank process encodes the dynamic information of both ME appearances and motions. Specifically, a novel local-global feature-aware transformer is proposed for frame representation learning. A rank scorer is further adopted to calculate rank scores of each frame-level feature. Afterwards, the rank features from rank scorer are pooled in temporal dimension to capture dynamic representation. Finally, the dynamic representation is shared by a MER module and a dynamic image construction module, in which the former predicts the ME category, and the latter uses an encoder-decoder structure to construct the dynamic image. The design of dynamic image construction task is beneficial for capturing facial subtle actions associated with MEs and alleviating the data scarcity issue. Extensive experiments show that our method (i) significantly outperforms the state-of-the-art MER methods, and (ii) works well for dynamic image construction. Particularly, our FDP improves by 4.05%, 2.50%, 7.71%, and 2.11% over the previous best results in terms of F1-score on the CASME II, SAMM, CAS(ME)^2, and CAS(ME)^3 datasets, respectively. The code is available at https://github.com/CYF-cuber/FDP.
Abstract（参考訳）: MER(Facial Micro-Expression Recognition)は、マイクロ表現(ME)の透明性、微妙さ、ダイナミックスのために難しい課題である。既存のほとんどの手法は手作りの機能やディープネットワークに頼っており、前者はキーフレームを必要とすることが多く、後者は小規模で低多様性のトレーニングデータに悩まされている。本稿では,MERのためのFDPフレームワークを開発する。本稿では, フレーム列のフレームレベルの特徴を時系列順にランク付けし, ランク付けプロセスがMEの出現と動きの両方の動的情報をエンコードする手法を提案する。具体的には,フレーム表現学習のための局所的特徴認識変換器を提案する。各フレームレベルの特徴のランクスコアを計算するためにランクスコアがさらに採用される。その後、ランクスコアからのランク特徴を時間次元にプールし、動的表現をキャプチャする。最後に、動的表現をMERモジュールと動的画像構築モジュールで共有し、前者はMEカテゴリを予測し、後者はエンコーダデコーダ構造を用いて動的画像を構築する。動的画像構築タスクの設計は、MEに関連する顔の微妙な動作を捕捉し、データ不足の問題を軽減するのに有用である。広汎な実験により、我々の方法が示される (i)最先端のMER法を著しく上回り、 (ii)動的画像構築には有効である。特に,我々のFDPは,CASME II,SAMM,CAS(ME)^2,CAS(ME)^3データセットのF1スコアにおいて,従来の最良値よりも4.05%,2.50%,7.71%,2.11%向上した。コードはhttps://github.com/CYF-cuber/FDPで入手できる。

論文の概要: Micro-Expression Recognition via Fine-Grained Dynamic Perception

関連論文リスト