Fugu-MT 論文翻訳(概要): EventFace: Event-Based Face Recognition via Structure-Driven Spatiotemporal Modeling

論文の概要: EventFace: Event-Based Face Recognition via Structure-Driven Spatiotemporal Modeling

arxiv url: http://arxiv.org/abs/2604.06782v1
Date: Wed, 08 Apr 2026 07:52:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 17:30:51.407969
Title: EventFace: Event-Based Face Recognition via Structure-Driven Spatiotemporal Modeling
Title（参考訳）: EventFace: 構造駆動時空間モデリングによるイベントベースの顔認識
Authors: Qingguo Meng, Xingbo Dong, Zhe Jin, Massimo Tistarelli,
Abstract要約: イベントカメラは、照明のロバスト性やプライバシーのフレンドリさに固有の利点があるため、顔認識に期待できるモダリティを提供する。イベントベースの顔認識は、剛性な顔の動きと個々の顔形状によって形成される構造駆動のアイデンティティ表現をモデル化すべきである、と我々は主張する。本研究では,イベントベースの顔認識フレームワークであるEventFaceを提案する。
参考スコア（独自算出の注目度）: 15.313758078552437
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Event cameras offer a promising sensing modality for face recognition due to their inherent advantages in illumination robustness and privacy-friendliness. However, because event streams lack the stable photometric appearance relied upon by conventional RGB-based face recognition systems, we argue that event-based face recognition should model structure-driven spatiotemporal identity representations shaped by rigid facial motion and individual facial geometry. Since dedicated datasets for event-based face recognition remain lacking, we construct EFace, a small-scale event-based face dataset captured under rigid facial motion. To learn effectively from this limited event data, we further propose EventFace, a framework for event-based face recognition that integrates spatial structure and temporal dynamics for identity modeling. Specifically, we employ Low-Rank Adaptation (LoRA) to transfer structural facial priors from pretrained RGB face models to the event domain, thereby establishing a reliable spatial basis for identity modeling. Building on this foundation, we further introduce a Motion Prompt Encoder (MPE) to explicitly encode temporal features and a Spatiotemporal Modulator (STM) to fuse them with spatial features, thereby enhancing the representation of identity-relevant event patterns. Extensive experiments demonstrate that EventFace achieves the best performance among the evaluated baselines, with a Rank-1 identification rate of 94.19% and an equal error rate (EER) of 5.35%. Results further indicate that EventFace exhibits stronger robustness under degraded illumination than the competing methods. In addition, the learned representations exhibit reduced template reconstructability.
Abstract（参考訳）: イベントカメラは、照明のロバスト性やプライバシーのフレンドリさに固有の利点があるため、顔認識に期待できるモダリティを提供する。しかし, イベントストリームは従来のRGBベースの顔認識システムに頼っている安定な測光外観を欠いているため, イベントベースの顔認識は, 剛体的な顔の動きと個々の顔形状によって形成される時空間的同一性表現をモデル化すべきである,と論じる。イベントベースの顔認識のための専用データセットは依然として不足しているため、厳格な顔の動きでキャプチャされた小規模のイベントベースの顔データセットであるEFaceを構築した。この限られたイベントデータから効果的に学習するために,イベントベースの顔認識フレームワークであるEventFaceを提案する。具体的には、Low-Rank Adaptation (LoRA) を用いて、事前訓練されたRGB顔モデルからイベントドメインへ構造的顔の事前情報を転送し、アイデンティティモデリングのための信頼性の高い空間的基盤を確立する。この基礎の上に、時間的特徴を明示的にエンコードするモーションプロンプトエンコーダ(MPE)と、空間的特徴を融合する時空間変調器(STM)を導入し、アイデンティティ関連イベントパターンの表現を強化する。大規模な実験では、EventFaceは評価されたベースラインの中で最高のパフォーマンスを達成しており、ランク1の識別率は94.19%、エラーレートは5.35%である。結果はさらに、EventFaceは競合するメソッドよりも劣化した照明下で強い堅牢性を示すことを示している。さらに、学習された表現はテンプレート再構成可能性の低下を示す。

論文の概要: EventFace: Event-Based Face Recognition via Structure-Driven Spatiotemporal Modeling

関連論文リスト