Fugu-MT Paper Translation (Summary): PhysNeXt: Next-Generation Dual-Branch Structured Attention Fusion Network for Remote Photoplethysmography Measurement

Paper overview: PhysNeXt: Next-Generation Dual-Branch Structured Attention Fusion Network for Remote Photoplethysmography Measurement

arxiv url: http://arxiv.org/abs/2603.19752v1
Date: Fri, 20 Mar 2026 08:37:02 GMT
Status: Translation complete
Updated in system: 2026-03-23 19:48:39.055981
Title: PhysNeXt: Next-Generation Dual-Branch Structured Attention Fusion Network for Remote Photoplethysmography Measurement
Authors: Junzhe Cao, Bo Zhao, Zhiyi Niu, Dan Guo, Yue Sun, Haochen Liang, Yong Xu, Zitong Yu
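Abstract summary: Remote photoplethysmography (rPPG) measures heart rate and other vital signs by analyzing color variations in facial skin caused by cardiac pulsation. Current methods are mainly based on either end-to-end modeling from raw videos or intermediate spatial-temporal map (STMap) representations. This paper proposes PhysNeXt, a dual-input deep learning framework that jointly uses video frames and STMap representations.
Reference score (independently computed attention level): 50.524262997433546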
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Remote photoplethysmography (rPPG) enables contactless measurement of heart rate and other vital signs by analyzing subtle color variations in facial skin induced by cardiac pulsation. Current rPPG methods are mainly based on either end-to-end modeling from raw videos or intermediate spatial-temporal map (STMap) representations. The former preserves complete spatiotemporal information and can capture subtle heartbeat-related signals, but it also introduces substantial noise from motion artifacts and illumination variations. The latter stacks the temporal color changes of multiple facial regions of interest into compact two-dimensional representations, significantly reducing data volume and computational complexity, although some high-frequency details may be lost. To effectively integrate the mutual strengths, we propose PhysNeXt, a dual-input deep learning framework that jointly exploits video frames and STMap representations. By incorporating a spatio-temporal difference modeling unit, a cross-modal interaction module, and a structured attention-based decoder, PhysNeXt collaboratively enhances the robustness of pulse signal extraction. Experimental results demonstrate that PhysNeXt achieves more stable and fine-grained rPPG signal recovery under challenging conditions, validating the effectiveness of joint modeling of video and STMap representations. The codes will be released.
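The abstract describes an STMap as stacking the temporal color changes of multiple facial regions of interest into a compact two-dimensional representation. The following is a minimal sketch of that idea in Python with NumPy, not the authors' implementation. It assumes the video is already cropped to the face, and it uses a uniform grid of cells as a stand-in for facial regions of interest; the function name `build_stmap`, the `grid` parameter, and the per-ROI min-max scaling are illustrative assumptions that do not come from the paper.

```python
import numpy as np


def build_stmap(frames, grid=(4, 4)):
    """Build a simple spatial-temporal map from a face-cropped video.

    frames: array of shape (T, H, W, C), one color image per time step.
    grid:   (rows, cols) of the hypothetical ROI grid (an assumption here;
            real pipelines typically derive ROIs from facial landmarks).

    Returns an array of shape (rows * cols, T, C): one row per ROI, one
    column per frame, min-max scaled to [0, 1] per ROI and channel.
    """
    frames = np.asarray(frames, dtype=np.float64)
    t, h, w, c = frames.shape
    rows, cols = grid
    h_roi, w_roi = h // rows, w // cols

    # Drop trailing pixels so the frame divides evenly into grid cells.
    cropped = frames[:, : h_roi * rows, : w_roi * cols, :]

    # Split into cells and average the color inside each cell.
    blocks = cropped.reshape(t, rows, h_roi, cols, w_roi, c)
    roi_means = blocks.mean(axis=(2, 4))  # (T, rows, cols, C)

    # Stack ROIs along the first axis and time along the second.
    stmap = roi_means.reshape(t, rows * cols, c).transpose(1, 0, 2)

    # Scale each ROI/channel trace over time; constant traces map to 0.
    lo = stmap.min(axis=1, keepdims=True)
    hi = stmap.max(axis=1, keepdims=True)
    span = np.where(hi - lo > 0, hi - lo, 1.0)
    return (stmap - lo) / span
```

For a 30-frame clip and the default 4x4 grid, the result has shape `(16, 30, 3)`, which is far smaller than the raw video. That illustrates the reduction in data volume the abstract mentions, and also why detail within each region is averaged away.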

Related papers