Fugu-MT paper translation (abstract): Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities

Paper abstract: Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities

arxiv url: http://arxiv.org/abs/2510.02264v1
Date: Thu, 02 Oct 2025 17:44:31 GMT
Status: translation complete
Last updated in system: 2025-10-03 16:59:21.264295
Title: Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities
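Title (reference translation): Paving the way toward kinematic assessment using monocular video: a preclinical benchmark of deep-learning-based 3D human pose estimators against inertial sensors in daily living activities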
Authors: Mario Medrano-Paredes, Carmen Fernández-González, Francisco-Javier Díaz-Pernas, Hichem Saoudi, Javier González-Alonso, Mario Martínez-Zarzuela
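Abstract summary: This study compares monocular video-based 3D pose estimation models with inertial measurement units (IMUs). Joint angles obtained from state-of-the-art deep learning frameworks were evaluated against joint angles computed from IMU data. MotionAGFormer showed superior performance, achieving the lowest overall RMSE.
Reference score (independently computed attention measure): 1.3854111346209868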
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Advances in machine learning and wearable sensors offer new opportunities for capturing and analyzing human movement outside specialized laboratories. Accurate assessment of human movement under real-world conditions is essential for telemedicine, sports science, and rehabilitation. This preclinical benchmark compares monocular video-based 3D human pose estimation models with inertial measurement units (IMUs), leveraging the VIDIMU dataset containing a total of 13 clinically relevant daily activities which were captured using both commodity video cameras and five IMUs. During this initial study only healthy subjects were recorded, so results cannot be generalized to pathological cohorts. Joint angles derived from state-of-the-art deep learning frameworks (MotionAGFormer, MotionBERT, MMPose 2D-to-3D pose lifting, and NVIDIA BodyTrack) were evaluated against joint angles computed from IMU data using OpenSim inverse kinematics following the Human3.6M dataset format with 17 keypoints. Among them, MotionAGFormer demonstrated superior performance, achieving the lowest overall RMSE ($9.27\deg \pm 4.80\deg$) and MAE ($7.86\deg \pm 4.18\deg$), as well as the highest Pearson correlation ($0.86 \pm 0.15$) and the highest coefficient of determination $R^{2}$ ($0.67 \pm 0.28$). The results reveal that both technologies are viable for out-of-the-lab kinematic assessment. However, they also highlight key trade-offs between video- and sensor-based approaches including costs, accessibility, and precision. This study clarifies where off-the-shelf video models already provide clinically promising kinematics in healthy adults and where they lag behind IMU-based estimates while establishing valuable guidelines for researchers and clinicians seeking to develop robust, cost-effective, and user-friendly solutions for telehealth and remote patient monitoring.
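Abstract (reference translation): Advances in machine learning and wearable sensors offer new opportunities to capture and analyze human movement outside specialized laboratories. Accurate assessment of human movement under real-world conditions is essential for telemedicine, sports science, and rehabilitation. This preclinical benchmark compares monocular video-based 3D human pose estimation models with inertial measurement units (IMUs), using the VIDIMU dataset, which contains 13 clinically relevant daily activities captured with both commodity video cameras and five IMUs. Only healthy subjects were recorded in this initial study, so the results cannot be generalized to pathological cohorts. Joint angles derived from state-of-the-art deep learning frameworks (MotionAGFormer, MotionBERT, MMPose 2D-to-3D pose lifting, and NVIDIA BodyTrack) were evaluated against joint angles computed from IMU data using OpenSim inverse kinematics, following the Human3.6M dataset format with 17 keypoints. Among them, MotionAGFormer performed best, achieving the lowest overall RMSE ($9.27\deg \pm 4.80\deg$) and MAE ($7.86\deg \pm 4.18\deg$), as well as the highest Pearson correlation ($0.86 \pm 0.15$) and the highest coefficient of determination $R^{2}$ ($0.67 \pm 0.28$). The results show that both technologies are viable for out-of-the-lab kinematic assessment. However, they also highlight key trade-offs between video- and sensor-based approaches, including cost, accessibility, and precision. This study clarifies where off-the-shelf video models already provide clinically promising kinematics in healthy adults and where they lag behind IMU-based estimates, and it establishes guidelines for researchers and clinicians seeking to develop robust, cost-effective, and user-friendly solutions for telehealth and remote patient monitoring.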
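The four agreement metrics named in the abstract (RMSE, MAE, Pearson correlation, and $R^{2}$) can be illustrated with a minimal sketch for a single joint-angle time series. This is not the authors' evaluation code. The function name `agreement_metrics`, the choice of the IMU-derived angles as the reference, the definition of $R^{2}$ as one minus the residual sum of squares over the total sum of squares, and the toy angle values are all assumptions made for illustration. The abstract does not state how the metrics were aggregated across joints, activities, or subjects.

```python
import math


def agreement_metrics(reference, estimate):
    """Compare two equal-length joint-angle series given in degrees.

    `reference` is assumed to hold the IMU-derived angles and `estimate`
    the video-derived angles (an illustrative convention, not taken
    from the paper).
    """
    n = len(reference)
    if n != len(estimate) or n < 2:
        raise ValueError("series must have the same length, at least 2")

    # Per-sample error between the video estimate and the IMU reference.
    errors = [e - r for r, e in zip(reference, estimate)]
    ss_res = sum(err * err for err in errors)

    rmse = math.sqrt(ss_res / n)
    mae = sum(abs(err) for err in errors) / n

    mean_ref = sum(reference) / n
    mean_est = sum(estimate) / n
    ss_ref = sum((r - mean_ref) ** 2 for r in reference)
    ss_est = sum((e - mean_est) ** 2 for e in estimate)
    if ss_ref == 0 or ss_est == 0:
        raise ValueError("correlation and R^2 are undefined for a constant series")

    # Pearson correlation: covariance normalized by both spreads.
    cov = sum((r - mean_ref) * (e - mean_est) for r, e in zip(reference, estimate))
    pearson = cov / math.sqrt(ss_ref * ss_est)

    # Coefficient of determination, assumed here to be 1 - SS_res / SS_tot.
    r2 = 1.0 - ss_res / ss_ref

    return {"rmse": rmse, "mae": mae, "pearson": pearson, "r2": r2}


if __name__ == "__main__":
    # Hypothetical knee-flexion angles in degrees, not data from the study.
    imu_angles = [5.0, 20.0, 45.0, 60.0, 40.0, 15.0]
    video_angles = [8.0, 24.0, 41.0, 55.0, 44.0, 12.0]
    for name, value in agreement_metrics(imu_angles, video_angles).items():
        print(f"{name}: {value:.3f}")
```

Under this definition, a constant offset between the two series leaves the Pearson correlation at 1 while still lowering $R^{2}$ and raising RMSE and MAE, which is one reason the correlation and error metrics are worth reading together.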

Related papers