Fugu-MT 論文翻訳(概要): Refining Diffusion Models for Motion Synthesis with an Acceleration Loss to Generate Realistic IMU Data

論文の概要: Refining Diffusion Models for Motion Synthesis with an Acceleration Loss to Generate Realistic IMU Data

arxiv url: http://arxiv.org/abs/2512.08859v1
Date: Tue, 09 Dec 2025 17:51:01 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-10 22:28:08.073043
Title: Refining Diffusion Models for Motion Synthesis with an Acceleration Loss to Generate Realistic IMU Data
Title（参考訳）: 実時間IMUデータを生成する加速損失を用いた運動合成のための精製拡散モデル
Authors: Lars Ole Häusler, Lena Uhlenberg, Göran Köber, Diyora Salimova, Oliver Amft,
Abstract要約: 現実的なIMUデータを得るために,テキストからIMU(慣性計測単位)の動き合成フレームワークを提案する。加速に基づく2次損失(L_acc)を伴う事前学習拡散モデル L_accは、生成された動きの離散的な2階時間差に一貫性を強制する。
参考スコア（独自算出の注目度）: 1.291843130404247
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: We propose a text-to-IMU (inertial measurement unit) motion-synthesis framework to obtain realistic IMU data by fine-tuning a pretrained diffusion model with an acceleration-based second-order loss (L_acc). L_acc enforces consistency in the discrete second-order temporal differences of the generated motion, thereby aligning the diffusion prior with IMU-specific acceleration patterns. We integrate L_acc into the training objective of an existing diffusion model, finetune the model to obtain an IMU-specific motion prior, and evaluate the model with an existing text-to-IMU framework that comprises surface modelling and virtual sensor simulation. We analysed acceleration signal fidelity and differences between synthetic motion representation and actual IMU recordings. As a downstream application, we evaluated Human Activity Recognition (HAR) and compared the classification performance using data of our method with the earlier diffusion model and two additional diffusion model baselines. When we augmented the earlier diffusion model objective with L_acc and continued training, L_acc decreased by 12.7% relative to the original model. The improvements were considerably larger in high-dynamic activities (i.e., running, jumping) compared to low-dynamic activities~(i.e., sitting, standing). In a low-dimensional embedding, the synthetic IMU data produced by our refined model shifts closer to the distribution of real IMU recordings. HAR classification trained exclusively on our refined synthetic IMU data improved performance by 8.7% compared to the earlier diffusion model and by 7.6% over the best-performing comparison diffusion model. We conclude that acceleration-aware diffusion refinement provides an effective approach to align motion generation and IMU synthesis and highlights how flexible deep learning pipelines are for specialising generic text-to-motion priors to sensor-specific tasks.
Abstract（参考訳）: 本稿では,加速度に基づく2次損失(L_acc)による事前学習拡散モデルの微調整により,現実的なIMUデータを得るためのテキスト・ツー・IMU(慣性計測単位)モーションシンセシスフレームワークを提案する。 L_accは、生成した動きの離散的な2次時間差の一貫性を強制し、IMU固有の加速度パターンに先行して拡散を調整する。既存の拡散モデルのトレーニング対象にL_accを組み込んで,IMU固有の動きを事前に把握し,表面モデリングと仮想センサシミュレーションを組み合わせた既存のテキスト・ツー・IMUフレームワークを用いてモデルの評価を行う。我々は、加速度信号の忠実度と、合成運動表現と実際のIMU記録の違いを分析した。ダウンストリームアプリケーションとして,HAR(Human Activity Recognition)を評価し,従来の拡散モデルと2つの拡散モデルに基づく分類性能を比較した。 L_acc の初期拡散モデル対象をL_acc に拡張し継続訓練を行ったところ,L_acc は元のモデルと比較して 12.7% 減少していた。これらの改善は、低ダイナミックな活動(つまり、立位、立位)に比べて、高ダイナミックな活動(すなわち、ランニング、ジャンプ)においてかなり大きくなっていた。低次元埋め込みでは、改良されたモデルにより生成された合成IMUデータが実際のIMU記録の分布に近づく。 HAR分類は, 従来の拡散モデルに比べて8.7%, 最高の比較拡散モデルよりも7.6%向上した。我々は,加速度対応拡散改善法が,動作生成とIMU合成の整合性に有効なアプローチであり,センサ固有のタスクに先立って,汎用的なテキスト・ツー・モーションを専門とする深層学習パイプラインがいかに柔軟かを強調した。

関連論文リスト

Optimization Benchmark for Diffusion Models on Dynamical Systems [1.1603243575080533]
本稿では,フロートラジェクトリをデノナイズする拡散モデルをトレーニングするための最近の最適化アルゴリズムをベンチマークする。私たちは、MuonとSOAPがAdamWの非常に効率的な代替品であること(18%の最終損失)を観察します。
論文参考訳（メタデータ） (2025-10-22T08:50:31Z)
Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification [14.942677904783759]
本稿では,スペーシフィケーション,モデルスタルネス,モビリティに起因した接触パターン間の相互作用を特徴付ける理論的モデルを開発する。本稿では,接触時間とモデル安定化度に基づいてスペーシフィケーション度を最適化する移動性を考慮した動的スペーシフィケーションアルゴリズムを提案する。最先端のベンチマークと比較すると、MADSアルゴリズムはCIFAR-10データセットの画像分類精度を8.76%向上し、Argoverse軌道予測データセットの平均変位誤差を9.46%削減する。
論文参考訳（メタデータ） (2025-06-08T23:58:32Z)
FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation [51.110607281391154]
FlowMoは、テキスト・ビデオ・モデルにおける動きコヒーレンスを高めるためのトレーニング不要のガイダンス手法である。時間次元のパッチワイドな分散を測定して動きのコヒーレンスを推定し、サンプリング中にこの分散を動的に減少させるためにモデルを導く。
論文参考訳（メタデータ） (2025-06-01T19:55:33Z)
Joint Velocity-Growth Flow Matching for Single-Cell Dynamics Modeling [38.9381649903752]
破壊的な測定手法と細胞増殖・死の結果、スナップショット間の不均衡および不均衡なデータが得られる。単細胞個体群における状態遷移と大量成長を共同で学習する新パラダイムであるVelocity-Growth Flow Matchingを提案する。 VGFMは、静的半緩和された最適輸送の2周期の動的理解によって駆動される、状態速度と質量の成長速度を含む理想的な単一セルダイナミクスを構築する。
論文参考訳（メタデータ） (2025-05-19T17:48:04Z)
REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning [95.07708090428814]
本稿では,一段階拡散モデルREWINDを提案する。身体中心運動と手の動きの相関を効果的にモデル化する。また、ターゲットアイデンティティの小さなポーズ例に基づく新しいアイデンティティ条件付け手法を提案し、動き推定品質をさらに向上させる。
論文参考訳（メタデータ） (2025-04-07T11:44:11Z)
Energy-Based Diffusion Language Models for Text Generation [126.23425882687195]
エネルギーベース拡散言語モデル(Energy-based Diffusion Language Model, EDLM)は、拡散ステップごとに全シーケンスレベルで動作するエネルギーベースモデルである。我々のフレームワークは、既存の拡散モデルよりも1.3$times$のサンプリングスピードアップを提供する。
論文参考訳（メタデータ） (2024-10-28T17:25:56Z)
Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction [2.402745776249116]
本稿では,知識蒸留とベイズ最適化を用いた1段階多層パーセプトロン(MLP)拡散モデルによる動き予測のトレーニングを提案する。提案モデルでは,予測速度を大幅に向上し,性能の劣化を伴わないリアルタイム予測を実現している。
論文参考訳（メタデータ） (2024-09-19T04:36:40Z)
Synthetic location trajectory generation using categorical diffusion models [50.809683239937584]
拡散モデル(DPM)は急速に進化し、合成データのシミュレーションにおける主要な生成モデルの一つとなっている。本稿では,個人が訪れた物理的位置を表す変数列である合成個別位置軌跡(ILT)の生成にDPMを用いることを提案する。
論文参考訳（メタデータ） (2024-02-19T15:57:39Z)
Generative Modeling with Phase Stochastic Bridges [49.4474628881673]
拡散モデル(DM)は、連続入力のための最先端の生成モデルを表す。我々はtextbfphase space dynamics に基づく新しい生成モデリングフレームワークを提案する。我々のフレームワークは、動的伝播の初期段階において、現実的なデータポイントを生成する能力を示す。
論文参考訳（メタデータ） (2023-10-11T18:38:28Z)
How Much is Enough? A Study on Diffusion Times in Score-based Generative Models [76.76860707897413]
現在のベストプラクティスは、フォワードダイナミクスが既知の単純なノイズ分布に十分に近づくことを確実にするために大きなTを提唱している。本稿では, 理想とシミュレーションされたフォワードダイナミクスのギャップを埋めるために補助モデルを用いて, 標準的な逆拡散過程を導出する方法について述べる。
論文参考訳（メタデータ） (2022-06-10T15:09:46Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。