Fugu-MT 論文翻訳(概要): Learning to Assist: Physics-Grounded Human-Human Control via Multi-Agent Reinforcement Learning

論文の概要: Learning to Assist: Physics-Grounded Human-Human Control via Multi-Agent Reinforcement Learning

arxiv url: http://arxiv.org/abs/2603.11346v1
Date: Wed, 11 Mar 2026 22:25:44 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-13 14:46:25.671925
Title: Learning to Assist: Physics-Grounded Human-Human Control via Multi-Agent Reinforcement Learning
Title（参考訳）: 支援への学習:多エージェント強化学習による物理を取り巻くヒューマン・ヒューマン制御
Authors: Yuto Shibata, Kashu Yamazaki, Lalit Jayanti, Yoshimitsu Aoki, Mariko Isogawa, Katerina Fragkiadaki,
Abstract要約: 多エージェント強化学習問題として, 密接に相互作用し, 力量変化する人間の動作系列の模倣を定式化する。 AssistMimicは、確立されたベンチマーク上でのアシストインタラクション動作の追跡に成功できる最初の方法であることを示す。
参考スコア（独自算出の注目度）: 29.898955971414154
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Humanoid robotics has strong potential to transform daily service and caregiving applications. Although recent advances in general motion tracking within physics engines (GMT) have enabled virtual characters and humanoid robots to reproduce a broad range of human motions, these behaviors are primarily limited to contact-less social interactions or isolated movements. Assistive scenarios, by contrast, require continuous awareness of a human partner and rapid adaptation to their evolving posture and dynamics. In this paper, we formulate the imitation of closely interacting, force-exchanging human-human motion sequences as a multi-agent reinforcement learning problem. We jointly train partner-aware policies for both the supporter (assistant) agent and the recipient agent in a physics simulator to track assistive motion references. To make this problem tractable, we introduce a partner policies initialization scheme that transfers priors from single-human motion-tracking controllers, greatly improving exploration. We further propose dynamic reference retargeting and contact-promoting reward, which adapt the assistant's reference motion to the recipient's real-time pose and encourage physically meaningful support. We show that AssistMimic is the first method capable of successfully tracking assistive interaction motions on established benchmarks, demonstrating the benefits of a multi-agent RL formulation for physically grounded and socially aware humanoid control.
Abstract（参考訳）: ヒューマノイドロボットは、日々のサービスと介護の応用を変革する強い可能性を秘めている。近年の物理エンジン(GMT)における一般的なモーショントラッキングの進歩により、仮想キャラクターやヒューマノイドロボットは幅広い人間の動きを再現できるようになったが、これらの動作は主に接触のない社会的相互作用や孤立した動きに限られている。対照的に、補助的なシナリオは、人間のパートナーの継続的な認識と、進化する姿勢とダイナミクスへの迅速な適応を必要とします。本稿では,多エージェント強化学習問題として,密接な相互作用と力による人間の動作系列の模倣を定式化する。我々は,支援者(支援者)エージェントと受取人エージェントの両方に対して,補助動作参照を追跡するために協調的にパートナー認識ポリシーを訓練する。この問題を解消するために、単一人のモーショントラッキングコントローラから先行情報を転送するパートナーポリシー初期化方式を導入し、探索を大幅に改善する。さらに,リアルタイムのポーズにアシスタントの参照動作を適応させ,身体的に意味のある支援を促進する動的参照リターゲティングと接触促進報酬を提案する。 AssistMimicは、確立されたベンチマーク上での補助的インタラクション動作のトラッキングに成功し、物理的に座屈し、社会的に認識されたヒューマノイド制御のためのマルチエージェントRL定式化の利点を実証する最初の方法であることを示す。

論文の概要: Learning to Assist: Physics-Grounded Human-Human Control via Multi-Agent Reinforcement Learning

関連論文リスト