Fugu-MT 論文翻訳(概要): InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing

論文の概要: InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing

arxiv url: http://arxiv.org/abs/2603.13082v1
Date: Fri, 13 Mar 2026 15:30:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-16 17:38:12.157769
Title: InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing
Title（参考訳）: InterEdit: テキストガイドによるマルチHuman 3Dモーション編集
Authors: Yebin Yang, Di Wen, Lei Qi, Weitong Kong, Junwei Zheng, Ruiping Liu, Yufan Chen, Chengzhi Wu, Kailun Yang, Yuqian Fu, Danda Pani Paudel, Luc Van Gool, Kunyu Peng,
Abstract要約: 本稿では,複数の人物による3Dモーション編集のタスクについて紹介する。これをサポートするために、InterEdit3D、手動2人動作変更アノテーションを備えた新しいデータセット、およびテキスト誘導多人動作編集(TMME)ベンチマークを提案する。 InterEditはテキスト間の一貫性を改善し、忠実さを編集し、最先端のTMMEパフォーマンスを実現する。
参考スコア（独自算出の注目度）: 73.51964472028392
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Text-guided 3D motion editing has seen success in single-person scenarios, but its extension to multi-person settings is less explored due to limited paired data and the complexity of inter-person interactions. We introduce the task of multi-person 3D motion editing, where a target motion is generated from a source and a text instruction. To support this, we propose InterEdit3D, a new dataset with manual two-person motion change annotations, and a Text-guided Multi-human Motion Editing (TMME) benchmark. We present InterEdit, a synchronized classifier-free conditional diffusion model for TMME. It introduces Semantic-Aware Plan Token Alignment with learnable tokens to capture high-level interaction cues and an Interaction-Aware Frequency Token Alignment strategy using DCT and energy pooling to model periodic motion dynamics. Experiments show that InterEdit improves text-to-motion consistency and edit fidelity, achieving state-of-the-art TMME performance. The dataset and code will be released at https://github.com/YNG916/InterEdit.
Abstract（参考訳）: テキスト誘導型3Dモーション編集は、シングルパーソンシナリオで成功したが、ペアデータに制限があることと、対人インタラクションの複雑さにより、マルチパーソン設定への拡張は検討されていない。本稿では,複数の人物による3Dモーション編集のタスクについて紹介する。これをサポートするために、InterEdit3D、手動2人動作変更アノテーションを備えた新しいデータセット、およびテキスト誘導多人動作編集(TMME)ベンチマークを提案する。 TMMEのための同期型分類器自由条件拡散モデルであるInterEditを提案する。セマンティック・アウェア・プラン・トークンアライメント(Semantic-Aware Plan Token Alignment)を導入し、高レベルなインタラクションキューをキャプチャするためのトークンと、DCTとエネルギプールを用いたインタラクション・アウェア・周波数・トークンアライメント戦略を導入し、周期的な動きのダイナミクスをモデル化する。実験により、InterEditはテキスト間の一貫性を改善し、忠実さを編集し、最先端のTMMEパフォーマンスを実現する。データセットとコードはhttps://github.com/YNG916/InterEditで公開される。

論文の概要: InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing

関連論文リスト