Fugu-MT Paper Translation (Abstract): ParTY: Part-Guidance for Expressive Text-to-Motion Synthesis

Paper overview: ParTY: Part-Guidance for Expressive Text-to-Motion Synthesis

arxiv url: http://arxiv.org/abs/2603.09611v1
Date: Tue, 10 Mar 2026 12:53:12 GMT
Status: Translation complete
Last updated in system: 2026-03-11 15:25:24.316676
Title: ParTY: Part-Guidance for Expressive Text-to-Motion Synthesis
Title (reference translation): ParTY: Part Guidance for Expressive Text-to-Motion Synthesis
Authors: KunHo Heo, SuYeon Kim, Yonghyun Gwon, Youngbin Kim, MyeongAh Cho
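Abstract summary: This paper proposes ParTY, a novel framework that enhances part expressiveness while generating coherent full-body motions. ParTY comprises (1) Part-Guided Network, which generates part motions and uses them as guidance for holistic motion generation; (2) Part-aware Text Grounding, which diversely transforms text embeddings and aligns them appropriately with each body part; and (3) Holistic-Part Fusion, which adaptively fuses holistic motions and part motions.
Reference score (independently computed attention score): 16.628208335930857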
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Text-to-motion synthesis aims to generate natural and expressive human motions from textual descriptions. While existing approaches primarily focus on generating holistic motions from text descriptions, they struggle to accurately reflect actions involving specific body parts. Recent part-wise motion generation methods attempt to resolve this but face two critical limitations: (i) they lack explicit mechanisms for aligning textual semantics with individual body parts, and (ii) they often generate incoherent full-body motions due to integrating independently generated part motions. To overcome these issues and resolve the fundamental trade-off in existing methods, we propose ParTY, a novel framework that enhances part expressiveness while generating coherent full-body motions. ParTY comprises: (1) Part-Guided Network, which first generates part motions to obtain part guidance, then uses it to generate holistic motions; (2) Part-aware Text Grounding, which diversely transforms text embeddings and appropriately aligns them with each body part; and (3) Holistic-Part Fusion, which adaptively fuses holistic motions and part motions. Extensive experiments, including part-level and coherence-level evaluations, demonstrate that ParTY achieves substantial improvements over previous methods.
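Abstract (reference translation): Text-to-motion synthesis aims to generate natural and expressive human motions from textual descriptions. Existing approaches focus mainly on generating holistic motions from text descriptions, but they struggle to accurately reflect actions involving specific body parts. Recent part-wise motion generation methods try to solve this problem but face two limitations: (i) they lack an explicit mechanism for aligning textual semantics with individual body parts, and (ii) they often produce incoherent full-body motions because they integrate independently generated part motions. To overcome these issues and resolve the fundamental trade-off in existing methods, the authors propose ParTY, a novel framework that enhances part expressiveness while generating coherent full-body motions. ParTY comprises (1) Part-Guided Network, which first generates part motions to obtain part guidance and then uses it to generate holistic motions; (2) Part-aware Text Grounding, which diversely transforms text embeddings and aligns them appropriately with each body part; and (3) Holistic-Part Fusion, which adaptively fuses holistic motions and part motions. Extensive experiments, including part-level and coherence-level evaluations, show that ParTY substantially improves on previous methods.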

Related papers