Fugu-MT 論文翻訳(概要): MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

論文の概要: MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

arxiv url: http://arxiv.org/abs/2508.16911v1
Date: Sat, 23 Aug 2025 05:56:37 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-26 18:43:45.248027
Title: MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation
Title（参考訳）: MDD: テキスト・アンド・ミュージック・コンディショニング・デュエット・ダンス生成のためのデータセット
Authors: Prerit Gupta, Jason Alexander Fotso-Puepi, Zhengyuan Li, Jay Mehta, Aniket Bera,
Abstract要約: テキスト制御および音楽条件付き3Dデュエットダンスモーション生成用に設計された多モードベンチマークデータセットであるMultimodal DuetDance(MDD)を紹介する。我々のデータセットは、プロのダンサーが行う620分間の高品質なモーションキャプチャーデータからなり、音楽と同期し、10K以上のきめ細かい自然言語記述を詳述する。アノテーションは、空間的関係、身体の動き、リズムを詳述し、人間の動き、音楽、そしてデュエットダンス生成のためのテキストをシームレスに統合する最初のデータセットとなる。
参考スコア（独自算出の注目度）: 16.657210427678198
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We introduce Multimodal DuetDance (MDD), a diverse multimodal benchmark dataset designed for text-controlled and music-conditioned 3D duet dance motion generation. Our dataset comprises 620 minutes of high-quality motion capture data performed by professional dancers, synchronized with music, and detailed with over 10K fine-grained natural language descriptions. The annotations capture a rich movement vocabulary, detailing spatial relationships, body movements, and rhythm, making MDD the first dataset to seamlessly integrate human motions, music, and text for duet dance generation. We introduce two novel tasks supported by our dataset: (1) Text-to-Duet, where given music and a textual prompt, both the leader and follower dance motion are generated (2) Text-to-Dance Accompaniment, where given music, textual prompt, and the leader's motion, the follower's motion is generated in a cohesive, text-aligned manner. We include baseline evaluations on both tasks to support future research.
Abstract（参考訳）: テキスト制御および音楽条件付き3Dデュエットダンスモーション生成用に設計された多モードベンチマークデータセットであるMultimodal DuetDance(MDD)を紹介する。我々のデータセットは、プロのダンサーが行う620分間の高品質なモーションキャプチャーデータからなり、音楽と同期し、10K以上のきめ細かい自然言語記述を詳述する。アノテーションは、空間的関係、身体の動き、リズムを詳述し、人間の動き、音楽、そしてデュエットダンス生成のためのテキストをシームレスに統合する最初のデータセットとなる。本研究では,(1) 与えられた音楽とテキスト・プロンプトを生成するテキスト・トゥ・デュエット(Text-to-Duet),(2) 与えられた音楽,テキスト・プロンプト,およびリーダの動作を協調的に生成するテキスト・トゥ・ダンス・モーション(Text-to-Dance Acompaniment)という2つの新しいタスクを紹介した。今後の研究を支援するため,両課題のベースライン評価を含める。

論文の概要: MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

関連論文リスト