Fugu-MT 論文翻訳(概要): Geometry-Aware Losses for Structure-Preserving Text-to-Sign Language Generation

論文の概要: Geometry-Aware Losses for Structure-Preserving Text-to-Sign Language Generation

arxiv url: http://arxiv.org/abs/2509.23011v1
Date: Sat, 27 Sep 2025 00:06:17 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:18.98295
Title: Geometry-Aware Losses for Structure-Preserving Text-to-Sign Language Generation
Title（参考訳）: 構造保存型テキスト-手話生成のための幾何学的認識損失
Authors: Zetian Wu, Tianshuo Zhou, Stefan Lee, Liang Huang,
Abstract要約: 手話翻訳は、難聴者や難聴者に対して効果的なコミュニケーションを可能にする上で重要な役割を担っている。以前の手法では、人間の骨格運動の解剖学的制約や調整パターンを無視することが多かった。骨格関節間の関係を明示的にモデル化する新しいアプローチを提案する。
参考スコア（独自算出の注目度）: 14.94145299705437
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Sign language translation from text to video plays a crucial role in enabling effective communication for Deaf and hard--of--hearing individuals. A major challenge lies in generating accurate and natural body poses and movements that faithfully convey intended meanings. Prior methods often neglect the anatomical constraints and coordination patterns of human skeletal motion, resulting in rigid or biomechanically implausible outputs. To address this, we propose a novel approach that explicitly models the relationships among skeletal joints--including shoulders, arms, and hands--by incorporating geometric constraints on joint positions, bone lengths, and movement dynamics. During training, we introduce a parent-relative reweighting mechanism to enhance finger flexibility and reduce motion stiffness. Additionally, bone-pose losses and bone-length constraints enforce anatomically consistent structures. Our method narrows the performance gap between the previous best and the ground-truth oracle by 56.51%, and further reduces discrepancies in bone length and movement variance by 18.76% and 5.48%, respectively, demonstrating significant gains in anatomical realism and motion naturalness.
Abstract（参考訳）: テキストからビデオへの手話翻訳は、難聴者や難聴者に対する効果的なコミュニケーションを可能にする上で重要な役割を担っている。大きな課題は、意図した意味を忠実に伝える正確で自然なポーズや動きを生み出すことである。以前の手法は、人間の骨格運動の解剖学的制約や調整パターンを無視し、硬直的または生体力学的に不可解な出力をもたらす。そこで本稿では, 肩, 腕, 手など骨格関節間の関係を, 関節位置, 骨長, 運動動態に幾何的制約を加えて, 明確にモデル化する手法を提案する。トレーニング中,指の柔軟性を高め,運動剛性を低下させる親相対リウェイト機構を導入する。さらに、骨配置の損失と骨長の制約は解剖学的に一貫した構造を強制する。提案手法は,前回のベストとグラウンド・トゥルース・オーラクルのパフォーマンスギャップを56.51%削減し,骨長と移動の差異を18.76%,5.48%削減し,解剖学的リアリズムと運動自然性において有意な優位性を示した。

論文の概要: Geometry-Aware Losses for Structure-Preserving Text-to-Sign Language Generation

関連論文リスト