Fugu-MT 論文翻訳(概要): ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets

論文の概要: ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets

arxiv url: http://arxiv.org/abs/2604.08548v1
Date: Thu, 09 Apr 2026 17:59:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-10 18:34:06.084916
Title: ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets
Title（参考訳）: ETCH-X:構成可能なデータセットを持つ人間の衣服にフィットする表現体をロバスト化する
Authors: Xiaoben Li, Jingyi Wu, Zeyu Cai, Yu Siyuan, Boqian Li, Yuliang Xiu,
Abstract要約: コードモデルは8D2と0.8%のデータ、例えば35.8%(MPJPE-All)やVVVVAllでリリースされる。 BEDETCHは、さまざまな入力レベルにわたって堅牢なパフォーマンスとスケーラブルな改善を実現する。コードモデルは8D2と0.8%のデータ、例えば35.8%(MPJPE-All)やVVVVAllでリリースされる。
参考スコア（独自算出の注目度）: 7.74087799534966
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Human body fitting, which aligns parametric body models such as SMPL to raw 3D point clouds of clothed humans, serves as a crucial first step for downstream tasks like animation and texturing. An effective fitting method should be both locally expressive-capturing fine details such as hands and facial features-and globally robust to handle real-world challenges, including clothing dynamics, pose variations, and noisy or partial inputs. Existing approaches typically excel in only one aspect, lacking an all-in-one solution.We upgrade ETCH to ETCH-X, which leverages a tightness-aware fitting paradigm to filter out clothing dynamics ("undress"), extends expressiveness with SMPL-X, and replaces explicit sparse markers (which are highly sensitive to partial data) with implicit dense correspondences ("dense fit") for more robust and fine-grained body fitting. Our disentangled "undress" and "dense fit" modular stages enable separate and scalable training on composable data sources, including diverse simulated garments (CLOTH3D), large-scale full-body motions (AMASS), and fine-grained hand gestures (InterHand2.6M), improving outfit generalization and pose robustness of both bodies and hands. Our approach achieves robust and expressive fitting across diverse clothing, poses, and levels of input completeness, delivering a substantial performance improvement over ETCH on both: 1) seen data, such as 4D-Dress (MPJPE-All, 33.0% ) and CAPE (V2V-Hands, 35.8% ), and 2) unseen data, such as BEDLAM2.0 (MPJPE-All, 80.8% ; V2V-All, 80.5% ). Code and models will be released at https://xiaobenli00.github.io/ETCH-X/.
Abstract（参考訳）: SMPLのようなパラメトリックなボディモデルと、人間の生の3Dポイントの雲を合わせる人体フィッティングは、アニメーションやテクスチャといった下流のタスクにとって、重要な第一歩となる。効果的なフィッティング方法は、手や顔の特徴などの細部を局所的に表現的にキャプチャすると同時に、衣服のダイナミックス、ポーズのバリエーション、ノイズや部分的な入力など、現実世界の課題に対処するためにグローバルに堅牢であるべきである。我々はETCHをETCH-Xにアップグレードし、衣服のダイナミックス("undress")をフィルターするタイトネス・アウェア・フィッティング・パラダイムを活用し、SMPL-Xで表現性を拡張し、より頑丈できめ細かなボディフィッティングのために暗黙の密文("dense fit")で明示的なスパースマーカー(部分データに非常に敏感な)を置き換える。我々の「アンドレス」と「デンスフィット」のモジュラーステージは、多様なシミュレートされた衣服(CLOTH3D)、大規模なフルボディモーション(AMASS)、細粒度のハンドジェスチャー(InterHand2.6M)など、構成可能なデータソースの分離されたスケーラブルなトレーニングを可能にします。提案手法は, 多様な衣服, ポーズ, 入力完全性のレベルに対して, 堅牢かつ表現力のある適合性を実現し, ETCHよりも大幅な性能向上を実現している。 1) 4D-Dress(MPJPE-All, 33.0%)やCAPE(V2V-Hands, 35.8%)などのデータ。 2) BEDLAM2.0 (MPJPE-All, 80.8%, V2V-All, 80.5%) のような目に見えないデータ。コードとモデルはhttps://xiaobenli00.github.io/ETCH-X/でリリースされる。

論文の概要: ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets

関連論文リスト