Fugu-MT 論文翻訳(概要): InfiniHuman: Infinite 3D Human Creation with Precise Control

論文の概要: InfiniHuman: Infinite 3D Human Creation with Precise Control

arxiv url: http://arxiv.org/abs/2510.11650v1
Date: Mon, 13 Oct 2025 17:29:55 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 18:06:30.479247
Title: InfiniHuman: Infinite 3D Human Creation with Precise Control
Title（参考訳）: InfiniHuman: 精密制御による無限の3次元人間の創造
Authors: Yuxuan Xue, Xianghui Xie, Margaret Kostyrko, Gerard Pons-Moll,
Abstract要約: InfiniHumanは、既存の基礎モデルを蒸留して、リッチな注釈付き人間データを生成するフレームワークである。 InfiniHumanDataには、前例のない多様性にまたがる111KのIDが含まれている。 InfiniHumanGenは高速でリアルで正確に制御可能なアバター生成を可能にする。
参考スコア（独自算出の注目度）: 25.397126937363012
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generating realistic and controllable 3D human avatars is a long-standing challenge, particularly when covering broad attribute ranges such as ethnicity, age, clothing styles, and detailed body shapes. Capturing and annotating large-scale human datasets for training generative models is prohibitively expensive and limited in scale and diversity. The central question we address in this paper is: Can existing foundation models be distilled to generate theoretically unbounded, richly annotated 3D human data? We introduce InfiniHuman, a framework that synergistically distills these models to produce richly annotated human data at minimal cost and with theoretically unlimited scalability. We propose InfiniHumanData, a fully automatic pipeline that leverages vision-language and image generation models to create a large-scale multi-modal dataset. User study shows our automatically generated identities are undistinguishable from scan renderings. InfiniHumanData contains 111K identities spanning unprecedented diversity. Each identity is annotated with multi-granularity text descriptions, multi-view RGB images, detailed clothing images, and SMPL body-shape parameters. Building on this dataset, we propose InfiniHumanGen, a diffusion-based generative pipeline conditioned on text, body shape, and clothing assets. InfiniHumanGen enables fast, realistic, and precisely controllable avatar generation. Extensive experiments demonstrate significant improvements over state-of-the-art methods in visual quality, generation speed, and controllability. Our approach enables high-quality avatar generation with fine-grained control at effectively unbounded scale through a practical and affordable solution. We will publicly release the automatic data generation pipeline, the comprehensive InfiniHumanData dataset, and the InfiniHumanGen models at https://yuxuan-xue.com/infini-human.
Abstract（参考訳）: 現実的でコントロール可能な3Dアバターを生成することは、特に民族、年齢、衣服のスタイル、詳細な体型といった幅広い属性範囲をカバーする場合、長年にわたる課題である。生成モデルをトレーニングするための大規模な人的データセットのキャプチャと注釈付けは、違法に高価であり、スケールと多様性に制限がある。既存の基礎モデルを蒸留して理論上非有界でリッチな注釈付き3Dデータを生成することは可能か? InfiniHumanは、これらのモデルを相乗的に蒸留して、高額な注釈付きデータを最小限のコストで、理論上無制限のスケーラビリティで生成するフレームワークである。 InfiniHumanDataは、視覚言語と画像生成モデルを利用して大規模なマルチモーダルデータセットを作成する完全自動パイプラインである。ユーザスタディは、自動生成されたIDがスキャンレンダリングと区別できないことを示している。 InfiniHumanDataには、前例のない多様性にまたがる111KのIDが含まれている。各アイデンティティには、マルチグラニュラリティテキスト記述、マルチビューRGBイメージ、詳細な衣料品画像、SMPLボディ形状パラメータが注釈付けされている。 InfiniHumanGenは,テキスト,体形,衣服の資産を条件とした拡散型生成パイプラインである。 InfiniHumanGenは高速でリアルで正確に制御可能なアバター生成を可能にする。広汎な実験は、視覚的品質、生成速度、制御性において最先端の手法よりも大幅に改善されている。提案手法により,実用的で手頃なソリューションにより,高精細な粒度制御が可能な高品質なアバターを効果的に非有界スケールで実現する。私たちは、自動データ生成パイプライン、包括的なInfiniHumanDataデータセット、InfiniHumanGenモデルをhttps://yuxuan-xue.com/infini-human.comで公開します。

論文の概要: InfiniHuman: Infinite 3D Human Creation with Precise Control

関連論文リスト