Fugu-MT 論文翻訳(概要): Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D

論文の概要: Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D

arxiv url: http://arxiv.org/abs/2603.12126v1
Date: Thu, 12 Mar 2026 16:27:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-13 14:46:26.214488
Title: Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D
Title（参考訳）: Hoi3DGen: 高品質なヒューマンオブジェクトインタラクションを3Dで生成する
Authors: Agniv Sharma, Xianghui Xie, Tom Fischer, Eddy Ilg, Gerard Pons-Moll,
Abstract要約: Hoi3DGenは、入力インタラクション記述を正確に追従する、人間とオブジェクトのインタラクションの高品質なテクスチャメッシュを生成するフレームワークである。本手法は,テキストの一貫性が4～15倍,3次元モデル品質が3～7倍に向上する。
参考スコア（独自算出の注目度）: 29.37815662492805
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Modeling and generating 3D human-object interactions from text is crucial for applications in AR, XR, and gaming. Existing approaches often rely on score distillation from text-to-image models, but their results suffer from the Janus problem and do not follow text prompts faithfully due to the scarcity of high-quality interaction data. We introduce Hoi3DGen, a framework that generates high-quality textured meshes of human-object interaction that follow the input interaction descriptions precisely. We first curate realistic and high-quality interaction data leveraging multimodal large language models, and then create a full text-to-3D pipeline, which achieves orders-of-magnitude improvements in interaction fidelity. Our method surpasses baselines by 4-15x in text consistency and 3-7x in 3D model quality, exhibiting strong generalization to diverse categories and interaction types, while maintaining high-quality 3D generation.
Abstract（参考訳）: テキストからの3Dヒューマンオブジェクトインタラクションのモデリングと生成は、AR、XR、ゲームにおけるアプリケーションにとって不可欠である。既存のアプローチは、しばしばテキスト・ツー・イメージのモデルからのスコアの蒸留に頼っているが、その結果はヤヌスの問題に悩まされ、高品質な相互作用データが不足しているため、テキストのプロンプトに忠実に従わない。入力インタラクション記述を正確に追従する,人間-オブジェクトインタラクションの高品質なテクスチャメッシュを生成するフレームワークであるHoi3DGenを紹介する。まず,マルチモーダルな大言語モデルを利用して,現実的かつ高品質な対話データをキュレートし,その上で,対話の忠実さのオーダー・オブ・マグニチュード向上を実現する,完全なテキスト・ツー・3Dパイプラインを作成する。提案手法は,テキストの一貫性が4～15倍,3次元モデル品質が3～7倍に向上し,高品質な3次元生成を維持しつつ,多様なカテゴリやインタラクションタイプに強い一般化を示す。

論文の概要: Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D

関連論文リスト