Fugu-MT 論文翻訳(概要): 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models

論文の概要: 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models

arxiv url: http://arxiv.org/abs/2503.21745v1
Date: Thu, 27 Mar 2025 17:53:00 GMT
ステータス: 翻訳完了
システム内更新日: 2025-03-28 18:49:11.63935
Title: 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models
Title（参考訳）: 3DGen-Bench: 3D生成モデルのための総合ベンチマークスイート
Authors: Yuhan Zhang, Mengchen Zhang, Tong Wu, Tengfei Wang, Gordon Wetzstein, Dahua Lin, Ziwei Liu,
Abstract要約: 3D世代は急速に進歩しているが、3D評価の開発はペースを保っていない。大規模人選好データセット3DGen-Benchを開発した。次に、CLIPベースのスコアモデルである3DGen-ScoreとMLLMベースの自動評価器である3DGen-Evalを訓練する。
参考スコア（独自算出の注目度）: 94.48803082248872
License: http://creativecommons.org/licenses/by/4.0/
Abstract: 3D generation is experiencing rapid advancements, while the development of 3D evaluation has not kept pace. How to keep automatic evaluation equitably aligned with human perception has become a well-recognized challenge. Recent advances in the field of language and image generation have explored human preferences and showcased respectable fitting ability. However, the 3D domain still lacks such a comprehensive preference dataset over generative models. To mitigate this absence, we develop 3DGen-Arena, an integrated platform in a battle manner. Then, we carefully design diverse text and image prompts and leverage the arena platform to gather human preferences from both public users and expert annotators, resulting in a large-scale multi-dimension human preference dataset 3DGen-Bench. Using this dataset, we further train a CLIP-based scoring model, 3DGen-Score, and a MLLM-based automatic evaluator, 3DGen-Eval. These two models innovatively unify the quality evaluation of text-to-3D and image-to-3D generation, and jointly form our automated evaluation system with their respective strengths. Extensive experiments demonstrate the efficacy of our scoring model in predicting human preferences, exhibiting a superior correlation with human ranks compared to existing metrics. We believe that our 3DGen-Bench dataset and automated evaluation system will foster a more equitable evaluation in the field of 3D generation, further promoting the development of 3D generative models and their downstream applications.
Abstract（参考訳）: 3D世代は急速に進歩しているが、3D評価の開発はペースを保っていない。人間の知覚と等しく一致した自動評価の維持は、よく認識される課題となっている。言語と画像生成の分野での最近の進歩は人間の嗜好を探求し、優れた適合能力を示してきた。しかし、3Dドメインには生成モデルよりも包括的な嗜好データセットがない。この不在を緩和するため,戦闘方式の統合プラットフォームである3DGen-Arenaを開発した。そして、多種多様なテキストや画像のプロンプトを慎重に設計し、アリーナプラットフォームを活用して、公開ユーザと専門家アノテータの両方から人間の嗜好を収集し、3DGen-Benchを大規模に構築する。このデータセットを用いて、CLIPベースのスコアモデルである3DGen-ScoreとMLLMベースの自動評価器である3DGen-Evalをさらに訓練する。これら2つのモデルは,テキスト・ツー・3Dと画像・ツー・3D生成の品質評価を革新的に統一し,それぞれの強度で自動評価システムを構築する。大規模な実験では,評価モデルが人間の嗜好を予測する上で有効であることを示し,既存の指標と比較して人格との相関が優れていることを示した。我々は、3DGen-Benchデータセットと自動評価システムにより、3D生成分野におけるより公平な評価が促進され、3D生成モデルとその下流アプリケーションの開発が促進されると考えている。

論文の概要: 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models

関連論文リスト