Fugu-MT Paper Translation (Abstract): DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation

Paper summary: DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation

arxiv url: http://arxiv.org/abs/2603.08090v1
Date: Mon, 09 Mar 2026 08:30:28 GMT
Status: Translation complete
Last updated in system: 2026-03-10 15:13:15.706454
Title: DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation
Authors: Zhenyu Hu, Qing Wang, Te Cao, Luo Liao, Longfei Lu, Liqun Liu, Shuang Li, Hang Chen, Mengge Xue, Yuan Chen, Chao Deng, Peng Shu, Huan Yu, Jie Jiang
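Abstract summary: We propose DSH-Bench, a comprehensive benchmark that enables systematic multi-perspective analysis of subject-driven T2I models. Through an extensive empirical evaluation of 19 leading models, DSH-Bench uncovers previously obscured limitations in current approaches.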
Reference score (independently computed attention score): 38.16770019228023
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Significant progress has been achieved in subject-driven text-to-image (T2I) generation, which aims to synthesize new images depicting target subjects according to user instructions. However, evaluating these models remains a significant challenge. Existing benchmarks exhibit critical limitations: 1) insufficient diversity and comprehensiveness in subject images, 2) inadequate granularity in assessing model performance across different subject difficulty levels and prompt scenarios, and 3) a profound lack of actionable insights and diagnostic guidance for subsequent model refinement. To address these limitations, we propose DSH-Bench, a comprehensive benchmark that enables systematic multi-perspective analysis of subject-driven T2I models through four principal innovations: 1) a hierarchical taxonomy sampling mechanism ensuring comprehensive subject representation across 58 fine-grained categories, 2) an innovative classification scheme categorizing both subject difficulty level and prompt scenario for granular capability assessment, 3) a novel Subject Identity Consistency Score (SICS) metric demonstrating a 9.4% higher correlation with human evaluation compared to existing measures in quantifying subject preservation, and 4) a comprehensive set of diagnostic insights derived from the benchmark, offering critical guidance for optimizing future model training paradigms and data construction strategies. Through an extensive empirical evaluation of 19 leading models, DSH-Bench uncovers previously obscured limitations in current approaches, establishing concrete directions for future research and development.
