Fugu-MT 論文翻訳(概要): Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks

論文の概要: Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks

arxiv url: http://arxiv.org/abs/2604.04192v1
Date: Sun, 05 Apr 2026 17:20:08 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:18.971648
Title: Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks
Title（参考訳）: Graphic-Design-Bench: グラフィックデザインタスクにおけるAI評価のための総合ベンチマーク
Authors: Adrienne Deganutti, Elad Hirsch, Haonan Zhu, Jaejung Seol, Purvanshi Mehta,
Abstract要約: GraphicDesignBench(GDB)は、プロフェッショナルなグラフィックデザインタスクの全範囲でAIモデルを評価するために設計された、初めての包括的なベンチマークスイートである。このスイートは、レイアウト、タイポグラフィー、インフォグラフィック、テンプレートとデザインのセマンティクス、アニメーションの5つの軸に沿って構成された50のタスクで構成されている。本研究では,空間的精度,知覚的品質,テキストの忠実度,セマンティックアライメント,構造的妥当性を網羅した標準化された計量分類法を用いて,フロンティアクローズソースモデルの集合を評価する。
参考スコア（独自算出の注目度）: 7.841779848822317
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We introduce GraphicDesignBench (GDB), the first comprehensive benchmark suite designed specifically to evaluate AI models on the full breadth of professional graphic design tasks. Unlike existing benchmarks that focus on natural-image understanding or generic text-to-image synthesis, GDB targets the unique challenges of professional design work: translating communicative intent into structured layouts, rendering typographically faithful text, manipulating layered compositions, producing valid vector graphics, and reasoning about animation. The suite comprises 50 tasks organized along five axes: layout, typography, infographics, template & design semantics and animation, each evaluated under both understanding and generation settings, and grounded in real-world design templates drawn from the LICA layered-composition dataset. We evaluate a set of frontier closed-source models using a standardized metric taxonomy covering spatial accuracy, perceptual quality, text fidelity, semantic alignment, and structural validity. Our results reveal that current models fall short on the core challenges of professional design: spatial reasoning over complex layouts, faithful vector code generation, fine-grained typographic perception, and temporal decomposition of animations remain largely unsolved. While high-level semantic understanding is within reach, the gap widens sharply as tasks demand precision, structure, and compositional awareness. GDB provides a rigorous, reproducible testbed for tracking progress toward AI systems that can function as capable design collaborators. The full evaluation framework is publicly available.
Abstract（参考訳）: GraphicDesignBench(GDB)は、プロフェッショナルなグラフィックデザインタスクのフル範囲でAIモデルを評価するために設計された、最初の包括的なベンチマークスイートである。自然なイメージ理解や汎用的なテキスト・ツー・イメージ合成に焦点を当てた既存のベンチマークとは異なり、GDBはプロの設計作業におけるユニークな課題をターゲットにしている。このスイートは、レイアウト、タイポグラフィ、インフォグラフィック、テンプレートとデザインのセマンティクスとアニメーションの5つの軸に沿って編成された50のタスクで構成され、それぞれが理解と生成の両方で評価され、LICA階層化データセットから引き出された実世界のデザインテンプレートに基礎を置いている。我々は,空間的正確性,知覚的品質,テキストの忠実度,意味的アライメント,構造的妥当性を網羅した標準化された計量分類法を用いて,フロンティアクローズソースモデルの集合を評価する。複雑なレイアウトに対する空間的推論,忠実なベクトルコード生成,微粒なタイポグラフィ知覚,アニメーションの時間的分解などは未解決のままである。高レベルの意味理解が到達範囲内にある一方で、タスクが精度、構造、構成的認識を要求するにつれて、ギャップは急速に広がる。 GDBは、AIシステムに向けた進捗を追跡するための厳格で再現可能なテストベッドを提供する。完全な評価フレームワークが公開されている。

論文の概要: Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks

関連論文リスト