Fugu-MT 論文翻訳(概要): ServImage: An Image Generation and Editing Benchmark from Real-world Commercial Imaging Services

論文の概要: ServImage: An Image Generation and Editing Benchmark from Real-world Commercial Imaging Services

arxiv url: http://arxiv.org/abs/2604.24023v1
Date: Mon, 27 Apr 2026 04:11:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.733088
Title: ServImage: An Image Generation and Editing Benchmark from Real-world Commercial Imaging Services
Title（参考訳）: ServImage: リアルタイムの商用画像サービスによる画像生成と編集ベンチマーク
Authors: Fengxian Ji, Jingpu Yang, Zirui Song, Lang Gao, Junhong Liang, Zhenhao Chen, Jinghui Zhang, Xiuying Chen,
Abstract要約: textbfServImageは、商業設計プロジェクトにおいて、モデル出力と経済的価値を明示的に関連付けるベンチマークである。 ServImageは、(i) textbftextitServImageBench: 1.07kの有償商用デザインタスクと2.05kのデザイナ向けデザイナ向けデータセットで構成され、合計で295k以上の画像、製品、デジタルコンテンツ、33kの候補画像、33kの人間のアノテーションで構成されている。 textbftextitServImageModel:このスコアリングシステムの下で、人間アノテーションに基づいて訓練された支払予測モデルを提案する。
参考スコア（独自算出の注目度）: 20.283096576678933
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent image generation and editing models demonstrate robust adherence to instructions and high visual quality on academic benchmarks. However, their performance on paid, real-world design projects remains uncertain. We introduce \textbf{ServImage}, a benchmark that explicitly correlates model outputs with economic value in commercial design projects. ServImage consists of (i) \textbf{\textit{ServImageBench}}: a dataset of 1.07k paid commercial design tasks and 2.05k designer deliverables totaling over \$295k, covering portrait, product, and digital content, along with 33k candidate images and 33k human annotations. (ii) \textbf{\textit{ServImageScore}}: an integrated scoring system that combines three quality dimensions: baseline requirements fulfilment, visual execution quality, and commercial necessity satisfaction. These three dimensions are designed to characterize the factors that drive human payment decisions and indicate whether an image is commercially acceptable. (iii) \textbf{\textit{ServImageModel}}: under this scoring system, we propose a payment prediction model trained on the human-annotated candidate images, achieving 82.00\% accuracy in predicting human payment decisions and producing calibrated payment probabilities. ServImage provides a comprehensive foundation for assessing the commercial viability of image generation models and offers a scalable resource for future research on economically grounded vision systems \href{https://github.com/FengxianJi/ServImage}{Github.}
Abstract（参考訳）: 最近の画像生成・編集モデルは, 学術ベンチマークにおいて, 指示への頑健な執着と高い視覚的品質を示す。しかし、彼らの有償の現実世界の設計プロジェクトにおけるパフォーマンスは、いまだに不確実である。本稿では, 商業設計プロジェクトにおいて, モデル出力と経済的価値を明示的に相関させるベンチマークである \textbf{ServImage} を紹介する。 ServImage は (i) \textbf{\textit{ServImageBench}}: 1.07kの有償の商用デザインタスクと2.05kのデザイナによる合計295k以上の製品、デジタルコンテンツ、33kの候補画像、33kの人間のアノテーションのデータセット。 (ii) \textbf{\textit{ServImageScore}}: ベースライン要求満足度、視覚的実行品質、商業的必要満足度という3つの品質次元を組み合わせた統合スコアシステム。これらの3次元は、人間の支払い決定を駆動する要因を特徴づけ、画像が商業的に受け入れられているかどうかを示すように設計されている。 (iii) \textbf{\textit{ServImageModel}}:このスコアリングシステムでは、人手による候補画像に基づいて訓練された支払予測モデルを提案し、人手による支払決定の精度82.00\%、キャリブレーションされた支払確率を推定する。 ServImageは、画像生成モデルの商業的生存性を評価するための総合的な基盤を提供し、経済的基盤を持つビジョンシステム \href{https://github.com/FengxianJi/ServImage}{Github に関する将来の研究のためのスケーラブルなリソースを提供する。 ※

論文の概要: ServImage: An Image Generation and Editing Benchmark from Real-world Commercial Imaging Services

関連論文リスト