ColorConceptBench: A Benchmark for Probabilistic Color-Concept Understanding in Text-to-Image Models
- URL: http://arxiv.org/abs/2601.16836v2
- Date: Wed, 28 Jan 2026 05:15:48 GMT
- Title: ColorConceptBench: A Benchmark for Probabilistic Color-Concept Understanding in Text-to-Image Models
- Authors: Chenxi Ruan, Yu Xiao, Yihan Hou, Guosheng Hu, Wei Zeng,
- Abstract summary: We introduce ColorConceptBench, a new human-annotated benchmark to evaluate color-concept associations.<n>Our evaluation of seven leading text-to-image (T2I) models reveals that current models lack sensitivity to abstract semantics.<n>This demonstrates that achieving human-like color semantics requires more than larger models.
- Score: 20.130253460357547
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: While text-to-image (T2I) models have advanced considerably, their capability to associate colors with implicit concepts remains underexplored. To address the gap, we introduce ColorConceptBench, a new human-annotated benchmark to systematically evaluate color-concept associations through the lens of probabilistic color distributions. ColorConceptBench moves beyond explicit color names or codes by probing how models translate 1,281 implicit color concepts using a foundation of 6,369 human annotations. Our evaluation of seven leading T2I models reveals that current models lack sensitivity to abstract semantics, and crucially, this limitation appears resistant to standard interventions (e.g., scaling and guidance). This demonstrates that achieving human-like color semantics requires more than larger models, but demands a fundamental shift in how models learn and represent implicit meaning.
Related papers
- GenColorBench: A Color Evaluation Benchmark for Text-to-Image Generation Models [61.786094845872576]
We propose GenColorBench, the first comprehensive benchmark for text-to-image color generation.<n>It is grounded in color systems like I SCC-NBS and CSS3/X11, including numerical colors which are absent elsewhere.<n>With 44K color-focused prompts covering 400+ colors, it reveals models' true capabilities via perceptual and automated assessments.
arXiv Detail & Related papers (2025-10-23T14:12:55Z) - Color Names in Vision-Language Models [48.847573209643265]
We present the first systematic evaluation of color naming capabilities across vision-language models (VLMs)<n>Our results show that while VLMs achieve high accuracy on colors from classical studies, performance drops significantly on expanded, non-prototypical color sets.<n>We identify 21 common color terms that consistently emerge across all models, revealing two distinct approaches.
arXiv Detail & Related papers (2025-09-26T16:04:18Z) - COLIBRI Fuzzy Model: Color Linguistic-Based Representation and Interpretation [0.0]
This paper introduces the Human Perception-Based Fuzzy Color Model, COLIBRI, to bridge the gap between computational color representations and human visual perception.<n>The proposed model uses fuzzy sets and logic to create a framework for color categorization.<n>Our findings are significant for fields such as design, artificial intelligence, marketing, and human-computer interaction.
arXiv Detail & Related papers (2025-07-15T17:01:45Z) - Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models [53.73253164099701]
We introduce ColorWave, a training-free approach that achieves exact RGB-level color control in diffusion models without fine-tuning.<n>We demonstrate that ColorWave establishes a new paradigm for structured, color-consistent diffusion-based image synthesis.
arXiv Detail & Related papers (2025-03-12T21:49:52Z) - ColorFlow: Retrieval-Augmented Image Sequence Colorization [65.93834649502898]
We propose a three-stage diffusion-based framework tailored for image sequence colorization in industrial applications.<n>Unlike existing methods that require per-ID finetuning or explicit ID embedding extraction, we propose a novel Retrieval Augmented Colorization pipeline.<n>Our pipeline also features a dual-branch design: one branch for color identity extraction and the other for colorization.
arXiv Detail & Related papers (2024-12-16T14:32:49Z) - Improved Diffusion-based Image Colorization via Piggybacked Models [19.807766482434563]
We introduce a colorization model piggybacking on the existing powerful T2I diffusion model.
A diffusion guider is designed to incorporate the pre-trained weights of the latent diffusion model.
A lightness-aware VQVAE will then generate the colorized result with pixel-perfect alignment to the given grayscale image.
arXiv Detail & Related papers (2023-04-21T16:23:24Z) - PalGAN: Image Colorization with Palette Generative Adversarial Networks [51.59276436217957]
We propose a new GAN-based colorization approach PalGAN, integrated with palette estimation and chromatic attention.
PalGAN outperforms state-of-the-arts in quantitative evaluation and visual comparison, delivering notable diverse, contrastive, and edge-preserving appearances.
arXiv Detail & Related papers (2022-10-20T12:28:31Z) - Semantic-driven Colorization [78.88814849391352]
Recent colorization works implicitly predict the semantic information while learning to colorize black-and-white images.
In this study, we simulate that human-like action to let our network first learn to understand the photo, then colorize it.
arXiv Detail & Related papers (2020-06-13T08:13:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.