Fugu-MT 論文翻訳(概要): GLEaN: A Text-to-image Bias Detection Approach for Public Comprehension

論文の概要: GLEaN: A Text-to-image Bias Detection Approach for Public Comprehension

arxiv url: http://arxiv.org/abs/2604.09923v1
Date: Fri, 10 Apr 2026 21:42:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:15.750547
Title: GLEaN: A Text-to-image Bias Detection Approach for Public Comprehension
Title（参考訳）: GLEaN: 公開理解のためのテキストから画像へのバイアス検出手法
Authors: Bochu Ding, Brinnae Bent, Augustus Wendell,
Abstract要約: GLEaN(Generative Likeness Evaluation at N-Scale)は、T2Iモデルのバイアスを視覚的に理解できるようにデザインされたポートレートベースの説明可能性パイプラインである。 GLEaN on Stable Diffusion XL across 40 social and occupational identity prompts, produced Composites that reproduceed biases and surface new associations between skin tone and predict emotion。
参考スコア（独自算出の注目度）: 0.17205106391379024
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Text-to-image (T2I) models, and their encoded biases, increasingly shape the visual media the public encounters. While researchers have produced a rich body of work on bias measurement, auditing, and mitigation in T2I systems, those methods largely target technical stakeholders, leaving a gap in public legibility. We introduce GLEaN (Generative Likeness Evaluation at N-Scale), a portrait-based explainability pipeline designed to make T2I model biases visually understandable to a broad audience. GLEaN comprises three stages: automated large-scale image generation from identity prompts, facial landmark-based filtering and spatial alignment, and median-pixel composition that distills a model's central tendency into a single representative portrait. The resulting composites require no statistical background to interpret; a viewer can see, at a glance, who a model 'imagines' when prompted with 'a doctor' versus a 'felon.' We demonstrate GLEaN on Stable Diffusion XL across 40 social and occupational identity prompts, producing composites that reproduce documented biases and surface new associations between skin tone and predicted emotion. We find in a between-subjects user study (N = 291) that GLEaN portraits communicate biases as effectively as conventional data tables, but require significantly less viewing time. Because the method relies solely on generated outputs, it can also be replicated on any black-box and closed-weight systems without access to model internals. GLEaN offers a scalable, model-agnostic approach to bias explainability, purpose-built for public comprehension, and is publicly available at https://github.com/cultureiolab/GLEaN.
Abstract（参考訳）: テキスト・ツー・イメージ(T2I)モデルとその符号化されたバイアスは、公共の遭遇する視覚メディアをますます形作る。研究者は、T2Iシステムにおけるバイアス測定、監査、緩和に関する豊富な研究を生み出しているが、これらの手法は主に技術的ステークホルダーを対象としており、公共の妥当性の差を残している。 GLEaN(Generative Likeness Evaluation at N-Scale)は、T2Iモデルのバイアスを視覚的に理解できるようにデザインされたポートレートベースの説明可能性パイプラインである。 GLEaNは、アイデンティティプロンプトからの大規模な画像の自動生成、顔のランドマークに基づくフィルタリングと空間アライメント、モデルの中心的な傾向を単一の代表像に蒸留する中央値画素構成の3段階からなる。結果として得られる合成物は解釈する統計的背景を必要とせず、ビューアーは一見すると、モデルが「医者」と「フェロン」で刺激されたときに「想像する」のを見ることができる。 GLEaN on Stable Diffusion XL across 40 social and occupational identity prompts, produced Composites that reproduceed biases and surface new associations between skin tone and predict emotion。我々は、GLEaNポートレートが従来のデータテーブルと同じくらい効果的にバイアスを伝達するが、視聴時間が大幅に少なくなることを発見した(N = 291)。この方法は生成した出力にのみ依存するため、モデル内部にアクセスすることなく、ブラックボックスやクローズドウェイトシステムでも複製できる。 GLEaNは、バイアス説明可能性に対するスケーラブルでモデルに依存しないアプローチを提供し、公開理解のために構築され、https://github.com/ Cultureiolab/GLEaNで公開されている。

論文の概要: GLEaN: A Text-to-image Bias Detection Approach for Public Comprehension

関連論文リスト