Fugu-MT 論文翻訳(概要): DebFilter: Eradicating Biases Stashed in Value

論文の概要: DebFilter: Eradicating Biases Stashed in Value

arxiv url: http://arxiv.org/abs/2605.28167v1
Date: Wed, 27 May 2026 08:49:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:55.908607
Title: DebFilter: Eradicating Biases Stashed in Value
Title（参考訳）: DebFilter: 価値の高いバイアスの排除
Authors: Seung Hyuk Lee, Songkuk Kim,
Abstract要約: テキスト・ツー・イメージ・モデルにおける社会的・意味的バイアスを軽減するために,DebFilterを提案する。我々は,クロスアテンション内の値成分を調整するバイアス補正戦略を適用した。本手法は,生成した画像の社会的バイアスを効果的に再構成することを示した。
参考スコア（独自算出の注目度）: 4.060731229044571
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Text-to-image diffusion models, which are theoretically equivalent to score-based generative models, generate images through a multi-step denoising process guided by text embeddings extracted from pretrained vision-language models such as CLIP. However, these text embeddings inherently encode social and semantic biases -- such as those related to gender and age -- that are subsequently propagated and amplified through the guidance mechanism, along with the model's training on large-scale datasets that are imbalanced with respect to these bias-related concepts, often leading to skewed outputs in text-to-image generation. We propose DebFilter, a lightweight and training-free framework for mitigating such biases in text-to-image diffusion models. Observing that the model's error prediction at each denoising step is primarily influenced by cross-attention dynamics, we introduce a bias-correction strategy that adjusts the value components within cross-attention. Specifically, we apply a fixed offset to the slice of guidance embedding, effectively steering the semantic direction of cross-attention values toward unbiased representations. This adjustment reconfigures the score landscape to produce balanced outputs while maintaining alignment with the intended text semantics. Unlike prior approaches that rely on fine-tuning or retraining, DebFilter operates entirely at inference time, requiring no additional data or model updates. Our results demonstrate that this method effectively mitigates social biases in generated images, offering an efficient and scalable pathway toward fairer and more inclusive text-to-image generation.
Abstract（参考訳）: テキスト・ツー・イメージ拡散モデルは、理論上はスコアベース生成モデルと等価であり、CLIPのような事前学習された視覚言語モデルから抽出されたテキスト埋め込みによって導かれる多段階のデノナイズプロセスを通して画像を生成する。しかしながら、これらのテキスト埋め込みは、本質的に社会的および意味的なバイアス(性別や年齢など)を符号化し、その後、ガイダンスメカニズムを通じて伝播および増幅し、モデルがこれらのバイアスに関する概念に関して不均衡な大規模なデータセットをトレーニングし、しばしばテキスト・ツー・イメージ生成において歪んだ出力をもたらす。 DebFilterはテキストから画像への拡散モデルにおいて,そのようなバイアスを緩和するための軽量でトレーニング不要なフレームワークである。各段階でのモデルの誤差予測は、主にクロスアテンションダイナミクスの影響を受けており、クロスアテンション内の値成分を調整するバイアス補正戦略を導入する。具体的には、固定オフセットをガイダンス埋め込みのスライスに適用し、非バイアス表現に対する横断的意図値の意味的な方向を効果的に操る。この調整はスコアランドスケープを再構成し、意図したテキストセマンティクスとの整合を維持しながらバランスの取れた出力を生成する。微調整や再トレーニングに依存する従来のアプローチとは異なり、DebFilterは推論時に完全に動作し、追加のデータやモデル更新を必要としない。本手法は,画像生成における社会的バイアスを効果的に軽減し,より公平で包括的なテキスト・ツー・イメージ生成への効率よくスケーラブルな経路を提供することを示す。

論文の概要: DebFilter: Eradicating Biases Stashed in Value

関連論文リスト