Fugu-MT 論文翻訳(概要): From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models

論文の概要: From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models

arxiv url: http://arxiv.org/abs/2510.17247v1
Date: Mon, 20 Oct 2025 07:37:43 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 00:56:39.351104
Title: From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models
Title（参考訳）: 選好から偏見:ビデオ拡散モデルにおける社会的バイアス形成におけるアライメント調整の役割
Authors: Zefan Cai, Haoyi Qiu, Haozhe Zhao, Ke Wan, Jiachen Li, Jiuxiang Gu, Wen Xiao, Nanyun Peng, Junjie Hu,
Abstract要約: 本稿では,ビデオ生成における社会的表現を評価するためのフレームワークであるVideoBiasEvalを紹介する。 VideoBiasEvalでは、アクター属性からセマンティックコンテンツをアンタングルするために、イベントベースのプロンプト戦略を採用している。我々は、人間の嗜好データセットにおけるバイアス、報酬モデルにおける増幅、アライメント調整されたビデオ拡散モデルによる伝播を結合する最初のエンドツーエンド分析を行う。
参考スコア（独自算出の注目度）: 69.4332879415364
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent advances in video diffusion models have significantly enhanced text-to-video generation, particularly through alignment tuning using reward models trained on human preferences. While these methods improve visual quality, they can unintentionally encode and amplify social biases. To systematically trace how such biases evolve throughout the alignment pipeline, we introduce VideoBiasEval, a comprehensive diagnostic framework for evaluating social representation in video generation. Grounded in established social bias taxonomies, VideoBiasEval employs an event-based prompting strategy to disentangle semantic content (actions and contexts) from actor attributes (gender and ethnicity). It further introduces multi-granular metrics to evaluate (1) overall ethnicity bias, (2) gender bias conditioned on ethnicity, (3) distributional shifts in social attributes across model variants, and (4) the temporal persistence of bias within videos. Using this framework, we conduct the first end-to-end analysis connecting biases in human preference datasets, their amplification in reward models, and their propagation through alignment-tuned video diffusion models. Our results reveal that alignment tuning not only strengthens representational biases but also makes them temporally stable, producing smoother yet more stereotyped portrayals. These findings highlight the need for bias-aware evaluation and mitigation throughout the alignment process to ensure fair and socially responsible video generation.
Abstract（参考訳）: 映像拡散モデルの最近の進歩は、特に人間の嗜好に基づいて訓練された報酬モデルを用いたアライメントチューニングによって、テキスト・ビデオ生成を大幅に改善した。これらの手法は視覚的品質を改善するが、意図せずにエンコードし、社会的偏見を増幅することができる。このようなバイアスがアライメントパイプラインを通してどのように進化するかを体系的に追跡するために,ビデオ生成における社会的表現を評価するための総合的な診断フレームワークであるVideoBiasEvalを紹介した。確立された社会的偏見の分類に基づいて、VideoBiasEvalは、アクター属性(性別と民族)からセマンティックコンテンツ(アクションとコンテキスト)をアンタングルするイベントベースのプロンプト戦略を採用している。さらに、(1)全体民族性バイアス、(2)民族性に条件づけられた性別バイアス、(3)モデル変種間の社会的属性の分布変化、(4)ビデオ内のバイアスの時間的持続性を評価するために、多粒度メトリクスを導入している。このフレームワークを用いて、人間の嗜好データセットのバイアス、報酬モデルの増幅、アライメント調整されたビデオ拡散モデルによる伝播を結合する最初のエンドツーエンド分析を行う。その結果、アライメント調整は表現バイアスを強めるだけでなく、時間的に安定し、よりスムーズでステレオタイプ化された表現を生み出すことが明らかとなった。これらの知見は、公平で社会的に責任のあるビデオ生成を保証するために、アライメントプロセス全体を通してバイアス認識の評価と緩和の必要性を強調している。

論文の概要: From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models

関連論文リスト