Fugu-MT 論文翻訳(概要): MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

論文の概要: MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

arxiv url: http://arxiv.org/abs/2603.14422v1
Date: Sun, 15 Mar 2026 15:07:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.803616
Title: MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions
Title（参考訳）: MBD: ユーザ、コンテンツ、モデル次元をまたいだモデルベースのデバイアスフレームワーク
Authors: Yuantong Li, Lei Yuan, Zhihao Zheng, Weimiao Wu, Songbin Liu, Jeong Min Lee, Ali Selman Aydin, Shaofeng Deng, Junbo Chen, Xinyi Zhang, Hongjing Xia, Sam Fieldman, Matthew Kosko, Wei Fu, Du Zhang, Peiyu Yang, Albert Jin Chung, Xianlei Qiu, Miao Yu, Zhongwei Teng, Hao Chen, Sunny Baek, Hui Tang, Yang Lv, Renze Wang, Qifan Wang, Zhan Li, Tiantian Xu, Peng Wu, Ji Liu,
Abstract要約: この課題に対処する一般モデルベースデバイアス(MBD)フレームワークを提案する。任意のコホートに対するエンゲージメント分布の文脈平均と分散を明示的に推定する。この統合により、フレームワークはバイアス付き生信号からバイアスなしの表現に変換することができる。
参考スコア（独自算出の注目度）: 50.00784452900918
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Modern recommendation systems rank candidates by aggregating multiple behavioral signals through a value model. However, many commonly used signals are inherently affected by heterogeneous biases. For example, watch time naturally favors long-form content, loop rate favors short - form content, and comment probability favors videos over images. Such biases introduce two critical issues: (1) value model scores may be systematically misaligned with users' relative preferences - for instance, a seemingly low absolute like probability may represent exceptionally strong interest for a user who rarely engages; and (2) changes in value modeling rules can trigger abrupt and undesirable ecosystem shifts. In this work, we ask a fundamental question: can biased behavioral signals be systematically transformed into unbiased signals, under a user - defined notion of ``unbiasedness'', that are both personalized and adaptive? We propose a general, model-based debiasing (MBD) framework that addresses this challenge by augmenting it with distributional modeling. By conditioning on a flexible subset of features (partial feature set), we explicitly estimate the contextual mean and variance of the engagement distribution for arbitrary cohorts (e.g., specific video lengths or user regions) directly alongside the main prediction. This integration allows the framework to convert biased raw signals into unbiased representations, enabling the construction of higher-level, calibrated signals (such as percentiles or z - scores) suitable for the value model. Importantly, the definition of unbiasedness is flexible and controllable, allowing the system to adapt to different personalization objectives and modeling preferences. Crucially, this is implemented as a lightweight, built-in branch of the existing MTML ranking model, requiring no separate serving infrastructure.
Abstract（参考訳）: 現代のレコメンデーションシステムは、値モデルを通じて複数の行動信号を集約することで候補をランク付けする。しかし、多くの一般的な信号は本質的に異種バイアスの影響を受けている。例えば、ウォッチタイムは自然に長文のコンテンツを好むし、ループレートは短文のコンテンツを好むし、コメント確率は画像よりもビデオを好む。このようなバイアスは、(1) 価値モデルスコアがユーザーの相対的な嗜好と体系的に不一致している可能性があること、例えば、一見して絶対的な確率が低いことは、めったに関わらないユーザーにとって非常に強い関心を示す可能性があること、(2) 価値モデリングルールの変更は、突然で望ましくないエコシステムシフトを引き起こす可能性があること、の2つの重大な問題を引き起こす。偏りのある行動信号は、ユーザの下で、体系的に非偏りの信号に変換できるか? 本稿では,この課題に対処する一般モデルベースデバイアス(MBD)フレームワークを提案する。特徴の柔軟な部分集合(部分的特徴集合)を条件に、任意のコホート(例えば、特定のビデオの長さやユーザ領域)のエンゲージメント分布の文脈的平均と分散を主予測と直接的に推定する。この統合により、偏りのある生信号から偏りのない表現に変換することができ、値モデルに適した高レベルな校正信号(パーセンタイルやz-スコアなど)を構築することができる。重要なことは、不偏性の定義は柔軟で制御可能であり、システムは異なるパーソナライゼーションの目的やモデリングの好みに適応できる。重要なことに、これは既存のMTMLランキングモデルの軽量で組み込みのブランチとして実装されており、別々のサービスインフラストラクチャを必要としない。

論文の概要: MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

関連論文リスト