Fugu-MT Paper Translation (Abstract): Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration

Paper overview: Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration

arxiv url: http://arxiv.org/abs/2604.10150v1
Date: Sat, 11 Apr 2026 10:47:22 GMT
Status: Translation complete
Last updated in system: 2026-04-14 20:13:15.873802
Title: Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration
Title (reference translation): Learning from Emptiness: De-biasing Listwise Rerankers through Content-Agnostic Probability Calibration
Authors: Hang Lv, Hongchao Gu, Ruiqing Yang, Liangyue Li, Zulong Chen, Defu Lian, Hao Wang, Enhong Chen
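Abstract summary: CapCal is a training-free framework that mechanically decouples positional bias from ranking decisions. It achieves superior performance among training-free methods while preserving single-pass efficiency.
Reference score (independently computed attention score): 76.08899010904652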
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generative listwise reranking leverages global context for superior retrieval but is plagued by intrinsic position bias, where models exhibit structural sensitivity to input order independent of relevance. Existing mitigations present a dilemma: inference-time aggregation incurs prohibitive latency, while training-based methods often fail to eradicate ingrained priors, particularly in compact models. To resolve this dilemma, we propose CapCal (Content-Agnostic Probability Calibration), a training-free framework that mechanically decouples positional bias from ranking decisions. By estimating the bias distribution via content-free placeholders, CapCal rectifies output logits through an entropy-adaptive contrastive mechanism. Evaluations across 10 benchmarks confirm that CapCal achieves superior performance among training-free methods while preserving single-pass efficiency. Notably, it unlocks the latent potential of lightweight models (e.g., 0.6B), delivering absolute NDCG gains exceeding 10 points and outperforming both permutation-based aggregation and data-augmentation baselines.
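Abstract (reference translation): Generative listwise reranking exploits global context for better retrieval, but it suffers from intrinsic position bias: models show a structural sensitivity to input order that is independent of relevance. Existing mitigations pose a dilemma. Inference-time aggregation incurs prohibitive latency, while training-based methods often fail to eradicate ingrained priors, especially in compact models. To resolve this, the authors propose CapCal (Content-Agnostic Probability Calibration), a training-free framework that mechanically decouples positional bias from ranking decisions. CapCal estimates the bias distribution with content-free placeholders and corrects the output logits through an entropy-adaptive contrastive mechanism. Evaluated on 10 benchmarks, CapCal achieves superior performance among training-free methods while preserving single-pass efficiency. Notably, it unlocks the latent potential of lightweight models (e.g., 0.6B), achieving absolute NDCG gains of more than 10 points and outperforming both permutation-based aggregation and data-augmentation baselines.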
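
The abstract names the ingredients of the calibration (a bias distribution estimated from content-free placeholders, and an entropy-adaptive contrastive correction of the output logits) but does not give the formula. The following Python snippet is only an illustrative sketch of that general idea, not the authors' implementation. The function names, the `max_strength` parameter, the choice to scale the correction by normalized entropy, and all numbers are assumptions made for this example.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def normalized_entropy(log_probs):
    """Shannon entropy of a distribution, scaled to [0, 1] by log(n)."""
    n = len(log_probs)
    if n < 2:
        return 0.0
    entropy = -sum(math.exp(lp) * lp for lp in log_probs)
    return entropy / math.log(n)


def calibrate(content_logits, placeholder_logits, max_strength=1.0):
    """Subtract a placeholder-derived position prior from the real logits.

    content_logits: per-position logits when the real passages are shown.
    placeholder_logits: per-position logits when every passage is replaced
        by a content-free placeholder, so only position can drive them.

    ASSUMPTION: the correction strength grows with the entropy of the
    content-conditioned distribution (an uncertain model gets a stronger
    correction). The paper's actual rule is not stated in the abstract.
    """
    if len(content_logits) != len(placeholder_logits):
        raise ValueError("both logit vectors must cover the same positions")
    content_log_probs = log_softmax(content_logits)
    bias_log_probs = log_softmax(placeholder_logits)
    alpha = max_strength * normalized_entropy(content_log_probs)
    # Contrastive step: down-weight positions the model prefers even
    # when the input carries no content at all.
    return [c - alpha * b for c, b in zip(content_logits, bias_log_probs)]


def argmax(values):
    """Index of the largest value."""
    return max(range(len(values)), key=lambda i: values[i])


# Hypothetical numbers: the model narrowly prefers position 0, and the
# placeholder-only run shows it favors position 0 regardless of content.
example_content = [2.0, 1.9, 0.5]
example_placeholder = [1.5, 0.0, 0.0]
example_calibrated = calibrate(example_content, example_placeholder)
```

With these hypothetical numbers, the top-ranked position moves from the first candidate to the second once the placeholder-derived prior is subtracted. A uniform placeholder distribution only adds a constant to every logit, so it leaves the ranking unchanged. Because the placeholder logits do not depend on the query or the passages, they could in principle be computed once and reused, which is consistent with the single-pass efficiency the abstract reports.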

Related papers