Fugu-MT 論文翻訳(概要): Diagnosing Retrieval Bias Under Multiple In-Context Knowledge Updates in Large Language Models

論文の概要: Diagnosing Retrieval Bias Under Multiple In-Context Knowledge Updates in Large Language Models

arxiv url: http://arxiv.org/abs/2603.12271v1
Date: Wed, 18 Feb 2026 11:11:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:42.194718
Title: Diagnosing Retrieval Bias Under Multiple In-Context Knowledge Updates in Large Language Models
Title（参考訳）: 大規模言語モデルにおける複数文脈知識更新に基づく検索バイアスの診断
Authors: Boyu Qiao, Sean Guo, Xian Yang, Kun Li, Wei Zhou, Songlin Hu, Yunya Song,
Abstract要約: マルチアップデートシナリオには、検索で競合する複数の歴史的に有効なバージョンが含まれているが、未調査のままである。我々は、動的知識インスタンス(DKI)評価フレームワークを導入し、更新された値のシーケンスと組み合わせたキューと同じ事実の複数更新をモデル化する。最新状態の精度が大幅に低下する一方で,更新が増加するにつれて,検索バイアスが増大するのを観察する。
参考スコア（独自算出の注目度）: 19.498411614667294
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: LLMs are widely used in knowledge-intensive tasks where the same fact may be revised multiple times within context. Unlike prior work focusing on one-shot updates or single conflicts, multi-update scenarios contain multiple historically valid versions that compete at retrieval, yet remain underexplored. This challenge resembles the AB-AC interference paradigm in cognitive psychology: when the same cue A is successively associated with B and C, the old and new associations compete during retrieval, leading to bias. Inspired by this, we introduce a Dynamic Knowledge Instance (DKI) evaluation framework, modeling multi-updates of the same fact as a cue paired with a sequence of updated values, and assess models via endpoint probing of the earliest (initial) and latest (current) states. Across diverse LLMs, we observe that retrieval bias intensifies as updates increase, earliest-state accuracy stays high while latest-state accuracy drops substantially. Diagnostic analyses of attention, hidden-state similarity, and output logits further reveal that these signals become flatter and weakly discriminative on errors, providing little stable basis for identifying the latest update. Finally, cognitively inspired heuristic intervention strategies yield only modest gains and do not eliminate the bias. Our results reveal a persistent challenge in tracking and following knowledge updates in long contexts.
Abstract（参考訳）: LLMは、同じ事実がコンテキスト内で複数回修正されるような知識集約的なタスクで広く使用されている。ワンショット更新やシングルコンフリクトにフォーカスする以前の作業とは異なり、マルチアップデートシナリオには、検索で競合する複数の歴史的に有効なバージョンが含まれているが、未調査のままである。この課題は、認知心理学におけるAB-AC干渉パラダイム(英語版)に似ており、同じキューAがBとCと連続的に関連付けられている場合、古い協会と新しい協会が検索中に競い合い、バイアスをもたらす。そこで我々は、動的知識インスタンス(DKI)評価フレームワークを導入し、更新された値のシーケンスと組み合わせたキューと同じ事実のマルチアップデートをモデル化し、最初期の(初期)状態と最新の(現在の)状態のエンドポイント探索によるモデルを評価する。様々なLDMにおいて,更新が増加するにつれて検索バイアスが増大し,最新状態の精度が著しく低下する一方,初期状態の精度は高いままである。注意、隠れ状態の類似性、出力ロジットの診断分析により、これらの信号が誤りに対してより平坦で弱い識別力を持つことが明らかとなり、最新の更新を特定するための安定した基盤はほとんど得られない。最後に、認知にインスパイアされたヒューリスティックな介入戦略は、緩やかな利得しか得られず、バイアスを排除しない。この結果から,長期にわたる知識更新の追跡と追跡において,永続的な課題が明らかとなった。

論文の概要: Diagnosing Retrieval Bias Under Multiple In-Context Knowledge Updates in Large Language Models

関連論文リスト