Fugu-MT 論文翻訳(概要): Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization

論文の概要: Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization

arxiv url: http://arxiv.org/abs/2605.24960v1
Date: Sun, 24 May 2026 09:16:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:18.546746
Title: Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization
Title（参考訳）: コンテクストとパラメトリック・オブ・サード・チェーンの相互作用を最適化した検討
Authors: Jingyi Sun, Qianli Wang, Pepa Atanasova, Nils Feldhus, Isabelle Augenstein,
Abstract要約: CoT(Chain-of-Thought)の忠実さは2つの相反するパラダイムの下で評価される。モデルを忠実度パラダイムに最適化するための統一された嗜好アライメントインターフェースであるFaithMateを提案する。
参考スコア（独自算出の注目度）: 41.908827893947525
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Chain-of-Thought (CoT) faithfulness, i.e., whether CoTs genuinely reflect large language models' (LLM) underlying behavior, is typically evaluated under two disjoint paradigms: contextual faithfulness, measured by perturbing the input or CoT trace, and parametric faithfulness, assessed by intervening on a model's parametric knowledge. Yet prior work compares them only descriptively. We fill this gap by proposing FaithMate, a unified preference-alignment interface for optimizing models towards either faithfulness paradigm. It enables us to investigate the interplay between the two paradigms, examining whether and to what extent faithfulness gains generalize within and across paradigms. Across three models, two datasets, and six faithfulness metrics, we find that the two paradigms are positively coupled, yet asymmetric: optimizing towards parametric faithfulness yields consistent gains across both paradigms, whereas the contextual counterpart delivers more variable gains. Within the contextual paradigm, faithfulness gains on one metric do not consistently transfer to others, implying that existing contextual metrics capture disjoint facets of faithfulness and exposing inherent trade-offs. These findings imply that CoT faithfulness is not a monolithic objective and therefore requires multifaceted optimization and evaluation.
Abstract（参考訳）: CoT(Chain-of-Thought)の忠実さ、すなわち、CoTが大きな言語モデル(LLM)を真に反映しているかどうかは、典型的には2つの相反するパラダイムで評価される: 入力またはCoTトレースを摂動することによって測定される文脈的忠実性と、モデルのパラメトリック知識を介入することによって評価されるパラメトリック忠実である。しかし、事前の作業は、それらを記述的にのみ比較する。このギャップを埋めるために、モデルを忠実度パラダイムに最適化するための統一された優先順位調整インターフェースであるFaithMateを提案する。これにより、2つのパラダイム間の相互作用を調査し、パラダイム内およびパラダイム間の信頼度がどの程度高まっていくかを調べることができる。 3つのモデル、2つのデータセット、6つの忠実度測定値から、2つのパラダイムは正に結合されているが非対称であることが分かる。文脈的パラダイムの中では、ある計量に対する忠実さの獲得は、常に他への移動ではなく、既存の文脈的メトリクスが不一致な忠実さの面を捉え、固有のトレードオフを露呈することを意味している。これらの結果から,CoT忠実度はモノリシックな目的ではなく,多面的最適化と評価が必要であることが示唆された。

論文の概要: Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization

関連論文リスト