Fugu-MT 論文翻訳(概要): Do Cognitively Interpretable Reasoning Traces Improve LLM Performance?

論文の概要: Do Cognitively Interpretable Reasoning Traces Improve LLM Performance?

arxiv url: http://arxiv.org/abs/2508.16695v1
Date: Thu, 21 Aug 2025 23:48:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-26 18:43:45.127892
Title: Do Cognitively Interpretable Reasoning Traces Improve LLM Performance?
Title（参考訳）: 認知的推論トレースはLLM性能を向上させるか?
Authors: Siddhant Bhambri, Upasana Biswas, Subbarao Kambhampati,
Abstract要約: CoT(Chain-of-Thought)トレースは、回答を生成する前に中間的推論トレースを生成する。よくあるが、しばしば暗黙的な仮定は、CoTトレースは意味論的に意味があり、エンドユーザに解釈可能であるべきだというものである。本稿では,LLaMAモデルとQwenモデルの教師付き微調整によるオープンブック質問応答領域におけるこの問題について検討する。
参考スコア（独自算出の注目度）: 15.970162088175593
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent progress in reasoning-oriented Large Language Models (LLMs) has been driven by introducing Chain-of-Thought (CoT) traces, where models generate intermediate reasoning traces before producing an answer. These traces, as in DeepSeek R1, are not only used to guide inference but also serve as supervision signals for distillation into smaller models. A common but often implicit assumption is that CoT traces should be semantically meaningful and interpretable to the end user. While recent research questions the need for semantic nature of these traces, in this paper, we ask: ``\textit{Must CoT reasoning traces be interpretable to enhance LLM task performance?}" We investigate this question in the Open Book Question-Answering domain by supervised fine-tuning LLaMA and Qwen models on four types of reasoning traces: (1) DeepSeek R1 traces, (2) LLM-generated summaries of R1 traces, (3) LLM-generated post-hoc explanations of R1 traces, and (4) algorithmically generated verifiably correct traces. To quantify the trade-off between interpretability and performance, we further conduct a human-subject study with 100 participants rating the interpretability of each trace type. Our results reveal a striking mismatch: while fine-tuning on R1 traces yields the strongest performance, participants judged these traces to be the least interpretable. These findings suggest that it is useful to decouple intermediate tokens from end user interpretability.
Abstract（参考訳）: 推論指向の大規模言語モデル(LLM)の最近の進歩は、モデルが回答を生成する前に中間的推論トレースを生成する、Chain-of-Thought(CoT)トレースの導入によって進められている。これらの痕跡は、DeepSeek R1のように、推論を誘導するだけでなく、より小さなモデルへの蒸留の監視信号としても用いられる。よくあるが、しばしば暗黙的な仮定は、CoTトレースは意味論的に意味があり、エンドユーザに解釈可能であるべきだというものである。最近の研究では、これらのトレースのセマンティックな性質の必要性について疑問が呈されているが、本稿では次のように問う。 The Open Book Question-Answering domain by supervised fine-tuning LLaMA and Qwen models on four types of reasoning traces: (1) DeepSeek R1 traces, (2) LLM- generated summaries of R1 traces, (3) LLM- generated post-hoc explanations of R1 traces, (4) algorithmally generated verifiably correct traces。さらに,解釈可能性と性能のトレードオフを定量化するために,100人の参加者を対象に,各トレースタイプの解釈可能性を評価する。 R1トレースの微調整は、最も高いパフォーマンスをもたらすが、参加者はこれらのトレースを最も解釈しにくいと判断した。これらの結果は,中間トークンをエンドユーザの解釈可能性から切り離すことが有用であることが示唆された。

論文の概要: Do Cognitively Interpretable Reasoning Traces Improve LLM Performance?

関連論文リスト