Fugu-MT 論文翻訳(概要): Uncovering the Representation Geometry of Minimal Cores in Overcomplete Reasoning Traces

論文の概要: Uncovering the Representation Geometry of Minimal Cores in Overcomplete Reasoning Traces

arxiv url: http://arxiv.org/abs/2605.14358v1
Date: Thu, 14 May 2026 04:35:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.619374
Title: Uncovering the Representation Geometry of Minimal Cores in Overcomplete Reasoning Traces
Title（参考訳）: オーバーコンプリート推論トレースにおける最小コアの表現幾何学の解明
Authors: Sanjoy Chowdhury, Dinesh Manocha,
Abstract要約: 言語モデルは、しばしば長いチェーン・オブ・ソート・トレースを生成するが、最終的な予測を維持するのに、この理由がどの程度必要かは定かではない。オーバーコンプリート推論トレースのレンズを通してこれを研究する。我々は最小のコアを最終回答または予測分布を保存するステップの最小サブセットとして定義する。
参考スコア（独自算出の注目度）: 56.497263592610295
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Language models often generate long chain-of-thought traces, but it remains unclear how much of this reasoning is necessary for preserving the final prediction. We study this through the lens of overcomplete reasoning traces: generated traces that contain more intermediate steps than are needed to support the model's answer. We define the minimal core as the smallest subset of steps that preserves either the final answer or predictive distribution, and introduce metrics for compression ratio, redundancy mass, step necessity, and necessity concentration. Across six deliberative reasoning benchmarks spanning arithmetic, competition mathematics, expert scientific reasoning, and commonsense multi-hop QA, we find substantial overcompleteness: on average, 46% of steps are removable under greedy minimal-core extraction while preserving the original answer in 86% of cases. We also find that predictive support is concentrated: the top three steps account for 65% of measured necessity mass on average. Beyond compression, minimal cores expose a cleaner geometry of reasoning: compared with full traces, they improve correct-incorrect trace separation by 11 points, reduce estimated intrinsic dimensionality by 34%, and transfer across model families with 85% off-diagonal answer retention. Theoretically, we establish existence of minimal sufficient subsets, local irreducibility guarantees for greedy elimination, and certificates of overcompleteness and sparse necessity. Together, these results suggest that full reasoning traces are often verbose and overcomplete, while minimal cores isolate the effective support underlying language-model predictions.
Abstract（参考訳）: 言語モデルは、しばしば長いチェーン・オブ・ソート・トレースを生成するが、最終的な予測を維持するのに、この理由がどの程度必要かは定かではない。我々はこれを、過剰完備な推論トレースのレンズを通して研究する:モデルの答えをサポートするために必要以上の中間ステップを含む生成されたトレース。最小限のコアを最終回答または予測分布のいずれかを保持する最小のステップのサブセットとして定義し、圧縮比、冗長質量、ステップ必要量、必要濃度の指標を導入する。算術、競争数学、専門科学推論、コモンセンスマルチホップQAにまたがる6つの熟考的推論ベンチマークにおいて、平均して46%のステップがグリージーな最小コア抽出の下で取り外し可能であり、元の回答を86%のケースで保存する。また、予測支援が集中していることがわかりました。トップ3ステップは平均で測定された必要質量の65%を占めています。圧縮以外にも、最小限のコアは推論のよりクリーンな幾何学を公開し、完全なトレースと比較すると、正しい不正確なトレース分離を11ポイント改善し、推定固有次元を34%削減し、85%の非対角応答保持を持つモデルファミリ間を転送する。理論的には、最小限の十分な部分集合、グリーディ除去のための局所的な既約性保証、過剰完全性とスパースの必要性の証明の存在を確立します。これらの結果は、完全な推論トレースが冗長で過剰な場合が多く、最小限のコアは、基礎となる言語モデル予測を効果的にサポートしていることを示唆している。

論文の概要: Uncovering the Representation Geometry of Minimal Cores in Overcomplete Reasoning Traces

関連論文リスト