Fugu-MT 論文翻訳(概要): Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning

論文の概要: Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning

arxiv url: http://arxiv.org/abs/2602.06584v1
Date: Fri, 06 Feb 2026 10:23:18 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-09 22:18:26.354126
Title: Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning
Title（参考訳）: 数学推論のための潜在思考ベクトルを用いた推論時間再考
Authors: Deqian Kong, Minglu Zhao, Aoyang Qin, Bo Pang, Chenxin Tao, David Hartmann, Edouardo Honig, Dehong Xu, Amit Kumar, Matt Sarte, Chuan Li, Jianwen Xie, Ying Nian Wu,
Abstract要約: 推論時間再考(Inference-Time Rethinking)は、反復的な自己補正を可能にする生成フレームワークである。我々は、推論を連続的な潜在思考ベクトル(何を推論すべきか)と、このベクトル上で条件付けられたトレースを言語化する復号器に分解する。
参考スコア（独自算出の注目度）: 44.0190939401003
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Standard chain-of-thought reasoning generates a solution in a single forward pass, committing irrevocably to each token and lacking a mechanism to recover from early errors. We introduce Inference-Time Rethinking, a generative framework that enables iterative self-correction by decoupling declarative latent thought vectors from procedural generation. We factorize reasoning into a continuous latent thought vector (what to reason about) and a decoder that verbalizes the trace conditioned on this vector (how to reason). Beyond serving as a declarative buffer, latent thought vectors compress the reasoning structure into a continuous representation that abstracts away surface-level token variability, making gradient-based optimization over reasoning strategies well-posed. Our prior model maps unstructured noise to a learned manifold of valid reasoning patterns, and at test time we employ a Gibbs-style procedure that alternates between generating a candidate trace and optimizing the latent vector to better explain that trace, effectively navigating the latent manifold to refine the reasoning strategy. Training a 0.2B-parameter model from scratch on GSM8K, our method with 30 rethinking iterations surpasses baselines with 10 to 15 times more parameters, including a 3B counterpart. This result demonstrates that effective mathematical reasoning can emerge from sophisticated inference-time computation rather than solely from massive parameter counts.
Abstract（参考訳）: 標準的なチェーン・オブ・ソート推論は、1つのフォワードパスでソリューションを生成し、トークン毎に無効にコミットし、早期エラーから回復するメカニズムが欠如している。 Inference-Time Rethinkingは、宣言的潜在思考ベクトルを手続き生成から切り離して反復的自己補正を可能にする生成フレームワークである。我々は、推論を連続的な潜在思考ベクトル(何を推論すべきか)と、このベクトル上で条件付けられたトレースを(どのように推論するか)言語化するデコーダに分解する。宣言的なバッファとして機能するだけでなく、潜在思考ベクトルは推論構造を連続的な表現に圧縮し、表面レベルのトークンの可変性を抽象化し、推論戦略に対する勾配に基づく最適化をうまく提示する。我々の先行モデルは、非構造ノイズを有効な推論パターンの学習多様体にマッピングし、テスト時には、候補トレースの生成と潜在ベクトルの最適化を交互に交互に行うギブススタイルの手順を用いて、そのトレースをよりよく説明し、潜在多様体を効果的にナビゲートして推論戦略を洗練させる。 GSM8Kでスクラッチから0.2Bパラメータモデルをトレーニングし、30回繰り返し再考した手法は、3Bのパラメータを含む10～15倍のパラメータでベースラインを超えた。この結果は、膨大なパラメータ数からではなく、高度な推論時間計算から効果的な数学的推論が生まれることを証明している。

論文の概要: Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning

関連論文リスト