Fugu-MT 論文翻訳(概要): Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

論文の概要: Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

arxiv url: http://arxiv.org/abs/2601.21358v2
Date: Wed, 04 Feb 2026 16:14:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-05 15:07:33.531198
Title: Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization
Title（参考訳）: 計画としての潜在的連鎖--言語化と推論の切り離し
Authors: Jiecong Wang, Hao Peng, Chunyang Liu,
Abstract要約: CoT(Chain-of-Thought)は、大規模言語モデル(LLM)に複雑な問題に取り組む権限を与える。最近の潜伏推論手法は、連続した隠蔽状態内で推論を行うことによって効率を最適化しようとする。 PLaTは、潜在推論を言語化から根本的に切り離すことによって計画として再構成するフレームワークである。
参考スコア（独自算出の注目度）: 9.193078163792427
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Chain-of-Thought (CoT) empowers Large Language Models (LLMs) to tackle complex problems, but remains constrained by the computational cost and reasoning path collapse when grounded in discrete token spaces. Recent latent reasoning approaches attempt to optimize efficiency by performing reasoning within continuous hidden states. However, these methods typically operate as opaque end-to-end mappings from explicit reasoning steps to latent states, and often require a pre-defined number of latent steps during inference. In this work, we introduce PLaT (Planning with Latent Thoughts), a framework that reformulates latent reasoning as planning by fundamentally decouple reasoning from verbalization. We model reasoning as a deterministic trajectory of latent planning states, while a separate Decoder grounds these thoughts into text when necessary. This decoupling allows the model to dynamically determine when to terminate reasoning rather than relying on fixed hyperparameters. Empirical results on mathematical benchmarks reveal a distinct trade-off: while PLaT achieves lower greedy accuracy than baselines, it demonstrates superior scalability in terms of reasoning diversity. This indicates that PLaT learns a robust, broader solution space, offering a transparent and scalable foundation for inference-time search. Our code can be found in https://github.com/yunsaijc/PLaT.
Abstract（参考訳）: CoT(Chain-of-Thought)は、LLM(Large Language Models)に複雑な問題に取り組む権限を与えるが、離散トークン空間における計算コストと推論経路の崩壊によって制約される。最近の潜伏推論手法は、連続した隠蔽状態内で推論を行うことによって効率を最適化しようとする。しかしながら、これらの手法は通常、明示的な推論ステップから潜伏状態への不透明なエンドツーエンドマッピングとして機能し、推論中に予め定義された潜伏ステップの数を必要とすることが多い。本研究は, PLaT (Planning with Latent Thoughts) を導入し, PLaT(Planning with Latent Thoughts) について述べる。我々は、潜在計画状態の決定論的軌跡として推論をモデル化し、別個のデコーダは必要に応じてこれらの思考をテキスト化します。この分離により、モデルは固定されたハイパーパラメータに頼るのではなく、推論をいつ終了するかを動的に決定できる。 PLaTはベースラインよりもグリージーの精度が低いが、推論の多様性の点で優れたスケーラビリティを示す。これは、PLaTが堅牢で広範なソリューション空間を学習し、推論時間検索のための透明でスケーラブルな基盤を提供することを示している。私たちのコードはhttps://github.com/yunsaijc/PLaT.comで見られます。

論文の概要: Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

関連論文リスト