Fugu-MT 論文翻訳(概要): Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

論文の概要: Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

arxiv url: http://arxiv.org/abs/2505.15778v1
Date: Wed, 21 May 2025 17:29:15 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-22 15:42:59.802745
Title: Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
Title（参考訳）: ソフトシンキング - 連続概念空間におけるLLMの推論可能性の解き方
Authors: Zhen Zhang, Xuehai He, Weixiang Yan, Ao Shen, Chenyang Zhao, Shuohang Wang, Yelong Shen, Xin Eric Wang,
Abstract要約: ソフトシンキング(Soft Thinking)は、ソフトで抽象的な概念トークンを生成することによって、人間のような「ソフト」推論をエミュレートする訓練不要な手法である。これらの概念トークンは、連続的な概念空間を形成するトークン埋め込みの確率重み付き混合によって生成される。本質的に、生成された概念トークンは関連する離散トークンから複数の意味をカプセル化し、暗黙的に様々な推論経路を探索する。
参考スコア（独自算出の注目度）: 62.54887038032942
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Human cognition typically involves thinking through abstract, fluid concepts rather than strictly using discrete linguistic tokens. Current reasoning models, however, are constrained to reasoning within the boundaries of human language, processing discrete token embeddings that represent fixed points in the semantic space. This discrete constraint restricts the expressive power and upper potential of such reasoning models, often causing incomplete exploration of reasoning paths, as standard Chain-of-Thought (CoT) methods rely on sampling one token per step. In this work, we introduce Soft Thinking, a training-free method that emulates human-like "soft" reasoning by generating soft, abstract concept tokens in a continuous concept space. These concept tokens are created by the probability-weighted mixture of token embeddings, which form the continuous concept space, enabling smooth transitions and richer representations that transcend traditional discrete boundaries. In essence, each generated concept token encapsulates multiple meanings from related discrete tokens, implicitly exploring various reasoning paths to converge effectively toward the correct answer. Empirical evaluations on diverse mathematical and coding benchmarks consistently demonstrate the effectiveness and efficiency of Soft Thinking, improving pass@1 accuracy by up to 2.48 points while simultaneously reducing token usage by up to 22.4% compared to standard CoT. Qualitative analysis further reveals that Soft Thinking outputs remain highly interpretable and readable, highlighting the potential of Soft Thinking to break the inherent bottleneck of discrete language-based reasoning. Code is available at https://github.com/eric-ai-lab/Soft-Thinking.
Abstract（参考訳）: 人間の認知は通常、個別の言語トークンを厳密に使用するのではなく、抽象的で流動的な概念を通して思考する。しかし、現在の推論モデルは、人間の言語の境界内での推論に制約され、意味空間における固定点を表す離散トークン埋め込みを処理する。この離散的な制約は、そのような推論モデルの表現力と上層ポテンシャルを制限し、標準的なChain-of-Thought (CoT) 法がステップ毎に1つのトークンをサンプリングすることに依存するため、しばしば推論経路の不完全な探索を引き起こす。本研究では,連続的な概念空間において,ソフトで抽象的な概念トークンを生成することによって,人間のような「ソフト」推論をエミュレートする学習自由な手法であるSoft Thinkingを紹介する。これらの概念トークンは、連続的な概念空間を形成するトークン埋め込みの確率重み付け混合によって生成され、スムーズな遷移とよりリッチな表現を可能にし、従来の離散的境界を超越する。本質的に、生成された概念トークンは、関連する離散トークンから複数の意味をカプセル化し、様々な推論経路を暗黙的に探索して、正しい解に向かって効果的に収束する。様々な数学的およびコーディングベンチマークに関する実証的な評価は、ソフトシンキングの有効性と効率を一貫して示し、パス@1の精度を2.48ポイントまで改善し、トークンの使用量を標準のCoTに比べて最大22.4%削減した。質的な分析により、Soft Thinkingのアウトプットは高い解釈性と可読性を維持し、Soft Thinkingが独立した言語に基づく推論の固有のボトルネックを突破する可能性を強調している。コードはhttps://github.com/eric-ai-lab/Soft-Thinking.comで入手できる。

論文の概要: Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

関連論文リスト