Fugu-MT 論文翻訳(概要): SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

論文の概要: SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

arxiv url: http://arxiv.org/abs/2510.24940v1
Date: Tue, 28 Oct 2025 20:11:54 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-30 15:50:44.779869
Title: SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens
Title（参考訳）: SemCoT: セマンティック・アライン・インシシット・トークンによるチェーン・オブ・サート推論の高速化
Authors: Yinhan He, Wendy Zheng, Yaochen Zhu, Zaiyi Zheng, Lin Su, Sriram Vasudevan, Qi Guo, Liangjie Hong, Jundong Li,
Abstract要約: CoT(Chain-of-Thought)の推論は、効率クリティカルなアプリケーションへの大量展開を妨げる。本稿では,SemCoTと呼ばれる意味的に整合した暗黙的CoTフレームワークを提案する。
参考スコア（独自算出の注目度）: 43.78883511257627
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The verbosity of Chain-of-Thought (CoT) reasoning hinders its mass deployment in efficiency-critical applications. Recently, implicit CoT approaches have emerged, which encode reasoning steps within LLM's hidden embeddings (termed ``implicit reasoning'') rather than explicit tokens. This approach accelerates CoT by reducing the reasoning length and bypassing some LLM components. However, existing implicit CoT methods face two significant challenges: (1) they fail to preserve the semantic alignment between the implicit reasoning (when transformed to natural language) and the ground-truth reasoning, resulting in a significant CoT performance degradation, and (2) they focus on reducing the length of the implicit reasoning; however, they neglect the considerable time cost for an LLM to generate one individual implicit reasoning token. To tackle these challenges, we propose a novel semantically-aligned implicit CoT framework termed SemCoT. In particular, for the first challenge, we design a contrastively trained sentence transformer that evaluates semantic alignment between implicit and explicit reasoning, which is used to enforce semantic preservation during implicit reasoning optimization. To address the second challenge, we introduce an efficient implicit reasoning generator by finetuning a lightweight language model using knowledge distillation. This generator is guided by our sentence transformer to distill ground-truth reasoning into semantically aligned implicit reasoning, while also optimizing for accuracy. SemCoT is the first approach that enhances CoT efficiency by jointly optimizing token-level generation speed and preserving semantic alignment with ground-truth reasoning. Extensive experiments demonstrate the superior performance of SemCoT compared to state-of-the-art methods in both efficiency and effectiveness. Our code can be found at https://github.com/YinhanHe123/SemCoT/.
Abstract（参考訳）: CoT(Chain-of-Thought)推論の冗長性は、効率クリティカルなアプリケーションへの大量展開を妨げる。最近、暗黙のCoTアプローチが出現し、明示的なトークンではなく、LLMの隠れ埋め込み('implicit reasoning'' と呼ばれる)内の推論ステップをエンコードしている。このアプローチは、推理長を減らし、いくつかのLCMコンポーネントをバイパスすることにより、CoTを加速する。しかし、既存の暗黙のCoT法では、(1)暗黙の推論(自然言語に変換された場合)と暗示の推論とのセマンティックアライメントを維持することができず、また(2)暗示の推論の長さを減らすことに重点を置いている。これらの課題に対処するために,SemCoTと呼ばれる意味的に整合した暗黙的CoTフレームワークを提案する。特に、最初の課題として、暗黙的推論と明示的推論のセマンティックアライメントを評価する、対照的に訓練された文変換器を設計する。第2の課題に対処するために,知識蒸留を用いた軽量言語モデルを微調整し,効率的な暗黙的推論生成手法を提案する。本生成装置は, 文変換器により, 意味的に整合した暗黙的推論に基礎構造推論を蒸留し, 精度を最適化する。 SemCoTはトークンレベルの生成速度を最適化し、接地構造推論とセマンティックアライメントを保持することでCoT効率を向上させる最初のアプローチである。大規模な実験により,SemCoTの効率と有効性の両方において,最先端の手法と比較して優れた性能を示した。私たちのコードはhttps://github.com/YinhanHe123/SemCoT/で確認できます。

論文の概要: SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

関連論文リスト