Fugu-MT 論文翻訳(概要): Upfront Chain-of-Thought: A Cooperative Framework for Chain-of-Thought Compression

論文の概要: Upfront Chain-of-Thought: A Cooperative Framework for Chain-of-Thought Compression

arxiv url: http://arxiv.org/abs/2510.08647v1
Date: Thu, 09 Oct 2025 06:34:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 00:38:47.298057
Title: Upfront Chain-of-Thought: A Cooperative Framework for Chain-of-Thought Compression
Title（参考訳）: チェーン・オブ・ワウト:チェーン・オブ・ワウト・コンプレッションのための協調的枠組み
Authors: Chengzhengxu Li, Xiaoming Liu, Zhaohan Zhang, Shaochu Zhang, Shengchao Liu, Guoxin Ma, Yu Lan, Chao Shen,
Abstract要約: Upfront CoT (UCoT) は、Chain-of-Thought (CoT) 圧縮を自動化するために事前思考を組み込んだ効率的な推論フレームワークである。 UCoTはエグゼクタの強力な推論能力を維持しつつ、CoTの長さを大幅に削減している。
参考スコア（独自算出の注目度）: 29.354544133745453
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent developments have enabled advanced reasoning in Large Language Models (LLMs) via long Chain-of-Thought (CoT), while long CoT suffers from high computational costs and significant latency losses owing to the autoregressive nature of generative LLMs. CoT compression aims to improve efficiency in the reasoning process by reducing output length. Previous works trade reasoning efficiency by either laborious discrete prompt designing or the construction of external compressed CoT datasets that sacrifice key reasoning details. In this work, we propose Upfront CoT (UCoT): an efficient reasoning framework with upfront thought embedding to automate CoT compression. UCoT is a cooperative workflow involving a small model (compressor) and a large model (executor). The first stage of UCoT trains compressor to generate upfront thought embeddings rich in reasoning information for the executor, avoiding the drawbacks of manually designed prompts. The second stage optimizes executor to utilize upfront thought embeddings to derive the correct answer with short reasoning, using a reward mechanism. Extensive experiments show that UCoT maintains the powerful reasoning ability of executor while significantly reducing the length of CoT. It is worth mentioning that when applying UCoT to the Qwen2.5-7B-Instruct model, the usage of tokens on GSM8K dataset is reduced by 50\%, while the performance is 3.08\% higher than that of the state-of-the-art (SOTA) method. The code and dataset are in supplementary material.
Abstract（参考訳）: 近年の大規模言語モデル (LLMs) では, 長期のチェーン・オブ・ソート (CoT) による高度な推論が実現されているが, 生成LDMの自己回帰的性質により, CoT は高い計算コストと大幅な遅延損失に悩まされている。 CoT圧縮は、出力長を削減して推論プロセスの効率を向上させることを目的としている。従来の作業では、離散的なプロンプト設計や、重要な推論の詳細を犠牲にした外部圧縮されたCoTデータセットの構築による推論効率の取引が行われた。本研究では,CoT圧縮を自動化するために,事前思考を組み込んだ効率的な推論フレームワークであるUpfront CoT (UCoT)を提案する。 UCoTは、小さなモデル(圧縮機)と大きなモデル(実行機)を含む協調ワークフローである。 UCoTの最初の段階は、手動で設計されたプロンプトの欠点を避けるために、実行者の推論情報に富んだ事前の思考埋め込みを生成するために圧縮機を訓練した。第2段階は、報酬メカニズムを使用して、前もって思考の埋め込みを利用して正しい答えを短い推論で導き出すよう実行者を最適化する。大規模な実験により、UCoTはエグゼキュータの強力な推論能力を維持しつつ、CoTの長さを著しく減少させることが示された。なお、Qwen2.5-7B-InstructモデルにUCoTを適用すると、GSM8Kデータセット上のトークンの使用量は50\%削減され、パフォーマンスは最先端(SOTA)メソッドよりも3.08\%向上した。コードとデータセットは補助的な素材である。

論文の概要: Upfront Chain-of-Thought: A Cooperative Framework for Chain-of-Thought Compression

関連論文リスト