Fugu-MT 論文翻訳(概要): R-Capsule: Compressing High-Level Plans for Efficient Large Language Model Reasoning

論文の概要: R-Capsule: Compressing High-Level Plans for Efficient Large Language Model Reasoning

arxiv url: http://arxiv.org/abs/2509.22131v2
Date: Mon, 29 Sep 2025 03:19:32 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 14:13:47.659755
Title: R-Capsule: Compressing High-Level Plans for Efficient Large Language Model Reasoning
Title（参考訳）: R-Capsule: 効率的な大規模言語モデル推論のための高レベルプラン圧縮
Authors: Hongyu Shan, Mingyang Song, Chang Dai, Di Liang, Han Chen,
Abstract要約: CoT(Chain-of-Thought)は、大規模言語モデル(LLM)が明確なステップバイステップの合理性を引き出すことによって、複雑な推論に対処するのに役立つ。提案するReasoning Capsule (R-Capsule) は,遅延推論の効率と明示的なCoTの透明性の両立を目的としたフレームワークである。
参考スコア（独自算出の注目度）: 25.87953249848607
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Chain-of-Thought (CoT) prompting helps Large Language Models (LLMs) tackle complex reasoning by eliciting explicit step-by-step rationales. However, CoT's verbosity increases latency and memory usage and may propagate early errors across long chains. We propose the Reasoning Capsule (R-Capsule), a framework that aims to combine the efficiency of latent reasoning with the transparency of explicit CoT. The core idea is to compress the high-level plan into a small set of learned latent tokens (a Reasoning Capsule) while keeping execution steps lightweight or explicit. This hybrid approach is inspired by the Information Bottleneck (IB) principle, where we encourage the capsule to be approximately minimal yet sufficient for the task. Minimality is encouraged via a low-capacity bottleneck, which helps improve efficiency. Sufficiency is encouraged via a dual objective: a primary task loss for answer accuracy and an auxiliary plan-reconstruction loss that encourages the capsule to faithfully represent the original textual plan. The reconstruction objective helps ground the latent space, thereby improving interpretability and reducing the use of uninformative shortcuts. Our framework strikes a balance between efficiency, accuracy, and interpretability, thereby reducing the visible token footprint of reasoning while maintaining or improving accuracy on complex benchmarks. Our codes are available at: https://anonymous.4open.science/r/Reasoning-Capsule-7BE0
Abstract（参考訳）: CoT(Chain-of-Thought)は、大規模言語モデル(LLM)が明確なステップバイステップの合理性を引き出すことによって、複雑な推論に対処するのに役立つ。しかし、CoTの冗長性はレイテンシとメモリ使用量を増やし、長いチェーンにまたがる早期エラーを伝播させる可能性がある。提案するReasoning Capsule (R-Capsule) は,遅延推論の効率と明示的なCoTの透明性の両立を目的としたフレームワークである。コアとなる考え方は、高レベルなプランを、軽量あるいは明示的な実行ステップを維持しながら、学習済みの潜在トークン(Reasoning Capsule)の小さなセットに圧縮することだ。このハイブリッドなアプローチはInformation Bottleneck(IB)の原則にインスパイアされ、カプセルを最小限にし、タスクに十分なものにすることを奨励します。最小化は低容量のボトルネックを通じて奨励され、効率を改善するのに役立つ。満足度は2つの目的により奨励される: 答えの正確性に対する第一のタスク損失と、カプセルが元のテキストプランを忠実に表現することを奨励する補助的な計画再構築損失である。再構成の目的は、潜伏空間を接地し、解釈性を改善し、不定形ショートカットの使用を減らすことに役立つ。筆者らのフレームワークは,効率,精度,解釈可能性のバランスを保ちながら,複雑なベンチマークの精度を維持したり改善したりしながら,推論の目に見えるトークンフットプリントを削減する。私たちのコードは、https://anonymous.4open.science/r/Reasoning-Capsule-7BE0で利用可能です。

論文の概要: R-Capsule: Compressing High-Level Plans for Efficient Large Language Model Reasoning

関連論文リスト