Fugu-MT 論文翻訳(概要): Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

論文の概要: Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

arxiv url: http://arxiv.org/abs/2603.08462v1
Date: Mon, 09 Mar 2026 14:56:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:16.217771
Title: Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck
Title（参考訳）: 圧縮としての推論:条件付き情報ボトルネックによる予算の統一
Authors: Fabio Valerio Massoli, Andrey Kuzmin, Arash Behboodi,
Abstract要約: 既存の「予算強制」手法は、本質的な推論と冗長なフィラーの両方を抑える。 Information Bottleneck (IB) の原理により, 効率的な推論を損失のある圧縮問題として再放送する。単純トークンカウントに基づくアプローチとは対照的に,先行する言語モデルの下でトークンコストを代入的に測定するセマンティック・プリミティブを導入する。
参考スコア（独自算出の注目度）: 12.360124156284305
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Chain-of-Thought (CoT) prompting improves LLM accuracy on complex tasks but often increases token usage and inference cost. Existing "Budget Forcing" methods reducing cost via fine-tuning with heuristic length penalties, suppress both essential reasoning and redundant filler. We recast efficient reasoning as a lossy compression problem under the Information Bottleneck (IB) principle, and identify a key theoretical gap when applying naive IB to transformers: attention violates the Markov property between prompt, reasoning trace, and response. To resolve this issue, we model CoT generation under the Conditional Information Bottleneck (CIB) principle, where the reasoning trace Z acts as a computational bridge that contains only the information about the response Y that is not directly accessible from the prompt X. This yields a general Reinforcement Learning objective: maximize task reward while compressing completions under a prior over reasoning traces, subsuming common heuristics (e.g., length penalties) as special cases (e.g., uniform priors). In contrast to naive token-counting-based approaches, we introduce a semantic prior that measures token cost by surprisal under a language model prior. Empirically, our CIB objective prunes cognitive bloat while preserving fluency and logic, improving accuracy at moderate compression and enabling aggressive compression with minimal accuracy drop.
Abstract（参考訳）: CoT(Chain-of-Thought)は複雑なタスクにおけるLCMの精度を向上させるが、トークンの使用量や推論コストを増大させる。既存の「予算強制」手法は、ヒューリスティックな長さのペナルティを微調整することでコストを削減し、本質的な推論と冗長なフィラーの両方を抑える。 Information Bottleneck (IB) の原理により,効率的な推論を損失のある圧縮問題として再放送し,トランスフォーマーにネイブIBを適用する際の重要な理論的ギャップを同定する。この問題を解決するために、条件情報ボトルネック(CIB)の原理に基づいて、推論トレースZがプロンプトXから直接アクセスできない応答Yに関する情報のみを含む計算ブリッジとして機能するCoT生成をモデル化する。単純トークンカウントに基づくアプローチとは対照的に,先行する言語モデルの下でトークンコストを代入的に測定するセマンティック・プリミティブを導入する。実験的に、CIBの目的は、流速と論理を保ちながら認知的肥大を誘発し、適度な圧縮における精度を改善し、最小限の精度低下で攻撃的圧縮を可能にする。

論文の概要: Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

関連論文リスト