Fugu-MT Paper Translation (Abstract): Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

Paper summary: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

arxiv url: http://arxiv.org/abs/2603.22784v1
Date: Tue, 24 Mar 2026 04:19:26 GMT
Status: Translation complete
Last updated in system: 2026-03-25 19:53:37.300629
Title: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models
Authors: Amir Azarmehr, Soheil Behnezhad, Alma Ghafari
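Abstract summary: The paper models test-time computation as an algorithm interacting with a Markov chain. It shows that while backtracking can reduce the number of generations exponentially, a very limited form of backtracking is theoretically sufficient. Motivated by this characterization of the optimal algorithm, the authors propose Caterpillar of Thoughts (CaT), a new test-time computation algorithm.
Reference score (attention score computed by this site): 3.810612452609132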
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) can often produce substantially better outputs when allowed to use additional test-time computation, such as sampling, chain of thought, backtracking, or revising partial solutions. Despite the growing empirical success of such techniques, there is limited theoretical understanding of how inference time computation should be structured, or what constitutes an optimal use of a fixed computation budget. We model test-time computation as an algorithm interacting with a Markov chain: at any point, the algorithm may resume generation from any previously observed state. That is, unlike standard Markov chains where the states are drawn passively, we allow the algorithm to backtrack to any previously observed state of the Markov chain at any time. Many of the existing test-time algorithms, such as Chain-of-Thought (CoT) (Wei et al., 2023), Tree-of-Thoughts (ToT) (Yao et al., 2023), or Best-of-$k$ (Brown et al., 2024) could be seen as specific algorithms in this model. We prove that while backtracking can reduce the number of generations exponentially, a very limited form of backtracking is theoretically sufficient. Namely, we show that the optimal algorithm always generates a caterpillar tree. That is, if we remove the leaves of the state tree generated by the optimal algorithm, we obtain a path. Motivated by our characterization of the optimal algorithm, we present Caterpillar of Thoughts (CaT), a new test-time computation algorithm, reducing the number of token/state generations. Our empirical evaluation shows that CaT, compared to ToT, achieves a better success rate while also reducing the number of token generations.
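The caterpillar property described in the abstract (removing the leaves of the state tree leaves a path) can be checked mechanically. The sketch below is an illustration only, not code from the paper. It assumes the state tree is rooted and given as a hypothetical `parent` mapping from each non-root state to the state it was generated from, and it reads "leaf" as a state with no children. Under those assumptions the non-leaf states form a path exactly when every state has at most one non-leaf child.

```python
from collections import defaultdict


def is_caterpillar(parent):
    """Return True if the rooted tree described by `parent` is a caterpillar.

    `parent` (a hypothetical input format) maps each non-root state to the
    state it was generated from. It is assumed to describe a valid tree
    with a single root.
    """
    children = defaultdict(list)
    for child, par in parent.items():
        children[par].append(child)

    # Internal states are those with at least one child; leaves have none.
    internal = set(children)

    # After removing leaves, only internal states remain. They stay connected
    # (the parent of an internal state is itself internal), so they form a
    # path exactly when no state has two or more internal children.
    return all(
        sum(1 for c in kids if c in internal) <= 1
        for kids in children.values()
    )


# A single chain of states: trivially a caterpillar.
chain = {1: 0, 2: 1, 3: 2}

# A spine 0 -> 1 -> 3 with extra leaves 2, 4, 5 hanging off it.
spine_with_leaves = {1: 0, 2: 0, 3: 1, 4: 1, 5: 3}

# Two branches that are both extended: not a caterpillar.
two_deep_branches = {1: 0, 2: 0, 3: 1, 4: 2}
```

In the example inputs, `chain` and `spine_with_leaves` satisfy the property, while `two_deep_branches` does not, because the root has two children that were each expanded further.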


Related papers