Fugu-MT 論文翻訳(概要): COLT: Lightweight Multi-LLM Collaboration through Shared MCTS Reasoning for Model Compilation

論文の概要: COLT: Lightweight Multi-LLM Collaboration through Shared MCTS Reasoning for Model Compilation

arxiv url: http://arxiv.org/abs/2602.01935v1
Date: Mon, 02 Feb 2026 10:37:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-03 19:28:34.084749
Title: COLT: Lightweight Multi-LLM Collaboration through Shared MCTS Reasoning for Model Compilation
Title（参考訳）: COLT:モデルコンパイルのための共有MCTS推論による軽量多LLM協調
Authors: Annabelle Sujun Tang, Christopher Priebe, Lianhui Qin, Hadi Esmaeilzadeh,
Abstract要約: コンパイラ最適化のための軽量協調型マルチLLMフレームワークであるCOLTを提案する。重要な貢献は、単一の共有MCTSツリーをLLM間の協調基盤として使用することである。
参考スコア（独自算出の注目度）: 5.792898693767499
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Model serving costs dominate AI systems, making compiler optimization essential for scalable deployment. Recent works show that a large language model (LLM) can guide compiler search by reasoning over program structure and optimization history. However, using a single large model throughout the search is expensive, while smaller models are less reliable when used alone. Thus, this paper seeks to answer whether multi-LLM collaborative reasoning relying primarily on small LLMs can match or exceed the performance of a single large model. As such, we propose a lightweight collaborative multi-LLM framework, dubbed COLT, for compiler optimization that enables coordinated reasoning across multiple models within a single Monte Carlo tree search (MCTS) process. A key contribution is the use of a single shared MCTS tree as the collaboration substrate across LLMs, enabling the reuse of transformation prefixes and cross-model value propagation. Hence, we circumvent both heavy internal reasoning mechanisms and conventional agentic machinery that relies on external planners, multiple concurrent LLMs, databases, external memory/versioning of intermediate results, and controllers by simply endogenizing model selection within the lightweight MCTS optimization loop. Every iteration, the acting LLM proposes a joint action: (compiler transformation, model to be queried next). We also introduce a model-aware tree policy that biases search toward smaller models while preserving exploration, and a course-alteration mechanism that escalates to the largest model when the search exhibits persistent regressions attributable to smaller models.
Abstract（参考訳）: モデル提供コストがAIシステムを支配し、コンパイラの最適化がスケーラブルなデプロイメントに不可欠である。近年の研究では,大規模言語モデル (LLM) がプログラム構造と最適化履歴を推論することで,コンパイラ検索をガイドできることが示されている。しかし、検索全体を通して1つの大きなモデルを使用することは高価であるが、より小さなモデルは単独で使用すると信頼性が低下する。そこで本研究では,LLMを主目的とする複数LLM協調推論が単一大モデルの性能に適合するか否かを問う。そこで本研究では,モンテカルロ木探索(MCTS)プロセス内の複数モデル間の協調推論を可能にする,COLTと呼ばれる軽量協調型マルチLLMフレームワークを提案する。重要な貢献は、単一の共有MCTSツリーをLLM間の協調基盤として使用することで、変換プレフィックスの再利用とモデル間の値伝搬を可能にすることである。そこで我々は,軽量MCTS最適化ループ内にモデル選択を内在させることで,外部プランナ,複数並列LLM,データベース,中間結果の外部メモリ/変換,コントローラに依存する従来のエージェント機構を回避した。イテレーション毎に、アクション LLM はジョイントアクション(コンパイラ変換、次にクエリされるモデル)を提案する。また,探索を保存しながら,より小さなモデルに対する探索をバイアスするモデル対応ツリーポリシーと,より小さなモデルに起因する永続的回帰を示す場合に,最大モデルにエスカレーションするコース調整機構を導入する。

論文の概要: COLT: Lightweight Multi-LLM Collaboration through Shared MCTS Reasoning for Model Compilation

関連論文リスト