Fugu-MT 論文翻訳(概要): SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

論文の概要: SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

arxiv url: http://arxiv.org/abs/2604.14712v1
Date: Thu, 16 Apr 2026 07:22:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.777066
Title: SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval
Title（参考訳）: SGA-MCTS: 訓練不要な原子体験検索による実行からのプランニングの分離
Authors: Xin Xie, Dongyun Xue, Wuguannan Yao, Mingxiao Feng, Wengang Zhou, Xiang Qi, Houqiang Li, Peng Zhang,
Abstract要約: 我々は, LLM計画を非パラメトリック検索として活用するフレームワークである textbfSGA-MCTS を紹介する。オンラインでは、検索増強剤は、関連するステート-ゴール-アクション原子を取得するために、ハイブリッドシンボリック-セマンティック機構を使用する。 SGA-MCTSは、探索の重い計算コストを効果的に減らし、System 1推論速度におけるシステム2推論の深さを達成し、スケーラブルかつリアルタイムに自律的な計画が実現可能である。
参考スコア（独自算出の注目度）: 74.1918709002557
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: LLM-powered systems require complex multi-step decision-making abilities to solve real-world tasks, yet current planning approaches face a trade-off between the high latency of inference-time search and the limited generalization of supervised fine-tuning. To address this limitation, we introduce \textbf{SGA-MCTS}, a framework that casts LLM planning as non-parametric retrieval. Offline, we leverage Monte Carlo Tree Search (MCTS) to explore the solution space and distill high-fidelity trajectories into State-Goal-Action (SGA) atoms. These atoms are de-lexicalized primitives that abstract concrete entities into symbolic slots, preserving reusable causal logic while discarding domain-specific noise. Online, a retrieval-augmented agent employs a hybrid symbolic-semantic mechanism to fetch relevant SGAs and re-ground them into the current context as soft reasoning hints. Empirical results on complex benchmarks demonstrate that this paradigm enables frozen, open-weights models to match the performance of SOTA systems (e.g., GPT-5) without task-specific fine-tuning. By effectively amortizing the heavy computational cost of search, SGA-MCTS achieves System 2 reasoning depth at System 1 inference speeds, rendering autonomous planning both scalable and real-time feasible.
Abstract（参考訳）: LLMを利用したシステムでは、実世界の課題を解決するために複雑な多段階決定能力を必要とするが、現在の計画手法では、推論時間探索のレイテンシの高さと教師付き微調整の限定的な一般化とのトレードオフに直面している。この制限に対処するために, LLM 計画を非パラメトリック検索として活用するフレームワークである \textbf{SGA-MCTS} を導入する。オフラインではモンテカルロ木探索(MCTS)を用いて解空間を探索し,高忠実度軌道をSGA(State-Goal-Action)原子に蒸留する。これらの原子は非レキシカル化プリミティブであり、具体的実体を象徴的なスロットに抽象化し、ドメイン固有のノイズを捨てながら再利用可能な因果論理を保存する。オンライン上では、検索強化エージェントは、関連するSGAを取得し、それらをソフトな推論ヒントとして現在のコンテキストに再配置するために、ハイブリッドなシンボル・セマンティックメカニズムを使用する。複雑なベンチマークの実証的な結果は、このパラダイムにより、タスク固有の微調整なしでSOTAシステム(例えば、GPT-5)の性能に適合するフリーズなオープンウェイトモデルが実現可能であることを証明している。 SGA-MCTSは、探索の重い計算コストを効果的に減らし、System 1推論速度におけるシステム2推論の深さを達成し、スケーラブルかつリアルタイムに自律的な計画が実現可能である。

論文の概要: SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

関連論文リスト