Fugu-MT 論文翻訳(概要): M2-PALE: A Framework for Explaining Multi-Agent MCTS--Minimax Hybrids via Process Mining and LLMs

論文の概要: M2-PALE: A Framework for Explaining Multi-Agent MCTS--Minimax Hybrids via Process Mining and LLMs

arxiv url: http://arxiv.org/abs/2604.14687v1
Date: Thu, 16 Apr 2026 06:46:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.764985
Title: M2-PALE: A Framework for Explaining Multi-Agent MCTS--Minimax Hybrids via Process Mining and LLMs
Title（参考訳）: M2-PALE:プロセスマイニングとLCMによるマルチエージェントMCTS--Minimaxハイブリッドの解説フレームワーク
Authors: Yiyu Qian, Liyuan Zhao, Tim Miller,
Abstract要約: Monte-Carlo Tree Search (MCTS) は、シーケンシャルな意思決定領域におけるオンラインプランニングに広く使われているサンプリングベースの検索アルゴリズムである。人工知能の最近の進歩を推し進めることの成功にもかかわらず、MCTSエージェントの振る舞いを理解することは、開発者とユーザの両方にとって課題である。我々は、結果の意思決定ロジックを復号化するためにtextsfM2-PALE (MCTS--Minimax Process-Aided Linguistic Explanations) を導入する。我々は,より複雑な戦略領域においてハイブリッドエージェントを解釈するためのスケーラブルな基盤を確立するため,小規模チェッカー環境でのアプローチの有効性を実証する。
参考スコア（独自算出の注目度）: 1.9983412460505416
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Monte-Carlo Tree Search (MCTS) is a fundamental sampling-based search algorithm widely used for online planning in sequential decision-making domains. Despite its success in driving recent advances in artificial intelligence, understanding the behavior of MCTS agents remains a challenge for both developers and users. This difficulty stems from the complex search trees produced through the simulation of numerous future states and their intricate relationships. A known weakness of standard MCTS is its reliance on highly selective tree construction, which may lead to the omission of crucial moves and a vulnerability to tactical traps. To resolve this, we incorporate shallow, full-width Minimax search into the rollout phase of multi-agent MCTS to enhance strategic depth. Furthermore, to demystify the resulting decision-making logic, we introduce \textsf{M2-PALE} (MCTS--Minimax Process-Aided Linguistic Explanations). This framework employs process mining techniques, specifically the Alpha Miner, iDHM, and Inductive Miner algorithms, to extract underlying behavioral workflows from agent execution traces. These process models are then synthesized by LLMs to generate human-readable causal and distal explanations. We demonstrate the efficacy of our approach in a small-scale checkers environment, establishing a scalable foundation for interpreting hybrid agents in increasingly complex strategic domains.
Abstract（参考訳）: Monte-Carlo Tree Search (MCTS) は、シーケンシャルな意思決定領域におけるオンラインプランニングに広く用いられている基本サンプリングに基づく探索アルゴリズムである。人工知能の最近の進歩を推し進めることの成功にもかかわらず、MCTSエージェントの振る舞いを理解することは、開発者とユーザの両方にとって課題である。この難しさは、多くの将来の状態と複雑な関係のシミュレーションによって生成される複雑な探索木に起因している。標準的なMCTSの弱点は、高度に選択的な木の構造に依存しており、重要な動きが省略され、戦術的罠に対する脆弱性が生じる可能性があることである。これを解決するために,マルチエージェントMCTSのロールアウトフェーズに,浅い全幅のMinimax探索を組み込んで,戦略的深度を高める。さらに,結果の意思決定ロジックを復号化するために, MCTS--Minimax Process-Aided Linguistic Explanations(MCTS--Minimax Process-Aided Linguistic Explanations)を導入する。このフレームワークはプロセスマイニング技術、特にAlpha Miner、iDHM、Inductive Minerアルゴリズムを使用して、エージェントの実行トレースから基礎となる振る舞いワークフローを抽出する。これらのプロセスモデルはLLMによって合成され、人間の読みやすい因果関係と遠位関係の説明を生成する。我々は,より複雑な戦略領域においてハイブリッドエージェントを解釈するためのスケーラブルな基盤を確立するため,小規模チェッカー環境でのアプローチの有効性を実証する。

論文の概要: M2-PALE: A Framework for Explaining Multi-Agent MCTS--Minimax Hybrids via Process Mining and LLMs

関連論文リスト