Fugu-MT Paper Translation (Summary): A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning

Paper summary: A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning

arxiv url: http://arxiv.org/abs/2510.17697v3
Date: Mon, 27 Oct 2025 15:15:26 GMT
Status: Translation complete
Last updated in system: 2025-10-28 13:14:10.584241
Title: A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
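Title (reference translation): A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Authors: Anjie Liu, Jianhong Wang, Samuel Kaski, Jun Wang, Mengyue Yang
Abstract summary: The paper adopts multi-agent influence diagrams (MAIDs) as a graphical framework to address the challenges of steering cooperative MARL. First, it introduces the concept of MARL interaction paradigms, using MAIDs to analyze and visualize both unguided self-organization and global guidance mechanisms. It then designs a new MARL interaction paradigm that is applied to only a single targeted agent.
Reference score (attention score computed independently by the site): 28.71333236116382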
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Steering cooperative multi-agent reinforcement learning (MARL) towards desired outcomes is challenging, particularly when global guidance from a human over the whole multi-agent system is impractical in large-scale MARL. On the other hand, designing external mechanisms (e.g., intrinsic rewards and human feedback) to coordinate agents mostly relies on empirical studies, lacking an easy-to-use research tool. In this work, we employ multi-agent influence diagrams (MAIDs) as a graphical framework to address the above issues. First, we introduce the concept of MARL interaction paradigms (orthogonal to MARL learning paradigms), using MAIDs to analyze and visualize both unguided self-organization and global guidance mechanisms in MARL. Then, we design a new MARL interaction paradigm, referred to as the targeted intervention paradigm, which is applied to only a single targeted agent, so that the problem of global guidance can be mitigated. In implementation, we introduce a causal inference technique, referred to as Pre-Strategy Intervention (PSI), to realize the targeted intervention paradigm. Since MAIDs can be regarded as a special class of causal diagrams, a composite desired outcome that integrates the primary task goal and an additional desired outcome can be achieved by maximizing the corresponding causal effect through the PSI. Moreover, the bundled relevance graph analysis of MAIDs provides a tool to identify whether an MARL learning paradigm is workable under the design of an MARL interaction paradigm. In experiments, we demonstrate the effectiveness of our proposed targeted intervention, and verify the result of relevance graph analysis.
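Abstract (reference translation): Steering MARL towards desired outcomes is difficult, particularly in large-scale MARL, where global guidance over the whole multi-agent system is impractical. Meanwhile, the design of external mechanisms for coordinating agents (e.g., intrinsic rewards and human feedback) relies mainly on empirical studies and lacks an easy-to-use research tool. This work uses multi-agent influence diagrams (MAIDs) as a graphical framework to address these issues. First, it introduces the concept of MARL interaction paradigms (orthogonal to MARL learning paradigms), using MAIDs to analyze and visualize both unguided self-organization and global guidance in MARL. It then designs a new MARL interaction paradigm that is applied to only a single targeted agent, which mitigates the problem of global guidance. A causal inference technique called Pre-Strategy Intervention (PSI) is introduced to realize this paradigm. Because MAIDs can be regarded as a special class of causal diagrams, a composite desired outcome that integrates the primary task goal and an additional desired outcome can be achieved by maximizing the corresponding causal effect through PSI. In addition, the bundled relevance graph analysis of MAIDs provides a tool for identifying whether an MARL learning paradigm is workable under the design of an MARL interaction paradigm. The experiments demonstrate the effectiveness of the proposed targeted intervention and verify the result of the relevance graph analysis.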
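
The abstract states that a composite desired outcome can be pursued by maximizing a causal effect of an intervention on a single targeted agent. The following is a minimal sketch of what such a causal effect could look like in a toy two-agent setting. It is not the paper's PSI method: the signal `z`, both policies, all probabilities, and the bonus weight are hypothetical values chosen only for illustration.

```python
from itertools import product


def targeted_policy(z):
    """P(a1 = 1 | z) for the targeted agent (hypothetical numbers).

    z is an assumed pre-strategy signal given only to this agent.
    """
    return 0.9 if z == 1 else 0.5


def follower_policy(a1):
    """P(a2 = 1 | a1) for the other agent, which tends to copy a1."""
    return 0.8 if a1 == 1 else 0.2


def composite_utility(a1, a2, bonus_weight=0.5):
    """Primary task reward (actions match) plus a weighted extra outcome
    (both agents choose action 1)."""
    task = 1.0 if a1 == a2 else 0.0
    extra = 1.0 if (a1 == 1 and a2 == 1) else 0.0
    return task + bonus_weight * extra


def expected_utility(z):
    """Expected composite utility under do(z), by exact enumeration."""
    total = 0.0
    for a1, a2 in product((0, 1), repeat=2):
        p1 = targeted_policy(z) if a1 == 1 else 1 - targeted_policy(z)
        p2 = follower_policy(a1) if a2 == 1 else 1 - follower_policy(a1)
        total += p1 * p2 * composite_utility(a1, a2)
    return total


def causal_effect():
    """Difference in expected utility between do(z=1) and do(z=0)."""
    return expected_utility(1) - expected_utility(0)


if __name__ == "__main__":
    print(round(expected_utility(0), 6))
    print(round(expected_utility(1), 6))
    print(round(causal_effect(), 6))
```

In this toy model only the targeted agent receives the signal, yet the expected composite utility of the whole system changes, because the second agent's action depends on the first agent's action. A method that maximizes the effect would search over how the signal is set rather than compare two fixed values as done here.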

Related papers