Fugu-MT 論文翻訳(概要): Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking

論文の概要: Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking

arxiv url: http://arxiv.org/abs/2604.03404v1
Date: Fri, 03 Apr 2026 19:05:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:18.559758
Title: Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking
Title（参考訳）: アクティブマルチターゲットトラッキングのためのベイズ専門家選択による拡散政策
Authors: Haotian Xiang, Qin Lu, Yaakov Bar-Shalom,
Abstract要約: アクティブなマルチターゲットトラッキングには、未検出ターゲットの探索と不確実な追跡対象の活用のバランスを取るための移動ロボットが必要である。拡散政策は、専門家によるデモンストレーションからアクションシーケンスを学習することで、多様な行動戦略を捉えるための強力なアプローチとして現れている。本稿では,拡散政策の専門的選択をオフラインの文脈的帯域幅問題として定式化し,悲観的かつ不確実性を考慮した戦略選択のためのベイズ的枠組みを提案する。
参考スコア（独自算出の注目度）: 3.715635410272242
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Active multi-target tracking requires a mobile robot to balance exploration for undetected targets with exploitation of uncertain tracked ones. Diffusion policies have emerged as a powerful approach for capturing diverse behavioral strategies by learning action sequences from expert demonstrations. However, existing methods implicitly select among strategies through the denoising process, without uncertainty quantification over which strategy to execute. We formulate expert selection for diffusion policies as an offline contextual bandit problem and propose a Bayesian framework for pessimistic, uncertainty-aware strategy selection. A multi-head Variational Bayesian Last Layer (VBLL) model predicts the expected tracking performance of each expert strategy given the current belief state, providing both a point estimate and predictive uncertainty. Following the pessimism principle for offline decision-making, a Lower Confidence Bound (LCB) criterion then selects the expert whose worst-case predicted performance is best, avoiding overcommitment to experts with unreliable predictions. The selected expert conditions a diffusion policy to generate corresponding action sequences. Experiments on simulated indoor tracking scenarios demonstrate that our approach outperforms both the base diffusion policy and standard gating methods, including Mixture-of-Experts selection and deterministic regression baselines.
Abstract（参考訳）: アクティブなマルチターゲットトラッキングには、未検出ターゲットの探索と不確実な追跡対象の活用のバランスを取るための移動ロボットが必要である。拡散政策は、専門家によるデモンストレーションからアクションシーケンスを学習することで、多様な行動戦略を捉えるための強力なアプローチとして現れている。しかし、既存の手法では、どの戦略を実行するべきかを不確実な定量化することなく、デノナイズプロセスを通じて戦略の中から暗黙的に選択する。本稿では,拡散政策の専門的選択をオフラインの文脈的帯域幅問題として定式化し,悲観的かつ不確実性を考慮した戦略選択のためのベイズ的枠組みを提案する。マルチヘッド変分ベイズ最終層(VBLL)モデルは、現在の信念状態から、各専門家戦略の予測された追跡性能を予測し、点推定と予測の不確実性の両方を提供する。オフライン意思決定の悲観主義の原則に従い、LCB(Low Confidence Bound)基準は、信頼性の低い専門家への過度なコミットを避けるために、最悪のケースで予測されるパフォーマンスが最適である専門家を選択する。選択された専門家は、対応するアクションシーケンスを生成する拡散ポリシーを条件とする。シミュレーション室内追跡実験により,提案手法は,Mixture-of-Experts選択や決定論的回帰ベースラインなど,基本拡散ポリシと標準ゲーティング手法の両方より優れていることが示された。

論文の概要: Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking

関連論文リスト