Fugu-MT 論文翻訳(概要): AgenticDiffusion: Agentic Diffusion-based Path Planning for Vision-Based UAV Navigation

論文の概要: AgenticDiffusion: Agentic Diffusion-based Path Planning for Vision-Based UAV Navigation

arxiv url: http://arxiv.org/abs/2606.04111v1
Date: Tue, 02 Jun 2026 18:18:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-04 20:44:18.318739
Title: AgenticDiffusion: Agentic Diffusion-based Path Planning for Vision-Based UAV Navigation
Title（参考訳）: エージェント拡散:視覚に基づくUAVナビゲーションのためのエージェント拡散に基づく経路計画
Authors: Faryal Batool, Muhammad Ahsan Mustafa, Fawad Mehboob, Valerii Serpiva, Dzmitry Tsetserukou,
Abstract要約: 屋内UAVナビゲーションは、視野の限られた観測下での効率的な探索、シーン理解、信頼性の高い軌道実行を必要とする。本稿では,多視点UAVナビゲーションフレームワークであるAgenticDiffusionを提案する。このフレームワークは、適応的な視点選択、多段階のミッション実行、長距離ナビゲーション、安全な着陸場所選択を含む4つの現実のUAVナビゲーションシナリオで検証された。
参考スコア（独自算出の注目度）: 2.186077977059593
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Indoor UAV navigation requires efficient exploration, scene understanding, and reliable trajectory execution under limited field-of-view observations. Existing vision-based navigation frameworks typically rely on single-view observations, limiting their ability to reason about occlusions, target visibility, and global scene structure. In this work, we propose AgenticDiffusion, a multi-view UAV navigation framework that coordinates language-guided reasoning, open-vocabulary target grounding, vision-based diffusion planning, and NMPC within a unified aerial navigation pipeline. Given a natural language instruction and synchronized first-person-view (FPV) and top-view observations, the framework determines the most informative viewpoint for navigation and generates a mission plan prior to trajectory execution. The targets are localized using an open-vocabulary grounding model, after which viewpoint-specific diffusion planners generate navigation trajectories for UAV execution. Using complementary viewpoints, the proposed framework reduces repeated target exploration and improves navigation efficiency in cluttered indoor environments. The framework was validated in four real-world UAV navigation scenarios involving adaptive viewpoint selection, multi-stage mission execution, long-horizon navigation, and safe landing-site selection. The experimental results demonstrated an overall mission success rate of 80% in 40 real-world trials, while the diffusion planners achieved a trajectory generation success rate of 100%.
Abstract（参考訳）: 屋内UAVナビゲーションは、視野の限られた観測下での効率的な探索、シーン理解、信頼性の高い軌道実行を必要とする。既存の視覚ベースのナビゲーションフレームワークは、通常は単一視点の観察に依存しており、閉塞性、ターゲット視認性、グローバルなシーン構造について推論する能力を制限する。本研究では,多視点UAVナビゲーションフレームワークであるAgenticDiffusionを提案する。自然言語の指示と一対一の視点(FPV)とトップビューの観察を与えられたフレームワークは、ナビゲーションの最も有益な視点を決定し、軌道実行の前にミッションプランを生成する。ターゲットはオープンボキャブラリグラウンドモデルを用いてローカライズされ、その後、視点特異的拡散プランナーがUAV実行のためのナビゲーショントラジェクトリを生成する。相補的な視点を用いて、提案手法は反復的な目標探索を減らし、乱れた屋内環境における航法効率を向上させる。このフレームワークは、適応的な視点選択、多段階のミッション実行、長距離ナビゲーション、安全な着陸場所選択を含む4つの現実のUAVナビゲーションシナリオで検証された。実験の結果、実際の40回の試験で全体のミッション成功率は80%、拡散プランナーは軌道生成成功率は100%であった。

論文の概要: AgenticDiffusion: Agentic Diffusion-based Path Planning for Vision-Based UAV Navigation

関連論文リスト