Fugu-MT Paper Translation (Abstract): Simple but Stable, Fast and Safe: Achieve End-to-end Control by High-Fidelity Differentiable Simulation

Paper summary: Simple but Stable, Fast and Safe: Achieve End-to-end Control by High-Fidelity Differentiable Simulation

arxiv url: http://arxiv.org/abs/2604.10548v1
Date: Sun, 12 Apr 2026 09:38:00 GMT
Status: Translation complete
Last updated in system: 2026-04-14 20:13:16.093165
Title: Simple but Stable, Fast and Safe: Achieve End-to-end Control by High-Fidelity Differentiable Simulation
Title (reference translation): Simple but stable, fast, and safe: achieving end-to-end control through high-fidelity differentiable simulation
Authors: Fanxing Li, Shengyang Wang, Yuxiang Huang, Fangyu Sun, Yufei Yan, Danping Zou, Wenxian Yu
Abstract summary: Obstacle avoidance is a fundamental vision-based task for enabling quadrotors to perform advanced applications. This paper proposes a novel end-to-end policy that maps depth images directly to low-level body-rate commands. The proposed approach achieves the highest success rate and the lowest jerk among state-of-the-art baselines.
Reference score (independently calculated attention score): 14.763759592028528
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Obstacle avoidance is a fundamental vision-based task essential for enabling quadrotors to perform advanced applications. When planning a trajectory, existing optimization-based and learning-based approaches typically treat the quadrotor as a point-mass model, issuing path or velocity commands that an outer-loop controller then tracks. However, at high speeds, planned trajectories sometimes become dynamically infeasible in actual flight, exceeding the capacity of the controller. In this paper, we propose a novel end-to-end policy that directly maps depth images to low-level body-rate commands, trained by reinforcement learning via differentiable simulation. High-fidelity simulation in training, after parameter identification, significantly reduces the gaps between training, simulation, and the real world. The analytical gradients provided by differentiable simulation are accurate, allowing the low-level policy to be trained efficiently without expert guidance. The policy employs a lightweight and extremely simple inference pipeline that runs without explicit mapping, backbone networks, primitives, recurrent structures, or backend controllers, and without curriculum or privileged guidance. By inferring low-level commands directly to the hardware controller, the method enables full-flight-envelope control and avoids the dynamic infeasibility issue. Experimental results demonstrate that the proposed approach achieves the highest success rate and the lowest jerk among state-of-the-art baselines across multiple benchmarks. The policy also exhibits strong generalization, deploying zero-shot in unseen outdoor environments, reaching speeds of up to 7.5 m/s, and flying stably in a super-dense forest.
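Abstract (reference translation): Obstacle avoidance is a fundamental vision-based task for enabling quadrotors to perform advanced applications. When planning trajectories, existing approaches, both optimization-based and learning-based, generally treat the quadrotor as a point-mass model, issue path or velocity commands, and track those commands with an outer-loop controller. At high speeds, however, the planned trajectory can become dynamically infeasible in actual flight. This paper proposes a novel end-to-end policy that maps depth images directly to low-level body-rate commands. High-fidelity simulation in training, after parameter identification, significantly reduces the gaps between training, simulation, and the real world. The analytical process provided by differentiable simulation supplies accurate gradients for efficiently training the low-level policy without expert guidance. The policy uses a lightweight, maximally simple inference pipeline with no explicit mapping, backbone networks, primitives, recurrent structures, or backend controllers, and no curriculum or privileged guidance. Inferring low-level commands directly to the hardware controller enables full-flight-envelope control and avoids the dynamic infeasibility problem. The policy also generalizes strongly, deploying zero-shot in unseen outdoor environments, reaching speeds of up to 7.5 m/s, and flying stably in a super-dense forest.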
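Illustrative note: the abstract's core idea, training a policy with analytic gradients obtained by differentiating through the simulator, can be sketched on a toy problem. The example below is not the authors' method or code. It assumes a 1-D point mass (far simpler than a high-fidelity quadrotor model), a hypothetical two-gain feedback policy (`kp`, `kd`), explicit Euler integration, and hand-derived forward-mode sensitivities in place of an autodiff framework. All names and parameter values are invented for illustration.

```python
def rollout_with_grad(kp, kd, p0=1.0, v0=0.0, target=0.0, dt=0.05, steps=40):
    """Simulate a toy 1-D point mass under the policy a = -kp*(p - target) - kd*v.

    Returns (loss, dloss/dkp, dloss/dkd), where the loss is the time-integrated
    squared position error. The gradient is exact for this discrete simulator:
    state sensitivities are propagated through every Euler step.
    """
    p, v = p0, v0
    dp = [0.0, 0.0]    # d(position)/d(kp), d(position)/d(kd)
    dv = [0.0, 0.0]    # d(velocity)/d(kp), d(velocity)/d(kd)
    loss = 0.0
    grad = [0.0, 0.0]
    for _ in range(steps):
        err = p - target
        a = -kp * err - kd * v
        # Chain rule: direct dependence on each gain plus dependence via the state.
        da = [
            -err - kp * dp[0] - kd * dv[0],
            -v - kp * dp[1] - kd * dv[1],
        ]
        # Explicit Euler step for the state and for its sensitivities.
        p_next = p + dt * v
        v_next = v + dt * a
        dp = [dp[i] + dt * dv[i] for i in range(2)]
        dv = [dv[i] + dt * da[i] for i in range(2)]
        p, v = p_next, v_next
        # Accumulate the loss and its gradient at the new state.
        err = p - target
        loss += dt * err * err
        grad = [grad[i] + dt * 2.0 * err * dp[i] for i in range(2)]
    return loss, grad[0], grad[1]


def finite_difference_grad(kp, kd, eps=1e-6):
    """Central-difference gradient, used only to sanity-check the analytic one."""
    g_kp = (rollout_with_grad(kp + eps, kd)[0] - rollout_with_grad(kp - eps, kd)[0]) / (2 * eps)
    g_kd = (rollout_with_grad(kp, kd + eps)[0] - rollout_with_grad(kp, kd - eps)[0]) / (2 * eps)
    return g_kp, g_kd


def train(kp=1.0, kd=0.5, lr=0.1, iters=50):
    """Plain gradient descent on the policy gains using the simulator gradient."""
    for _ in range(iters):
        _, g_kp, g_kd = rollout_with_grad(kp, kd)
        kp -= lr * g_kp
        kd -= lr * g_kd
    return kp, kd


if __name__ == "__main__":
    loss_before = rollout_with_grad(1.0, 0.5)[0]
    kp, kd = train()
    loss_after = rollout_with_grad(kp, kd)[0]
    print("loss before:", loss_before, "loss after:", loss_after)
```

In this sketch the gradient flows through the dynamics themselves rather than being estimated from sampled returns, which is the property the abstract credits for efficient training without expert guidance. A real system would replace the hand-derived sensitivities with an autodiff framework and the point mass with an identified quadrotor model.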

Related papers