Reinforcement learning reward function in unmanned aerial vehicle
control tasks
- URL: http://arxiv.org/abs/2203.10519v1
- Date: Sun, 20 Mar 2022 10:32:44 GMT
- Title: Reinforcement learning reward function in unmanned aerial vehicle
control tasks
- Authors: Mikhail S. Tovarnov and Nikita V. Bykov
- Abstract summary: The reward function is based on the construction and estimation of the time of simplified trajectories to the target.
The effectiveness of the reward function was tested in a newly developed virtual environment.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents a new reward function that can be used for deep
reinforcement learning in unmanned aerial vehicle (UAV) control and navigation
problems. The reward function is based on the construction and estimation of
the time of simplified trajectories to the target, which are third-order Bezier
curves. This reward function can be applied unchanged to solve problems in both
two-dimensional and three-dimensional virtual environments. The effectiveness
of the reward function was tested in a newly developed virtual environment,
namely, a simplified two-dimensional environment describing the dynamics of UAV
control and flight, taking into account the forces of thrust, inertia, gravity,
and aerodynamic drag. In this formulation, three tasks of UAV control and
navigation were successfully solved: UAV flight to a given point in space,
avoidance of interception by another UAV, and organization of interception of
one UAV by another. The three most relevant modern deep reinforcement learning
algorithms, Soft actor-critic, Deep Deterministic Policy Gradient, and Twin
Delayed Deep Deterministic Policy Gradient were used. All three algorithms
performed well, indicating the effectiveness of the selected reward function.
Related papers
- Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning [50.33447711072726]
This paper proposes a deep reinforcement learning-based model for decision-making in multi-role UAV cooperative pursuit-evasion game.
The proposed method enables autonomous decision-making of the UAVs in pursuit-evasion game scenarios.
arXiv Detail & Related papers (2024-11-05T10:45:30Z) - UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement Learning [79.16150966434299]
We formulate a UAV-enabled collaborative beamforming multi-objective optimization problem (UCBMOP) to maximize the transmission rate of the UVAA and minimize the energy consumption of all UAVs.
We use the heterogeneous-agent trust region policy optimization (HATRPO) as the basic framework, and then propose an improved HATRPO algorithm, namely HATRPO-UCB.
arXiv Detail & Related papers (2024-04-11T03:19:22Z) - UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary
3D Environment [17.531224704021273]
This paper focuses on the continuous control of the unmanned aerial vehicle (UAV) based on a deep reinforcement learning method.
We propose a deep reinforcement learning (DRL)-based method combined with human-in-the-loop, which allows the UAV to avoid obstacles automatically during flying.
arXiv Detail & Related papers (2023-04-07T01:44:05Z) - Advanced Algorithms of Collision Free Navigation and Flocking for
Autonomous UAVs [0.0]
This report contributes towards the state-of-the-art in UAV control for safe autonomous navigation and motion coordination of multi-UAV systems.
The first part of this report deals with single-UAV systems. The complex problem of three-dimensional (3D) collision-free navigation in unknown/dynamic environments is addressed.
The second part of this report addresses safe navigation for multi-UAV systems. Distributed motion coordination methods of multi-UAV systems for flocking and 3D area coverage are developed.
arXiv Detail & Related papers (2021-10-30T03:51:40Z) - Trajectory Design for UAV-Based Internet-of-Things Data Collection: A
Deep Reinforcement Learning Approach [93.67588414950656]
In this paper, we investigate an unmanned aerial vehicle (UAV)-assisted Internet-of-Things (IoT) system in a 3D environment.
We present a TD3-based trajectory design for completion time minimization (TD3-TDCTM) algorithm.
Our simulation results show the superiority of the proposed TD3-TDCTM algorithm over three conventional non-learning based baseline methods.
arXiv Detail & Related papers (2021-07-23T03:33:29Z) - Transferable Deep Reinforcement Learning Framework for Autonomous
Vehicles with Joint Radar-Data Communications [69.24726496448713]
We propose an intelligent optimization framework based on the Markov Decision Process (MDP) to help the AV make optimal decisions.
We then develop an effective learning algorithm leveraging recent advances of deep reinforcement learning techniques to find the optimal policy for the AV.
We show that the proposed transferable deep reinforcement learning framework reduces the obstacle miss detection probability by the AV up to 67% compared to other conventional deep reinforcement learning approaches.
arXiv Detail & Related papers (2021-05-28T08:45:37Z) - Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for
Cellular Offloading [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T20:22:05Z) - Autonomous and cooperative design of the monitor positions for a team of
UAVs to maximize the quantity and quality of detected objects [0.5801044612920815]
This paper tackles the problem of positioning a swarm of UAVs inside a completely unknown terrain.
YOLOv3 and a system to identify duplicate objects of interest were employed to assign a single score to each UAVs' configuration.
A novel navigation algorithm, capable of optimizing the previously defined score, is proposed.
arXiv Detail & Related papers (2020-07-02T16:52:57Z) - Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning
Approach [1.552282932199974]
We propose an autonomous UAV path planning framework using deep reinforcement learning approach.
The objective is to employ a self-trained UAV as a flying mobile unit to reach spatially distributed moving or static targets.
arXiv Detail & Related papers (2020-03-21T19:33:00Z) - Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep
Reinforcement Learning Approach [88.45509934702913]
We design a navigation policy for multiple unmanned aerial vehicles (UAVs) where mobile base stations (BSs) are deployed.
We incorporate different contextual information such as energy and age of information (AoI) constraints to ensure the data freshness at the ground BS.
By applying the proposed trained model, an effective real-time trajectory policy for the UAV-BSs captures the observable network states over time.
arXiv Detail & Related papers (2020-02-21T07:29:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.