Reinforcement Learning based Air Combat Maneuver Generation
- URL: http://arxiv.org/abs/2201.05528v1
- Date: Fri, 14 Jan 2022 15:55:44 GMT
- Title: Reinforcement Learning based Air Combat Maneuver Generation
- Authors: Muhammed Murat Ozbek and Emre Koyuncu
- Abstract summary: In this research we aimed our UAV which has a dubin vehicle dynamic property to move to the target in two dimensional space in an optimal path.
We did tests on two different environments and used simulations.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The advent of artificial intelligence technology paved the way of many
researches to be made within air combat sector. Academicians and many other
researchers did a research on a prominent research direction called autonomous
maneuver decision of UAV. Elaborative researches produced some outcomes, but
decisions that include Reinforcement Learning(RL) came out to be more
efficient. There have been many researches and experiments done to make an
agent reach its target in an optimal way, most prominent are Genetic
Algorithm(GA) , A star, RRT and other various optimization techniques have been
used. But Reinforcement Learning is the well known one for its success. In
DARPHA Alpha Dogfight Trials, reinforcement learning prevailed against a real
veteran F16 human pilot who was trained by Boeing. This successor model was
developed by Heron Systems. After this accomplishment, reinforcement learning
bring tremendous attention on itself. In this research we aimed our UAV which
has a dubin vehicle dynamic property to move to the target in two dimensional
space in an optimal path using Twin Delayed Deep Deterministic Policy Gradients
(TD3) and used in experience replay Hindsight Experience Replay(HER).We did
tests on two different environments and used simulations.
Related papers
- Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning [50.33447711072726]
This paper proposes a deep reinforcement learning-based model for decision-making in multi-role UAV cooperative pursuit-evasion game.
The proposed method enables autonomous decision-making of the UAVs in pursuit-evasion game scenarios.
arXiv Detail & Related papers (2024-11-05T10:45:30Z) - An Imitative Reinforcement Learning Framework for Autonomous Dogfight [20.150691753213817]
Unmanned Combat Aerial Vehicle (UCAV) dogfight plays a decisive role on the aerial battlefields.
This paper proposes a novel imitative reinforcement learning framework, which efficiently leverages expert data while enabling autonomous exploration.
The proposed framework can learn a successful dogfight policy of 'pursuit-lock-launch' for UCAVs.
arXiv Detail & Related papers (2024-06-17T13:59:52Z) - Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents [49.85633804913796]
We present an exploration-based trajectory optimization approach, referred to as ETO.
This learning method is designed to enhance the performance of open LLM agents.
Our experiments on three complex tasks demonstrate that ETO consistently surpasses baseline performance by a large margin.
arXiv Detail & Related papers (2024-03-04T21:50:29Z) - Human-Centric Aware UAV Trajectory Planning in Search and Rescue
Missions Employing Multi-Objective Reinforcement Learning with AHP and
Similarity-Based Experience Replay [0.0]
This paper explores the effect of human-centric factor in UAV trajectory planning for Search and Rescue missions.
Through a comprehensive survey, we investigate the impact of gender cues and anthropomorphism in UAV design on public acceptance and trust.
arXiv Detail & Related papers (2024-02-28T17:10:22Z) - Rethinking Closed-loop Training for Autonomous Driving [82.61418945804544]
We present the first empirical study which analyzes the effects of different training benchmark designs on the success of learning agents.
We propose trajectory value learning (TRAVL), an RL-based driving agent that performs planning with multistep look-ahead.
Our experiments show that TRAVL can learn much faster and produce safer maneuvers compared to all the baselines.
arXiv Detail & Related papers (2023-06-27T17:58:39Z) - Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning
for Task-oriented Dialogue Systems [111.80916118530398]
reinforcement learning (RL) techniques can naturally be utilized to train dialogue strategies to achieve user-specific goals.
This paper aims at answering the question of how to efficiently learn and leverage a reward function for training end-to-end (E2E) ToD agents.
arXiv Detail & Related papers (2023-02-20T22:10:04Z) - DIAMBRA Arena: a New Reinforcement Learning Platform for Research and
Experimentation [91.3755431537592]
This work presents DIAMBRA Arena, a new platform for reinforcement learning research and experimentation.
It features a collection of high-quality environments exposing a Python API fully compliant with OpenAI Gym standard.
They are episodic tasks with discrete actions and observations composed by raw pixels plus additional numerical values.
arXiv Detail & Related papers (2022-10-19T14:39:10Z) - A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a
Platform [0.0]
We proposed a reinforcement learning framework based on Gazebo that is a kind of physical simulation platform (ROS-RL)
We used three continuous action space reinforcement learning algorithms in the framework to dealing with the problem of autonomous landing of drones.
arXiv Detail & Related papers (2022-09-07T06:33:57Z) - Reward Uncertainty for Exploration in Preference-based Reinforcement
Learning [88.34958680436552]
We present an exploration method specifically for preference-based reinforcement learning algorithms.
Our main idea is to design an intrinsic reward by measuring the novelty based on learned reward.
Our experiments show that exploration bonus from uncertainty in learned reward improves both feedback- and sample-efficiency of preference-based RL algorithms.
arXiv Detail & Related papers (2022-05-24T23:22:10Z) - Homotopy Based Reinforcement Learning with Maximum Entropy for
Autonomous Air Combat [3.839929995011407]
The reinforcement learning (RL) method can significantly shorten the decision time via using neural networks.
The sparse reward problem limits its convergence speed and the artificial prior experience reward can easily deviate its optimal convergent direction of the original task.
We propose a homotopy-based soft actor-critic method (HSAC) which focuses on addressing these problems via following the homotopy path between the original task with sparse reward and the auxiliary task with artificial prior experience reward.
arXiv Detail & Related papers (2021-12-01T09:37:55Z) - Reinforcement Learning for Robust Missile Autopilot Design [0.0]
This work is pioneer in proposing Reinforcement Learning as a framework for flight control.
Under TRPO's methodology, the collected experience is augmented according to HER, stored in a replay buffer and sampled according to its significance.
Results show that it is possible both to achieve the optimal performance and to improve the agent's robustness to uncertainties.
arXiv Detail & Related papers (2020-11-26T09:30:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.