Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement
Learning
- URL: http://arxiv.org/abs/2104.04477v1
- Date: Fri, 9 Apr 2021 16:52:33 GMT
- Title: Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement
Learning
- Authors: Xueyuan Wang, M. Cenk Gursoy, Tugba Erpek and Yalin E. Sagduyu
- Abstract summary: Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks.
In this paper, we aim to find collision-free paths for multiple cellular-connected UAVs.
We propose an offline temporal difference (TD) learning algorithm with online signal-to-interference-plus-noise ratio mapping to solve the problem.
- Score: 1.2330326247154968
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Unmanned aerial vehicles (UAVs) are expected to be an integral part of
wireless networks. In this paper, we aim to find collision-free paths for
multiple cellular-connected UAVs, while satisfying requirements of connectivity
with ground base stations (GBSs) in the presence of a dynamic jammer. We first
formulate the problem as a sequential decision making problem in discrete
domain, with connectivity, collision avoidance, and kinematic constraints. We,
then, propose an offline temporal difference (TD) learning algorithm with
online signal-to-interference-plus-noise ratio (SINR) mapping to solve the
problem. More specifically, a value network is constructed and trained offline
by TD method to encode the interactions among the UAVs and between the UAVs and
the environment; and an online SINR mapping deep neural network (DNN) is
designed and trained by supervised learning, to encode the influence and
changes due to the jammer. Numerical results show that, without any information
on the jammer, the proposed algorithm can achieve performance levels close to
that of the ideal scenario with the perfect SINR-map. Real-time navigation for
multi-UAVs can be efficiently performed with high success rates, and collisions
are avoided.
Related papers
- Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs [21.195346908715972]
Unmanned aerial vehicles present an alternative means to offload data traffic from terrestrial BSs.
This paper presents a novel approach to efficiently serve multiple UAVs for data offloading from terrestrial BSs.
arXiv Detail & Related papers (2024-02-05T12:36:08Z) - Deep Reinforcement Learning for Interference Management in UAV-based 3D
Networks: Potentials and Challenges [137.47736805685457]
We show that interference can still be effectively mitigated even without knowing its channel information.
By harnessing interference, the proposed solutions enable the continued growth of civilian UAVs.
arXiv Detail & Related papers (2023-05-11T18:06:46Z) - Joint Optimization of Deployment and Trajectory in UAV and IRS-Assisted
IoT Data Collection System [25.32139119893323]
Unmanned aerial vehicles (UAVs) can be applied in many Internet of Things (IoT) systems.
The UAV-IoT wireless channels may be occasionally blocked by trees or high-rise buildings.
This article aims to minimize the energy consumption of the system by jointly optimizing the deployment and trajectory of the UAV.
arXiv Detail & Related papers (2022-10-27T06:27:40Z) - Federated Deep Learning Meets Autonomous Vehicle Perception: Design and
Verification [168.67190934250868]
Federated learning empowered connected autonomous vehicle (FLCAV) has been proposed.
FLCAV preserves privacy while reducing communication and annotation costs.
It is challenging to determine the network resources and road sensor poses for multi-stage training.
arXiv Detail & Related papers (2022-06-03T23:55:45Z) - Adaptive Anomaly Detection for Internet of Things in Hierarchical Edge
Computing: A Contextual-Bandit Approach [81.5261621619557]
We propose an adaptive anomaly detection scheme with hierarchical edge computing (HEC)
We first construct multiple anomaly detection DNN models with increasing complexity, and associate each of them to a corresponding HEC layer.
Then, we design an adaptive model selection scheme that is formulated as a contextual-bandit problem and solved by using a reinforcement learning policy network.
arXiv Detail & Related papers (2021-08-09T08:45:47Z) - Trajectory Design for UAV-Based Internet-of-Things Data Collection: A
Deep Reinforcement Learning Approach [93.67588414950656]
In this paper, we investigate an unmanned aerial vehicle (UAV)-assisted Internet-of-Things (IoT) system in a 3D environment.
We present a TD3-based trajectory design for completion time minimization (TD3-TDCTM) algorithm.
Our simulation results show the superiority of the proposed TD3-TDCTM algorithm over three conventional non-learning based baseline methods.
arXiv Detail & Related papers (2021-07-23T03:33:29Z) - Efficient Real-Time Image Recognition Using Collaborative Swarm of UAVs
and Convolutional Networks [9.449650062296824]
We present a strategy aiming at distributing inference requests to a swarm of resource-constrained UAVs that classifies captured images on-board.
We formulate the model as an optimization problem that minimizes the latency between acquiring images and making the final decisions.
We introduce an online solution, namely DistInference, to find the layers placement strategy that gives the best latency among the available UAVs.
arXiv Detail & Related papers (2021-07-09T19:47:02Z) - Learning-Based UAV Trajectory Optimization with Collision Avoidance and
Connectivity Constraints [0.0]
Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks.
In this paper, we reformulate the multi-UAV trajectory optimization problem with collision avoidance and wireless connectivity constraints.
We propose a decentralized deep reinforcement learning approach to solve the problem.
arXiv Detail & Related papers (2021-04-03T22:22:20Z) - Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for
Cellular Offloading [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T20:22:05Z) - Risk-Averse MPC via Visual-Inertial Input and Recurrent Networks for
Online Collision Avoidance [95.86944752753564]
We propose an online path planning architecture that extends the model predictive control (MPC) formulation to consider future location uncertainties.
Our algorithm combines an object detection pipeline with a recurrent neural network (RNN) which infers the covariance of state estimates.
The robustness of our methods is validated on complex quadruped robot dynamics and can be generally applied to most robotic platforms.
arXiv Detail & Related papers (2020-07-28T07:34:30Z) - UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement
Learning Approach [18.266087952180733]
We propose a new end-to-end reinforcement learning approach to UAV-enabled data collection from Internet of Things (IoT) devices.
An autonomous drone is tasked with gathering data from distributed sensor nodes subject to limited flying time and obstacle avoidance.
We show that our proposed network architecture enables the agent to make movement decisions for a variety of scenario parameters.
arXiv Detail & Related papers (2020-07-01T15:14:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.