Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
- URL: http://arxiv.org/abs/2403.10761v1
- Date: Sat, 16 Mar 2024 01:51:42 GMT
- Title: Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
- Authors: Jizhe Dou, Haotian Zhang, Guodong Sun,
- Abstract summary: We consider a charger-assisted drone application: a drone is deployed to observe a set points of interest, while a charger can move to recharge the drone's battery.
We focus on the route and charging schedule of the drone and the mobile charger, to obtain high observation utility with the shortest possible time.
We present a hybrid-action deep reinforcement learning framework, called HaDMC, which uses a standard policy learning algorithm to generate latent continuous actions.
- Score: 7.577251571044739
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently there has been a growing interest in industry and academia, regarding the use of wireless chargers to prolong the operational longevity of unmanned aerial vehicles (commonly knowns as drones). In this paper we consider a charger-assisted drone application: a drone is deployed to observe a set points of interest, while a charger can move to recharge the drone's battery. We focus on the route and charging schedule of the drone and the mobile charger, to obtain high observation utility with the shortest possible time, while ensuring the drone remains operational during task execution. Essentially, this proposed drone-charger scheduling problem is a multi-stage decision-making process, in which the drone and the mobile charger act as two agents who cooperate to finish a task. The discrete-continuous hybrid action space of the two agents poses a significant challenge in our problem. To address this issue, we present a hybrid-action deep reinforcement learning framework, called HaDMC, which uses a standard policy learning algorithm to generate latent continuous actions. Motivated by representation learning, we specifically design and train an action decoder. It involves two pipelines to convert the latent continuous actions into original discrete and continuous actions, by which the drone and the charger can directly interact with environment. We embed a mutual learning scheme in model training, emphasizing the collaborative rather than individual actions. We conduct extensive numerical experiments to evaluate HaDMC and compare it with state-of-the-art deep reinforcement learning approaches. The experimental results show the effectiveness and efficiency of our solution.
Related papers
- Energy-Aware Multi-Agent Reinforcement Learning for Collaborative Execution in Mission-Oriented Drone Networks [3.4918110778972458]
Mission-oriented drone networks have been widely used for structural inspection, disaster monitoring, border surveillance, etc.
Due to the limited battery capacity of drones, mission execution strategy impacts network performance and mission completion.
We leverage multi-agent reinforcement learning (MARL) to manage the challenge in this study.
arXiv Detail & Related papers (2024-10-29T22:43:26Z) - Multi-model fusion for Aerial Vision and Dialog Navigation based on
human attention aids [69.98258892165767]
We present an aerial navigation task for the 2023 ICCV Conversation History.
We propose an effective method of fusion training of Human Attention Aided Transformer model (HAA-Transformer) and Human Attention Aided LSTM (HAA-LSTM) models.
arXiv Detail & Related papers (2023-08-27T10:32:52Z) - TransVisDrone: Spatio-Temporal Transformer for Vision-based
Drone-to-Drone Detection in Aerial Videos [57.92385818430939]
Drone-to-drone detection using visual feed has crucial applications, such as detecting drone collisions, detecting drone attacks, or coordinating flight with other drones.
Existing methods are computationally costly, follow non-end-to-end optimization, and have complex multi-stage pipelines, making them less suitable for real-time deployment on edge devices.
We propose a simple yet effective framework, itTransVisDrone, that provides an end-to-end solution with higher computational efficiency.
arXiv Detail & Related papers (2022-10-16T03:05:13Z) - VAE-Loco: Versatile Quadruped Locomotion by Learning a Disentangled Gait
Representation [78.92147339883137]
We show that it is pivotal in increasing controller robustness by learning a latent space capturing the key stance phases constituting a particular gait.
We demonstrate that specific properties of the drive signal map directly to gait parameters such as cadence, footstep height and full stance duration.
The use of a generative model facilitates the detection and mitigation of disturbances to provide a versatile and robust planning framework.
arXiv Detail & Related papers (2022-05-02T19:49:53Z) - Collaborative Target Search with a Visual Drone Swarm: An Adaptive
Curriculum Embedded Multistage Reinforcement Learning Approach [0.0]
We propose a novel data-efficient deep reinforcement learning (DRL) approach called adaptive curriculum embedded multistage learning (ACEMSL)
We decompose the collaborative target search task into several subtasks including individual obstacle avoidance, target search, and inter-agent collaboration, and progressively train the agents with multistage learning.
We deploy the trained model over a real visual drone swarm and perform CTS operations without fine-tuning.
arXiv Detail & Related papers (2022-04-26T09:32:22Z) - Autonomous Aerial Robot for High-Speed Search and Intercept Applications [86.72321289033562]
A fully-autonomous aerial robot for high-speed object grasping has been proposed.
As an additional sub-task, our system is able to autonomously pierce balloons located in poles close to the surface.
Our approach has been validated in a challenging international competition and has shown outstanding results.
arXiv Detail & Related papers (2021-12-10T11:49:51Z) - Drone swarm patrolling with uneven coverage requirements [22.475492500154573]
In this paper, we focus on visual coverage optimization with drone-mounted camera sensors.
We model these coverage requirements with relevance maps and propose a deep reinforcement learning algorithm to guide the swarm.
arXiv Detail & Related papers (2021-07-01T10:58:57Z) - Drones for Medical Delivery Considering Different Demands Classes: A
Markov Decision Process Approach for Managing Health Centers Dispatching
Medical Products [0.0]
We consider the problem of optimizing the distribution operations of a hub using drones to deliver medical supplies to different geographic regions.
By considering different geographic locations, we consider different classes of demand that require different flight ranges.
We classify the demands based on their distance from the drone hub, use a Markov decision process to model the problem and perform computational tests.
arXiv Detail & Related papers (2021-06-08T23:20:31Z) - Dogfight: Detecting Drones from Drones Videos [58.158988162743825]
This paper attempts to address the problem of drones detection from other flying drones variations.
The erratic movement of the source and target drones, small size, arbitrary shape, large intensity, and occlusion make this problem quite challenging.
To handle this, instead of using region-proposal based methods, we propose to use a two-stage segmentation-based approach.
arXiv Detail & Related papers (2021-03-31T17:43:31Z) - Time-Efficient Mars Exploration of Simultaneous Coverage and Charging
with Multiple Drones [14.160624396972707]
This paper presents a time-efficient scheme for Mars exploration by the cooperation of multiple drones and a rover.
A comprehensive framework has been developed with joint consideration for limited energy, sensor model, communication range and safety radius.
Extensive simulations have been conducted to demonstrate the remarkable performance of TIME-SC2.
arXiv Detail & Related papers (2020-11-16T07:28:37Z) - ReLMoGen: Leveraging Motion Generation in Reinforcement Learning for
Mobile Manipulation [99.2543521972137]
ReLMoGen is a framework that combines a learned policy to predict subgoals and a motion generator to plan and execute the motion needed to reach these subgoals.
Our method is benchmarked on a diverse set of seven robotics tasks in photo-realistic simulation environments.
ReLMoGen shows outstanding transferability between different motion generators at test time, indicating a great potential to transfer to real robots.
arXiv Detail & Related papers (2020-08-18T08:05:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.