Reinforcement Learning to Optimize the Logistics Distribution Routes of
  Unmanned Aerial Vehicle
        - URL: http://arxiv.org/abs/2004.09864v1
- Date: Tue, 21 Apr 2020 09:42:03 GMT
- Title: Reinforcement Learning to Optimize the Logistics Distribution Routes of
  Unmanned Aerial Vehicle
- Authors: Linfei Feng
- Abstract summary: This paper proposes an improved method to achieve path planning for UAVs in complex surroundings: multiple no-fly zones.
The results show the feasibility and efficiency of the model applying in this kind of complicated situation.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Path planning methods for the unmanned aerial vehicle (UAV) in goods delivery
have drawn great attention from industry and academics because of its
flexibility which is suitable for many situations in the "Last Kilometer"
between customer and delivery nodes. However, the complicated situation is
still a problem for traditional combinatorial optimization methods. Based on
the state-of-the-art Reinforcement Learning (RL), this paper proposed an
improved method to achieve path planning for UAVs in complex surroundings:
multiple no-fly zones. The improved approach leverages the attention mechanism
and includes the embedding mechanism as the encoder and three different widths
of beam search (i.e.,~1, 5, and 10) as the decoders. Policy gradients are
utilized to train the RL model for obtaining the optimal strategies during
inference. The results show the feasibility and efficiency of the model
applying in this kind of complicated situation. Comparing the model with the
results obtained by the optimization solver OR-tools, it improves the
reliability of the distribution system and has a guiding significance for the
broad application of UAVs.
 
      
        Related papers
        - Preference Optimization for Combinatorial Optimization Problems [54.87466279363487]
 Reinforcement Learning (RL) has emerged as a powerful tool for neural optimization, enabling models learns that solve complex problems without requiring expert knowledge.<n>Despite significant progress, existing RL approaches face challenges such as diminishing reward signals and inefficient exploration in vast action spaces.<n>We propose Preference Optimization, a novel method that transforms quantitative reward signals into qualitative preference signals via statistical comparison modeling.
 arXiv  Detail & Related papers  (2025-05-13T16:47:00Z)
- Attention-based UAV Trajectory Optimization for Wireless Power   Transfer-assisted IoT Systems [19.680892841701674]
 We present an Attention-based UAV Trajectory Optimization framework based on the graph transformer.
In ATOM, a graph encoder is used to calculate the self-attention characteristics of all IoTDs.
TENMA then trains the ATOM using an improved Actor-Critic method, in which the real reward of the system is applied as the baseline to reduce variances in the critic network.
 arXiv  Detail & Related papers  (2025-02-23T02:57:06Z)
- Preventing Local Pitfalls in Vector Quantization via Optimal Transport [77.15924044466976]
 We introduce OptVQ, a novel vector quantization method that employs the Sinkhorn algorithm to optimize the optimal transport problem.
Our experiments on image reconstruction tasks demonstrate that OptVQ achieves 100% codebook utilization and surpasses current state-of-the-art VQNs in reconstruction quality.
 arXiv  Detail & Related papers  (2024-12-19T18:58:14Z)
- Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered   Policy Learning via Asynchronous Federated Inverse Reinforcement Learning [67.95280175998792]
 A novel adversarial imitation learning (GAIL)-powered policy learning approach is proposed for optimizing beamforming, spectrum allocation, and remote user equipment (RUE) association ins.
We employ inverse RL (IRL) to automatically learn reward functions without manual tuning.
We show that the proposed MA-AL method outperforms traditional RL approaches, achieving a $14.6%$ improvement in convergence and reward value.
 arXiv  Detail & Related papers  (2024-09-27T13:05:02Z)
- UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement   Learning [79.16150966434299]
 We formulate a UAV-enabled collaborative beamforming multi-objective optimization problem (UCBMOP) to maximize the transmission rate of the UVAA and minimize the energy consumption of all UAVs.
We use the heterogeneous-agent trust region policy optimization (HATRPO) as the basic framework, and then propose an improved HATRPO algorithm, namely HATRPO-UCB.
 arXiv  Detail & Related papers  (2024-04-11T03:19:22Z)
- Reinforcement Learning for Solving Stochastic Vehicle Routing Problem [0.09831489366502298]
 This study addresses a gap in the utilization of Reinforcement Learning (RL) and Machine Learning (ML) techniques in solving the Vehicle Routing Problem (SVRP)
We propose a novel end-to-end framework that comprehensively addresses the key sources of SVRP and utilizes an RL agent with a simple yet effective architecture and a tailored training method.
Our proposed model demonstrates superior performance compared to a widely adopted state-of-the-art meeuristic, achieving a significant 3.43% reduction in travel costs.
 arXiv  Detail & Related papers  (2023-11-13T19:46:22Z)
- Enhancing Secrecy in UAV RSMA Networks: Deep Unfolding Meets Deep   Reinforcement Learning [0.8287206589886881]
 We consider the network of the secrecy in multiple unmanned aerial vehicles (UAV) rate trajectory (SMAR)
The proposed deep reinforcement learning (DRL) has shown great performance and outperformed other DRL-based methods in the literature.
 arXiv  Detail & Related papers  (2023-09-30T12:26:24Z)
- A Hybrid Framework of Reinforcement Learning and Convex Optimization for
  UAV-Based Autonomous Metaverse Data Collection [16.731929552692524]
 This paper considers a UAV-assisted Metaverse network, in which UAVs extend the coverage of the base station (BS) to collect the Metaverse data generated at roadside units (RSUs)
To improve the data collection efficiency, resource allocation and trajectory control are integrated into the system model.
Based on the proposed UAV-assisted Metaverse network system model, we design a hybrid framework with reinforcement learning and convex optimization to cooperatively solve the time-sequential optimization problem.
 arXiv  Detail & Related papers  (2023-05-29T11:49:20Z)
- Joint Optimization of Deployment and Trajectory in UAV and IRS-Assisted
  IoT Data Collection System [25.32139119893323]
 Unmanned aerial vehicles (UAVs) can be applied in many Internet of Things (IoT) systems.
The UAV-IoT wireless channels may be occasionally blocked by trees or high-rise buildings.
This article aims to minimize the energy consumption of the system by jointly optimizing the deployment and trajectory of the UAV.
 arXiv  Detail & Related papers  (2022-10-27T06:27:40Z)
- Transferable Deep Reinforcement Learning Framework for Autonomous
  Vehicles with Joint Radar-Data Communications [69.24726496448713]
 We propose an intelligent optimization framework based on the Markov Decision Process (MDP) to help the AV make optimal decisions.
We then develop an effective learning algorithm leveraging recent advances of deep reinforcement learning techniques to find the optimal policy for the AV.
We show that the proposed transferable deep reinforcement learning framework reduces the obstacle miss detection probability by the AV up to 67% compared to other conventional deep reinforcement learning approaches.
 arXiv  Detail & Related papers  (2021-05-28T08:45:37Z)
- Efficient UAV Trajectory-Planning using Economic Reinforcement Learning [65.91405908268662]
 We introduce REPlanner, a novel reinforcement learning algorithm inspired by economic transactions to distribute tasks between UAVs.
We formulate the path planning problem as a multi-agent economic game, where agents can cooperate and compete for resources.
As the system computes task distributions via UAV cooperation, it is highly resilient to any change in the swarm size.
 arXiv  Detail & Related papers  (2021-03-03T20:54:19Z)
- Optimization-driven Deep Reinforcement Learning for Robust Beamforming
  in IRS-assisted Wireless Communications [54.610318402371185]
 Intelligent reflecting surface (IRS) is a promising technology to assist downlink information transmissions from a multi-antenna access point (AP) to a receiver.
We minimize the AP's transmit power by a joint optimization of the AP's active beamforming and the IRS's passive beamforming.
We propose a deep reinforcement learning (DRL) approach that can adapt the beamforming strategies from past experiences.
 arXiv  Detail & Related papers  (2020-05-25T01:42:55Z)
- Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep
  Reinforcement Learning Approach [88.45509934702913]
 We design a navigation policy for multiple unmanned aerial vehicles (UAVs) where mobile base stations (BSs) are deployed.
We incorporate different contextual information such as energy and age of information (AoI) constraints to ensure the data freshness at the ground BS.
By applying the proposed trained model, an effective real-time trajectory policy for the UAV-BSs captures the observable network states over time.
 arXiv  Detail & Related papers  (2020-02-21T07:29:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.