Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for
Constrained Scheduling Optimization
- URL: http://arxiv.org/abs/2006.13610v2
- Date: Mon, 29 Jun 2020 09:17:00 GMT
- Title: Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for
Constrained Scheduling Optimization
- Authors: Yaxiong Yuan, Lei Lei, Thang Xuan Vu, Symeon Chatzinotas, Sumei Sun
and Bjorn Ottersten
- Abstract summary: In unmanned aerial vehicle (UAV) applications, the UAV's limited energy supply and storage have triggered the development of intelligent energy-conserving solutions.
In this paper, we investigate energy-DSOS solution jointly optimizing data-transmission scheduling hovering time.
- Score: 30.742052801257998
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In unmanned aerial vehicle (UAV) applications, the UAV's limited energy
supply and storage have triggered the development of intelligent
energy-conserving scheduling solutions. In this paper, we investigate energy
minimization for UAV-aided communication networks by jointly optimizing
data-transmission scheduling and UAV hovering time. The formulated problem is
combinatorial and non-convex with bilinear constraints. To tackle the problem,
firstly, we provide an optimal relax-and-approximate solution and develop a
near-optimal algorithm. Both the proposed solutions are served as offline
performance benchmarks but might not be suitable for online operation. To this
end, we develop a solution from a deep reinforcement learning (DRL) aspect. The
conventional RL/DRL, e.g., deep Q-learning, however, is limited in dealing with
two main issues in constrained combinatorial optimization, i.e., exponentially
increasing action space and infeasible actions. The novelty of solution
development lies in handling these two issues. To address the former, we
propose an actor-critic-based deep stochastic online scheduling (AC-DSOS)
algorithm and develop a set of approaches to confine the action space. For the
latter, we design a tailored reward function to guarantee the solution
feasibility. Numerical results show that, by consuming equal magnitude of time,
AC-DSOS is able to provide feasible solutions and saves 29.94% energy compared
with a conventional deep actor-critic method. Compared to the developed
near-optimal algorithm, AC-DSOS consumes around 10% higher energy but reduces
the computational time from minute-level to millisecond-level.
Related papers
- Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II [52.083337333478674]
This paper proposes a weight-aware deep reinforcement learning (WADRL) approach designed to address the multiobjective vehicle routing problem with time windows (MOVRPTW)
The Non-dominated sorting genetic algorithm-II (NSGA-II) method is then employed to optimize the outcomes produced by the WADRL.
arXiv Detail & Related papers (2024-07-18T02:46:06Z) - DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach [49.56404236394601]
We formulate the problem of joint DNN partitioning, task offloading, and resource allocation in Vehicular Edge Computing.
Our objective is to minimize the DNN-based task completion time while guaranteeing the system stability over time.
We propose a Multi-Agent Diffusion-based Deep Reinforcement Learning (MAD2RL) algorithm, incorporating the innovative use of diffusion models.
arXiv Detail & Related papers (2024-06-11T06:31:03Z) - Energy-Efficient Power Control for Multiple-Task Split Inference in
UAVs: A Tiny Learning-Based Approach [27.48920259431965]
We present a two-timescale approach for energy minimization in split inference in unmanned aerial vehicles (UAVs)
We replace the optimization of transmit power with that of transmission time to decrease the computational complexity of OP.
Simulation results show that the proposed algorithm can achieve a higher probability of successful task completion with lower energy consumption.
arXiv Detail & Related papers (2023-12-31T10:16:59Z) - Joint User Association, Interference Cancellation and Power Control for
Multi-IRS Assisted UAV Communications [80.35959154762381]
Intelligent reflecting surface (IRS)-assisted unmanned aerial vehicle (UAV) communications are expected to alleviate the load of ground base stations in a cost-effective way.
Existing studies mainly focus on the deployment and resource allocation of a single IRS instead of multiple IRSs.
We propose a new optimization algorithm for joint IRS-user association, trajectory optimization of UAVs, successive interference cancellation (SIC) decoding order scheduling and power allocation.
arXiv Detail & Related papers (2023-12-08T01:57:10Z) - Multi-Objective Optimization for UAV Swarm-Assisted IoT with Virtual
Antenna Arrays [55.736718475856726]
Unmanned aerial vehicle (UAV) network is a promising technology for assisting Internet-of-Things (IoT)
Existing UAV-assisted data harvesting and dissemination schemes require UAVs to frequently fly between the IoTs and access points.
We introduce collaborative beamforming into IoTs and UAVs simultaneously to achieve energy and time-efficient data harvesting and dissemination.
arXiv Detail & Related papers (2023-08-03T02:49:50Z) - Lower Bounds and Optimal Algorithms for Smooth and Strongly Convex
Decentralized Optimization Over Time-Varying Networks [79.16773494166644]
We consider the task of minimizing the sum of smooth and strongly convex functions stored in a decentralized manner across the nodes of a communication network.
We design two optimal algorithms that attain these lower bounds.
We corroborate the theoretical efficiency of these algorithms by performing an experimental comparison with existing state-of-the-art methods.
arXiv Detail & Related papers (2021-06-08T15:54:44Z) - Energy Efficient Edge Computing: When Lyapunov Meets Distributed
Reinforcement Learning [12.845204986571053]
In this work, we study the problem of energy-efficient offloading enabled by edge computing.
In the considered scenario, multiple users simultaneously compete for radio and edge computing resources.
The proposed solution also allows to increase the network's energy efficiency compared to a benchmark approach.
arXiv Detail & Related papers (2021-03-31T11:02:29Z) - Sum-Rate Maximization for UAV-assisted Visible Light Communications
using NOMA: Swarm Intelligence meets Machine Learning [15.385078410753986]
We consider a UAV-assisted visible light communications (VLC) using non-orthogonal multiple-access networks.
We formulate a problem of power allocation and UAVs placement to maximize the sum of service users and UAV's position.
We propose using harris hawks optimization (HHO) to solve formulated problem and obtain an efficient solution.
arXiv Detail & Related papers (2021-01-10T08:21:49Z) - Deep Reinforcement Learning for Delay-Oriented IoT Task Scheduling in
Space-Air-Ground Integrated Network [24.022108191145527]
We investigate a computing task scheduling problem in space-air-ground integrated network (SAGIN) for delay-oriented Internet of Things (IoT) services.
In the considered scenario, an unmanned aerial vehicle (UAV) collects computing tasks from IoT devices and then makes online offloading decisions.
Our objective is to design a task scheduling policy that minimizes offloading and computing delay of all tasks given the UAV energy capacity constraint.
arXiv Detail & Related papers (2020-10-04T02:58:03Z) - Combining Deep Learning and Optimization for Security-Constrained
Optimal Power Flow [94.24763814458686]
Security-constrained optimal power flow (SCOPF) is fundamental in power systems.
Modeling of APR within the SCOPF problem results in complex large-scale mixed-integer programs.
This paper proposes a novel approach that combines deep learning and robust optimization techniques.
arXiv Detail & Related papers (2020-07-14T12:38:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.