An Optimization Method-Assisted Ensemble Deep Reinforcement Learning
Algorithm to Solve Unit Commitment Problems
- URL: http://arxiv.org/abs/2206.04249v1
- Date: Thu, 9 Jun 2022 03:36:18 GMT
- Title: An Optimization Method-Assisted Ensemble Deep Reinforcement Learning
Algorithm to Solve Unit Commitment Problems
- Authors: Jingtao Qin, Yuanqi Gao, Mikhail Bragin, Nanpeng Yu
- Abstract summary: Unit commitment is a fundamental problem in the day-ahead electricity market.
It is critical to solve UC problems efficiently.
Recent advances in artificial intelligence have demonstrated the capability of reinforcement learning to solve UC problems.
- Score: 3.303380427144773
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Unit commitment (UC) is a fundamental problem in the day-ahead electricity
market, and it is critical to solve UC problems efficiently. Mathematical
optimization techniques like dynamic programming, Lagrangian relaxation, and
mixed-integer quadratic programming (MIQP) are commonly adopted for UC
problems. However, the calculation time of these methods increases at an
exponential rate with the amount of generators and energy resources, which is
still the main bottleneck in industry. Recent advances in artificial
intelligence have demonstrated the capability of reinforcement learning (RL) to
solve UC problems. Unfortunately, the existing research on solving UC problems
with RL suffers from the curse of dimensionality when the size of UC problems
grows. To deal with these problems, we propose an optimization method-assisted
ensemble deep reinforcement learning algorithm, where UC problems are
formulated as a Markov Decision Process (MDP) and solved by multi-step deep
Q-learning in an ensemble framework. The proposed algorithm establishes a
candidate action set by solving tailored optimization problems to ensure a
relatively high performance and the satisfaction of operational constraints.
Numerical studies on IEEE 118 and 300-bus systems show that our algorithm
outperforms the baseline RL algorithm and MIQP. Furthermore, the proposed
algorithm shows strong generalization capacity under unforeseen operational
conditions.
Related papers
- Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization [50.485788083202124]
Reinforcement Learning (RL) plays a crucial role in aligning large language models with human preferences and improving their ability to perform complex tasks.
We introduce Direct Q-function Optimization (DQO), which formulates the response generation process as a Markov Decision Process (MDP) and utilizes the soft actor-critic (SAC) framework to optimize a Q-function directly parameterized by the language model.
Experimental results on two math problem-solving datasets, GSM8K and MATH, demonstrate that DQO outperforms previous methods, establishing it as a promising offline reinforcement learning approach for aligning language models.
arXiv Detail & Related papers (2024-10-11T23:29:20Z) - Distributed Quantum Approximate Optimization Algorithm on Integrated High-Performance Computing and Quantum Computing Systems for Large-Scale Optimization [1.7099366779394252]
Quantum approximated optimization algorithm (QAOA) has shown promise for solving optimization problems by providing quantum speedup on gate-based quantum computing systems.
We propose a distributed QAOA (DQAOA), which leverages a high-performance computing-quantum computing integrated system.
We successfully optimize photonic structures using AL-DQAOA, indicating that solving real-world optimization problems using gate-based quantum computing is feasible with our strategies.
arXiv Detail & Related papers (2024-07-29T17:42:25Z) - Machine Learning Insides OptVerse AI Solver: Design Principles and
Applications [74.67495900436728]
We present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI solver.
We showcase our methods for generating complex SAT and MILP instances utilizing generative models that mirror multifaceted structures of real-world problem.
We detail the incorporation of state-of-the-art parameter tuning algorithms which markedly elevate solver performance.
arXiv Detail & Related papers (2024-01-11T15:02:15Z) - SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving [64.38649623473626]
Large Language Models (LLMs) have driven substantial progress in artificial intelligence.
We propose a novel framework called textbfSEquential subtextbfGoal textbfOptimization (SEGO) to enhance LLMs' ability to solve mathematical problems.
arXiv Detail & Related papers (2023-10-19T17:56:40Z) - Solution to Advanced Manufacturing Process Problems using Cohort
Intelligence Algorithm with Improved Constraint Handling Approaches [0.07989135005592125]
Cohort Intelligence (CI) algorithm is a socio inspired optimization technique which is successfully applied for solving several unconstrained & constrained real-world problems from the domains such as design, manufacturing, supply chain, healthcare, etc.
arXiv Detail & Related papers (2023-10-16T05:40:23Z) - Hybrid ACO-CI Algorithm for Beam Design problems [0.4397520291340694]
A novel hybrid version of the Ant colony optimization (ACO) method is developed using the sample space reduction technique of the Cohort Intelligence (CI) algorithm.
The proposed work could be investigate for real world applications encompassing domains of engineering, and health care problems.
arXiv Detail & Related papers (2023-03-29T04:37:14Z) - Multi-Resource Allocation for On-Device Distributed Federated Learning
Systems [79.02994855744848]
This work poses a distributed multi-resource allocation scheme for minimizing the weighted sum of latency and energy consumption in the on-device distributed federated learning (FL) system.
Each mobile device in the system engages the model training process within the specified area and allocates its computation and communication resources for deriving and uploading parameters, respectively.
arXiv Detail & Related papers (2022-11-01T14:16:05Z) - Solving the vehicle routing problem with deep reinforcement learning [0.0]
This paper focuses on the application of RL for the Vehicle Routing Problem (VRP), a famous problem that belongs to the class of NP-Hard problems.
In a second phase, the neural architecture behind the Actor and Critic has been established, choosing to adopt a neural architecture based on the Convolutional neural networks.
Experiments performed on a wide range of instances show that the algorithm has good generalization capabilities and can reach good solutions in a short time.
arXiv Detail & Related papers (2022-07-30T12:34:26Z) - A Two-stage Framework and Reinforcement Learning-based Optimization
Algorithms for Complex Scheduling Problems [54.61091936472494]
We develop a two-stage framework, in which reinforcement learning (RL) and traditional operations research (OR) algorithms are combined together.
The scheduling problem is solved in two stages, including a finite Markov decision process (MDP) and a mixed-integer programming process, respectively.
Results show that the proposed algorithms could stably and efficiently obtain satisfactory scheduling schemes for agile Earth observation satellite scheduling problems.
arXiv Detail & Related papers (2021-03-10T03:16:12Z) - Combining Deep Learning and Optimization for Security-Constrained
Optimal Power Flow [94.24763814458686]
Security-constrained optimal power flow (SCOPF) is fundamental in power systems.
Modeling of APR within the SCOPF problem results in complex large-scale mixed-integer programs.
This paper proposes a novel approach that combines deep learning and robust optimization techniques.
arXiv Detail & Related papers (2020-07-14T12:38:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.