An Imitation Learning Based Algorithm Enabling Priori Knowledge Transfer
in Modern Electricity Markets for Bayesian Nash Equilibrium Estimation
- URL: http://arxiv.org/abs/2305.06924v2
- Date: Fri, 12 May 2023 00:47:25 GMT
- Title: An Imitation Learning Based Algorithm Enabling Priori Knowledge Transfer
in Modern Electricity Markets for Bayesian Nash Equilibrium Estimation
- Authors: Ziqing Zhu, Ka Wing Chan, Siqi Bu, Ze Hu, Shiwei Xia
- Abstract summary: A Bayes-adaptive Markov Decision Process in FEM (BAMDP-FEM) is developed to model the GENCOs' bidding strategy optimization considering the priori knowledge.
A novel Multi-Agent Generative Adrial Imitation Learning algorithm (MAGAversa) is then proposed to enable GENCOs to learn simultaneously from priori knowledge and interactions with changing environments.
It is concluded that the optimal bidding strategies in the obtained BNE can always lead to more profits than NE due to the effective learning from the priori knowledge.
- Score: 1.5749416770494706
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Nash Equilibrium (NE) estimation in bidding games of electricity markets
is the key concern of both generation companies (GENCOs) for bidding strategy
optimization and the Independent System Operator (ISO) for market surveillance.
However, existing methods for NE estimation in emerging modern electricity
markets (FEM) are inaccurate and inefficient because the priori knowledge of
bidding strategies before any environment changes, such as load demand
variations, network congestion, and modifications of market design, is not
fully utilized. In this paper, a Bayes-adaptive Markov Decision Process in FEM
(BAMDP-FEM) is therefore developed to model the GENCOs' bidding strategy
optimization considering the priori knowledge. A novel Multi-Agent Generative
Adversarial Imitation Learning algorithm (MAGAIL-FEM) is then proposed to
enable GENCOs to learn simultaneously from priori knowledge and interactions
with changing environments. The obtained NE is a Bayesian Nash Equilibrium
(BNE) with priori knowledge transferred from the previous environment. In the
case study, the superiority of this proposed algorithm in terms of convergence
speed compared with conventional methods is verified. It is concluded that the
optimal bidding strategies in the obtained BNE can always lead to more profits
than NE due to the effective learning from the priori knowledge. Also, BNE is
more accurate and consistent with situations in real-world markets.
Related papers
- Model-based Causal Bayesian Optimization [74.78486244786083]
We introduce the first algorithm for Causal Bayesian Optimization with Multiplicative Weights (CBO-MW)
We derive regret bounds for CBO-MW that naturally depend on graph-related quantities.
Our experiments include a realistic demonstration of how CBO-MW can be used to learn users' demand patterns in a shared mobility system.
arXiv Detail & Related papers (2023-07-31T13:02:36Z) - HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and
Regime-Switch VAE [113.47287249524008]
It is still an open question to build a factor model that can conduct stock prediction in an online and adaptive setting.
We propose the first deep learning based online and adaptive factor model, HireVAE, at the core of which is a hierarchical latent space that embeds the relationship between the market situation and stock-wise latent factors.
Across four commonly used real stock market benchmarks, the proposed HireVAE demonstrate superior performance in terms of active returns over previous methods.
arXiv Detail & Related papers (2023-06-05T12:58:13Z) - E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition [69.87816981427858]
Most named entity recognition (NER) systems focus on improving model performance, ignoring the need to quantify model uncertainty.
Evidential deep learning (EDL) has recently been proposed as a promising solution to explicitly model predictive uncertainty for classification tasks.
We propose a trustworthy NER framework named E-NER by introducing two uncertainty-guided loss terms to the conventional EDL, along with a series of uncertainty-guided training strategies.
arXiv Detail & Related papers (2023-05-29T02:36:16Z) - Predictive Accuracy of a Hybrid Generalized Long Memory Model for Short
Term Electricity Price Forecasting [0.0]
This study investigates the predictive performance of a new hybrid model based on the Generalized long memory autoregressive model (k-factor GARMA)
The performance of the proposed model is evaluated using data from Nord Pool Electricity markets.
arXiv Detail & Related papers (2022-04-18T12:21:25Z) - Risk-Aware Control and Optimization for High-Renewable Power Grids [11.352041887858322]
RAMC project investigates how to move from this deterministic setting into a risk-aware framework.
This paper reviews how RAMC approaches risk-aware market clearing and presents some of its innovations in uncertainty quantification, optimization, and machine learning.
arXiv Detail & Related papers (2022-04-02T22:58:08Z) - Efficient Model-based Multi-agent Reinforcement Learning via Optimistic
Equilibrium Computation [93.52573037053449]
H-MARL (Hallucinated Multi-Agent Reinforcement Learning) learns successful equilibrium policies after a few interactions with the environment.
We demonstrate our approach experimentally on an autonomous driving simulation benchmark.
arXiv Detail & Related papers (2022-03-14T17:24:03Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - Sequential Information Design: Markov Persuasion Process and Its
Efficient Reinforcement Learning [156.5667417159582]
This paper proposes a novel model of sequential information design, namely the Markov persuasion processes (MPPs)
Planning in MPPs faces the unique challenge in finding a signaling policy that is simultaneously persuasive to the myopic receivers and inducing the optimal long-term cumulative utilities of the sender.
We design a provably efficient no-regret learning algorithm, the Optimism-Pessimism Principle for Persuasion Process (OP4), which features a novel combination of both optimism and pessimism principles.
arXiv Detail & Related papers (2022-02-22T05:41:43Z) - Learning Optimization Proxies for Large-Scale Security-Constrained
Economic Dispatch [11.475805963049808]
Security-Constrained Economic Dispatch (SCED) is a fundamental optimization model for Transmission System Operators (TSO)
This paper proposes to learn an optimization proxy for SCED, i.e., a Machine Learning (ML) model that can predict an optimal solution for SCED in milliseconds.
Numerical experiments are reported on the French transmission system, and demonstrate the approach's ability to produce, within a time frame that is compatible with real-time operations.
arXiv Detail & Related papers (2021-12-27T00:44:06Z) - Machine Learning-Driven Virtual Bidding with Electricity Market
Efficiency Analysis [7.014324899009043]
This paper develops a machine learning-driven portfolio optimization framework for virtual bidding in electricity markets.
We leverage the proposed algorithmic virtual bid trading strategy to evaluate both the profitability of the virtual bid portfolio and the efficiency of U.S. wholesale electricity markets.
arXiv Detail & Related papers (2021-04-06T19:30:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.