Related papers: Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information

Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information

URL: http://arxiv.org/abs/2409.08434v1
Date: Fri, 13 Sep 2024 00:01:58 GMT
Title: Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
Authors: Ziyi Zhang, Yorie Nakahira, Guannan Qu,
Abstract summary: We propose an algorithm designed to achieve low regret in non-stationary MDPs by incorporating look-ahead predictions. Our theoretical analysis demonstrates that, under certain assumptions, the regret decreases exponentially as the look-ahead window expands. We validate our approach through simulations, confirming the efficacy of our algorithm in non-stationary environments.
Score: 11.679770353558041
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Policy design in non-stationary Markov Decision Processes (MDPs) is inherently challenging due to the complexities introduced by time-varying system transition and reward, which make it difficult for learners to determine the optimal actions for maximizing cumulative future rewards. Fortunately, in many practical applications, such as energy systems, look-ahead predictions are available, including forecasts for renewable energy generation and demand. In this paper, we leverage these look-ahead predictions and propose an algorithm designed to achieve low regret in non-stationary MDPs by incorporating such predictions. Our theoretical analysis demonstrates that, under certain assumptions, the regret decreases exponentially as the look-ahead window expands. When the system prediction is subject to error, the regret does not explode even if the prediction error grows sub-exponentially as a function of the prediction horizon. We validate our approach through simulations, confirming the efficacy of our algorithm in non-stationary environments.

Related papers

DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving [28.440795270548705]
DiffusionRollout is a novel selective rollout planning strategy for autoregressive diffusion models.<n>We introduce a mechanism that adaptively selects step sizes during autoregressive rollouts, improving long-term prediction reliability.
arXiv Detail & Related papers (2026-02-14T06:08:05Z)
Bridging the Gap Between Bayesian Deep Learning and Ensemble Weather Forecasts [100.26854618129039]
Weather forecasting is fundamentally challenged by the chaotic nature of the atmosphere.<n>Recent advances in Bayesian Deep Learning (BDL) offer a promising but often disconnected alternative.<n>We bridge these paradigms through a unified hybrid BDL framework for ensemble weather forecasting.
arXiv Detail & Related papers (2025-11-18T07:49:52Z)
ScenGAN: Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting [11.600987173982107]
This paper explores uncertainties in the realms of renewable power and deep learning.<n>An uncertainty-aware model is meticulously designed for renewable scenario forecasting.<n>The integration of meteorological information, forecasts, and historical trajectories in the processing layer improves the synergistic forecasting capability.
arXiv Detail & Related papers (2025-09-21T15:18:51Z)
Hierarchical Spatio-Temporal Uncertainty Quantification for Distributed Energy Adoption [7.520138182292564]
DER has introduced significant-temporal uncertainties in power grid management. Existing approaches often produce overly conservative uncertainty intervals at individual spatial units. This paper presents a novel hierarchical predictional model based on a conformal framework to address these challenges.
arXiv Detail & Related papers (2024-11-19T03:18:31Z)
Calibrated Probabilistic Forecasts for Arbitrary Sequences [58.54729945445505]
Real-world data streams can change unpredictably due to distribution shifts, feedback loops and adversarial actors. We present a forecasting framework ensuring valid uncertainty estimates regardless of how data evolves.
arXiv Detail & Related papers (2024-09-27T21:46:42Z)
Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution [110.99891169486366]
We propose a method that integrates efficient and precise uncertainty quantification into a deep learning-based surrogate model. Our method endows deep learning-based surrogate models with robust and efficient uncertainty quantification capabilities for both forward and inverse problems. Our method excels at propagating uncertainty over extended auto-regressive rollouts, making it suitable for scenarios involving long-term predictions.
arXiv Detail & Related papers (2024-02-13T11:22:59Z)
ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast [57.6987191099507]
We introduce Exloss, a novel loss function that performs asymmetric optimization and highlights extreme values to obtain accurate extreme weather forecast. We also introduce ExBooster, which captures the uncertainty in prediction outcomes by employing multiple random samples. Our solution can achieve state-of-the-art performance in extreme weather prediction, while maintaining the overall forecast accuracy comparable to the top medium-range forecast models.
arXiv Detail & Related papers (2024-02-02T10:34:13Z)
Performative Time-Series Forecasting [71.18553214204978]
We formalize performative time-series forecasting (PeTS) from a machine-learning perspective. We propose a novel approach, Feature Performative-Shifting (FPS), which leverages the concept of delayed response to anticipate distribution shifts. We conduct comprehensive experiments using multiple time-series models on COVID-19 and traffic forecasting tasks.
arXiv Detail & Related papers (2023-10-09T18:34:29Z)
Conformalized Multimodal Uncertainty Regression and Reasoning [0.9205582989348333]
This paper introduces a lightweight uncertainty estimator capable of predicting multimodal (disjoint) uncertainty bounds. We specifically discuss its application for visual odometry (VO), where environmental features such as flying domain symmetries can result in multimodal uncertainties.
arXiv Detail & Related papers (2023-09-20T02:40:59Z)
Propagating State Uncertainty Through Trajectory Forecasting [34.53847097769489]
Trajectory forecasting is surrounded by uncertainty as its inputs are produced by (noisy) upstream perception. Most trajectory forecasting methods do not account for upstream uncertainty, instead taking only the most-likely values. We present a novel method for incorporating perceptual state uncertainty in trajectory forecasting, a key component of which is a new statistical distance-based loss function.
arXiv Detail & Related papers (2021-10-07T08:51:16Z)
Distribution Preserving Multiple Hypotheses Prediction for Uncertainty Modeling [0.0]
We propose an alternative loss for preserving the Multiple Hypotheses Prediction approach. We empirically show that our approach yields more representative hypotheses on a synthetic and a real-world motion prediction data set. The outputs of the proposed method can directly be used in sampling-based Monte-Carlo methods.
arXiv Detail & Related papers (2021-10-06T15:36:17Z)
DEUP: Direct Epistemic Uncertainty Prediction [56.087230230128185]
Epistemic uncertainty is part of out-of-sample prediction error due to the lack of knowledge of the learner. We propose a principled approach for directly estimating epistemic uncertainty by learning to predict generalization error and subtracting an estimate of aleatoric uncertainty.
arXiv Detail & Related papers (2021-02-16T23:50:35Z)
Learning to Predict Error for MRI Reconstruction [67.76632988696943]
We demonstrate that predictive uncertainty estimated by the current methods does not highly correlate with prediction error. We propose a novel method that estimates the target labels and magnitude of the prediction error in two steps.
arXiv Detail & Related papers (2020-02-13T15:55:32Z)

This list is automatically generated from the titles and abstracts of the papers in this site.