A VAE Approach to Sample Multivariate Extremes
- URL: http://arxiv.org/abs/2306.10987v1
- Date: Mon, 19 Jun 2023 14:53:40 GMT
- Title: A VAE Approach to Sample Multivariate Extremes
- Authors: Nicolas Lafon, Philippe Naveau, Ronan Fablet
- Abstract summary: This paper describes a variational autoencoder (VAE) approach for sampling heavy-tailed distributions likely to have extremes of particularly large intensities.
We illustrate the relevance of our approach on a synthetic data set and on a real data set of discharge measurements along the Danube river network.
In addition to outperforming the standard VAE for the tested data sets, we also provide a comparison with a competing EVT-based generative approach.
- Score: 6.548734807475054
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Generating accurate extremes from an observational data set is crucial when
seeking to estimate risks associated with the occurrence of future extremes
which could be larger than those already observed. Applications range from the
occurrence of natural disasters to financial crashes. Generative approaches
from the machine learning community do not apply to extreme samples without
careful adaptation. Besides, asymptotic results from extreme value theory (EVT)
give a theoretical framework to model multivariate extreme events, especially
through the notion of multivariate regular variation. Bridging these two
fields, this paper details a variational autoencoder (VAE) approach for
sampling multivariate heavy-tailed distributions, i.e., distributions likely to
have extremes of particularly large intensities. We illustrate the relevance of
our approach on a synthetic data set and on a real data set of discharge
measurements along the Danube river network. The latter shows the potential of
our approach for flood risk assessment. In addition to outperforming the
standard VAE for the tested data sets, we also provide a comparison with a
competing EVT-based generative approach. On the tested cases, our approach
improves the learning of the dependency structure between extremes.
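The abstract refers to multivariate regular variation without spelling it out. As a rough illustration only (this is not the paper's VAE, and nothing below is taken from the paper), a regularly varying vector can be thought of as a heavy-tailed radius multiplied by an independent angle on the unit simplex. The sketch below assumes a standard Pareto radius with tail index `alpha` and a flat Dirichlet angle; both choices, and the function name `sample_regularly_varying`, are hypothetical and picked for simplicity.

```python
import numpy as np


def sample_regularly_varying(n, dim, alpha, seed=0):
    """Draw n points X = R * Theta (illustrative assumption, not the paper's model).

    R     : Pareto radius with P(R > r) = r**(-alpha) for r >= 1
    Theta : angle on the L1 unit simplex, here a flat Dirichlet
    """
    rng = np.random.default_rng(seed)
    # Inverse-CDF sampling; 1 - U lies in (0, 1], so the radius is finite and >= 1.
    u = 1.0 - rng.random(n)
    radius = u ** (-1.0 / alpha)
    # The angle carries the dependence structure between components.
    angle = rng.dirichlet(np.ones(dim), size=n)
    return radius[:, None] * angle, radius, angle


x, radius, angle = sample_regularly_varying(n=10_000, dim=3, alpha=2.0)
# For non-negative components the L1 norm of x equals the sampled radius.
print(x.shape, float(np.max(radius)))
```

A smaller `alpha` gives a heavier tail, so rare draws become much larger than the bulk of the sample, which is the regime where a generative model has to extrapolate beyond what it has observed.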
Related papers
- Evidential time-to-event prediction model with well-calibrated uncertainty estimation [12.446406577462069]
We introduce an evidential regression model designed especially for time-to-event prediction tasks.
The most plausible event time is directly quantified by aggregated Gaussian random fuzzy numbers (GRFNs).
Our model achieves both accurate and reliable performance, outperforming state-of-the-art methods.
arXiv Detail & Related papers (2024-11-12T15:06:04Z)
- Risk and cross validation in ridge regression with correlated samples [72.59731158970894]
We provide sharp asymptotics for the in- and out-of-sample risks of ridge regression when the data points have arbitrary correlations.
We further extend our analysis to the case where the test point has non-trivial correlations with the training set, a setting often encountered in time series forecasting.
We validate our theory across a variety of high dimensional data.
arXiv Detail & Related papers (2024-08-08T17:27:29Z)
- Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions [22.765095010254118]
The goal of this paper is to develop distributionally robust optimization (DRO) estimators, specifically for multidimensional Extreme Value Theory (EVT) statistics.
In order to mitigate over-conservative estimates while enhancing out-of-sample performance, we study DRO estimators informed by semi-parametric max-stable constraints in the space of point processes.
Both approaches are validated using synthetically generated data, recovering prescribed characteristics, and verifying the efficacy of the proposed techniques.
arXiv Detail & Related papers (2024-07-31T19:45:27Z)
- Risk-Sensitive Diffusion: Robustly Optimizing Diffusion Models with Noisy Samples [58.68233326265417]
Non-image data are prevalent in real applications and tend to be noisy.
Risk-sensitive SDE is a type of stochastic differential equation (SDE) parameterized by the risk vector.
We conduct systematic studies for both Gaussian and non-Gaussian noise distributions.
arXiv Detail & Related papers (2024-02-03T08:41:51Z)
- Distributionally Robust Skeleton Learning of Discrete Bayesian Networks [9.46389554092506]
We consider the problem of learning the exact skeleton of general discrete Bayesian networks from potentially corrupted data.
We propose to optimize the most adverse risk over a family of distributions within bounded Wasserstein distance or KL divergence to the empirical distribution.
We present efficient algorithms and show the proposed methods are closely related to the standard regularized regression approach.
arXiv Detail & Related papers (2023-11-10T15:33:19Z)
- Pseudo value-based Deep Neural Networks for Multi-state Survival Analysis [9.659041001051415]
We propose a new class of pseudo-value-based deep learning models for multi-state survival analysis.
Our proposed models achieve state-of-the-art results under various censoring settings.
arXiv Detail & Related papers (2022-07-12T03:58:05Z)
- Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions [91.63716984911278]
We introduce a novel Mixture of Normal-Inverse Gamma distributions (MoNIG) algorithm, which efficiently estimates uncertainty in principle for adaptive integration of different modalities and produces a trustworthy regression result.
Experimental results on both synthetic and different real-world data demonstrate the effectiveness and trustworthiness of our method on various multimodal regression tasks.
arXiv Detail & Related papers (2021-11-11T14:28:12Z)
- Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness [61.827054365139645]
Variational Autoencoder (VAE) approximates the posterior of latent variables based on amortized variational inference.
We propose an alternative model, DU-VAE, for learning a more Diverse and less Uncertain latent space.
arXiv Detail & Related papers (2021-10-24T07:58:13Z)
- Improving Maximum Likelihood Training for Text Generation with Density Ratio Estimation [51.091890311312085]
We propose a new training scheme for auto-regressive sequence generative models, which is effective and stable when operating over the large sample space encountered in text generation.
Our method stably outperforms Maximum Likelihood Estimation and other state-of-the-art sequence generative models in terms of both quality and diversity.
arXiv Detail & Related papers (2020-07-12T15:31:24Z)
- Decision-Making with Auto-Encoding Variational Bayes [71.44735417472043]
We show that a posterior approximation distinct from the variational distribution should be used for making decisions.
Motivated by these theoretical results, we propose learning several approximate proposals for the best model.
In addition to toy examples, we present a full-fledged case study of single-cell RNA sequencing.
arXiv Detail & Related papers (2020-02-17T19:23:36Z)