Calibrating Geophysical Predictions under Constrained Probabilistic Distributions
- URL: http://arxiv.org/abs/2512.03081v1
- Date: Fri, 28 Nov 2025 07:15:40 GMT
- Title: Calibrating Geophysical Predictions under Constrained Probabilistic Distributions
- Authors: Zhewen Hou, Jiajin Sun, Subashree Venkatasubramanian, Peter Jin, Shuolin Li, Tian Zheng,
- Abstract summary: We introduce a calibration algorithm based on normalization and the Kernelized Stein Discrepancy (KSD) to enhance machine learning predictions.<n>This not only sharpens pointwise predictions but also enforces consistency with non-local statistical structures rooted in physical principles.
- Score: 4.760743517243988
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning (ML) has shown significant promise in studying complex geophysical dynamical systems, including turbulence and climate processes. Such systems often display sensitive dependence on initial conditions, reflected in positive Lyapunov exponents, where even small perturbations in short-term forecasts can lead to large deviations in long-term outcomes. Thus, meaningful inference requires not only accurate short-term predictions, but also consistency with the system's long-term attractor that is captured by the marginal distribution of state variables. Existing approaches attempt to address this challenge by incorporating spatial and temporal dependence, but these strategies become impractical when data are extremely sparse. In this work, we show that prior knowledge of marginal distributions offers valuable complementary information to short-term observations, motivating a distribution-informed learning framework. We introduce a calibration algorithm based on normalization and the Kernelized Stein Discrepancy (KSD) to enhance ML predictions. The method here employs KSD within a reproducing kernel Hilbert space to calibrate model outputs, improving their fidelity to known physical distributions. This not only sharpens pointwise predictions but also enforces consistency with non-local statistical structures rooted in physical principles. Through synthetic experiments-spanning offline climatological CO2 fluxes and online quasi-geostrophic flow simulations-we demonstrate the robustness and broad utility of the proposed framework.
Related papers
- GTS: Inference-Time Scaling of Latent Reasoning with a Learnable Gaussian Thought Sampler [54.10960908347221]
We model latent thought exploration as conditional sampling from learnable densities and instantiate this idea as a Gaussian Thought Sampler (GTS)<n>GTS predicts context-dependent perturbation distributions over continuous reasoning states and is trained with GRPO-style policy optimization while keeping the backbone frozen.
arXiv Detail & Related papers (2026-02-15T09:57:47Z) - Deep classifier kriging for probabilistic spatial prediction of air quality index [16.289713160499385]
textitdeep classifier kriging (DCK) is a flexible, distribution-free deep learning framework for estimating full predictive distribution functions.<n>We show that DCK consistently outperforms conventional approaches in predictive accuracy and uncertainty quantification.
arXiv Detail & Related papers (2025-12-29T13:58:34Z) - Physics-Embedded Gaussian Process for Traffic State Estimation [15.0051892692122]
Physical models have difficulty integrating uncertainties and capturing the real complexity of traffic.<n>PEGP is designed to integrate domain knowledge with data-driven methods in traffic state estimation.<n> Experiments on HighD, NGSIM show consistent improvements over non-physics baselines.
arXiv Detail & Related papers (2025-12-03T17:43:40Z) - Predicting partially observable dynamical systems via diffusion models with a multiscale inference scheme [43.7203026580194]
In this paper, we tackle the probabilistic prediction of partially observable, long-memory dynamical systems.<n>We show that standard inference schemes, such as autoregressive rollouts, fail to capture long-range dependencies in the data.<n>Our method generates trajectories that are temporally fine-grained near the present and coarser as we move farther away.
arXiv Detail & Related papers (2025-11-24T18:30:04Z) - Bridging the Gap Between Bayesian Deep Learning and Ensemble Weather Forecasts [100.26854618129039]
Weather forecasting is fundamentally challenged by the chaotic nature of the atmosphere.<n>Recent advances in Bayesian Deep Learning (BDL) offer a promising but often disconnected alternative.<n>We bridge these paradigms through a unified hybrid BDL framework for ensemble weather forecasting.
arXiv Detail & Related papers (2025-11-18T07:49:52Z) - Predicting Forced Responses of Probability Distributions via the Fluctuation-Dissipation Theorem and Generative Modeling [0.0]
We present a data-driven framework for estimating the response of higher-order moments of nonlinear systems to small external perturbations.<n>We combine GFDT with score-based generative modeling to estimate the system's score function directly from data.<n>Our method is validated on several models relevant to climate dynamics.
arXiv Detail & Related papers (2025-04-17T20:54:33Z) - Spatial Conformal Inference through Localized Quantile Regression [6.992239210938067]
Conformal prediction offers valid prediction intervals without relying on parametric assumptions.<n>We propose Localized Spatial Conformal Prediction (L SCP), a conformal prediction method designed specifically for spatial data.<n>L SCP achieves accurate coverage with significantly tighter and more consistent prediction intervals compared to existing methods.
arXiv Detail & Related papers (2024-12-02T04:15:06Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - Enforcing the Principle of Locality for Physical Simulations with Neural Operators [0.0]
Time-dependent partial differential equations (PDEs) are strictly local-dependent according to the principle of locality in physics.<n>Deep learning architecture cannot strictly enforce the local-dependency as it inevitably increases the scope of information to make local predictions.<n>This paper proposes a data decomposition method to strictly limit the scope of information for neural operators making local predictions.
arXiv Detail & Related papers (2024-05-02T14:24:56Z) - Uncertainty Quantification for Forward and Inverse Problems of PDEs via
Latent Global Evolution [110.99891169486366]
We propose a method that integrates efficient and precise uncertainty quantification into a deep learning-based surrogate model.
Our method endows deep learning-based surrogate models with robust and efficient uncertainty quantification capabilities for both forward and inverse problems.
Our method excels at propagating uncertainty over extended auto-regressive rollouts, making it suitable for scenarios involving long-term predictions.
arXiv Detail & Related papers (2024-02-13T11:22:59Z) - Machine learning in and out of equilibrium [58.88325379746631]
Our study uses a Fokker-Planck approach, adapted from statistical physics, to explore these parallels.
We focus in particular on the stationary state of the system in the long-time limit, which in conventional SGD is out of equilibrium.
We propose a new variation of Langevin dynamics (SGLD) that harnesses without replacement minibatching.
arXiv Detail & Related papers (2023-06-06T09:12:49Z) - Stochastically forced ensemble dynamic mode decomposition for
forecasting and analysis of near-periodic systems [65.44033635330604]
We introduce a novel load forecasting method in which observed dynamics are modeled as a forced linear system.
We show that its use of intrinsic linear dynamics offers a number of desirable properties in terms of interpretability and parsimony.
Results are presented for a test case using load data from an electrical grid.
arXiv Detail & Related papers (2020-10-08T20:25:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.