Sim2Real for Environmental Neural Processes
- URL: http://arxiv.org/abs/2310.19932v1
- Date: Mon, 30 Oct 2023 18:49:06 GMT
- Title: Sim2Real for Environmental Neural Processes
- Authors: Jonas Scholz, Tom R. Andersson, Anna Vaughan, James Requeima, Richard
E. Turner
- Abstract summary: We analyse 'Sim2Real': pre-training on reanalysis and fine-tuning on observational data.
Sim2Real could enable more accurate models for weather prediction and climate monitoring.
- Score: 20.850715955359593
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine learning (ML)-based weather models have recently undergone rapid
improvements. These models are typically trained on gridded reanalysis data
from numerical data assimilation systems. However, reanalysis data comes with
limitations, such as assumptions about physical laws and low spatiotemporal
resolution. The gap between reanalysis and reality has sparked growing interest
in training ML models directly on observations such as weather stations.
Modelling scattered and sparse environmental observations requires scalable and
flexible ML architectures, one of which is the convolutional conditional neural
process (ConvCNP). ConvCNPs can learn to condition on both gridded and
off-the-grid context data to make uncertainty-aware predictions at target
locations. However, the sparsity of real observations presents a challenge for
data-hungry deep learning models like the ConvCNP. One potential solution is
'Sim2Real': pre-training on reanalysis and fine-tuning on observational data.
We analyse Sim2Real with a ConvCNP trained to interpolate surface air
temperature over Germany, using varying numbers of weather stations for
fine-tuning. On held-out weather stations, Sim2Real training substantially
outperforms the same model architecture trained only with reanalysis data or
only with station data, showing that reanalysis data can serve as a stepping
stone for learning from real observations. Sim2Real could thus enable more
accurate models for weather prediction and climate monitoring.
Related papers
- On conditional diffusion models for PDE simulations [53.01911265639582]
We study score-based diffusion models for forecasting and assimilation of sparse observations.
We propose an autoregressive sampling approach that significantly improves performance in forecasting.
We also propose a new training strategy for conditional score-based models that achieves stable performance over a range of history lengths.
arXiv Detail & Related papers (2024-10-21T18:31:04Z) - Generative Data Assimilation of Sparse Weather Station Observations at Kilometer Scales [5.453657018459705]
We demonstrate the viability of score-based data assimilation in the context of realistically complex km-scale weather.
By incorporating observations from 40 weather stations, 10% lower RMSEs on left-out stations are attained.
It is a ripe time to explore extensions that combine increasingly ambitious regional state generators with an increasing set of in situ, ground-based, and satellite remote sensing data streams.
arXiv Detail & Related papers (2024-06-19T10:28:11Z) - Bridging the Sim-to-Real Gap with Bayesian Inference [53.61496586090384]
We present SIM-FSVGD for learning robot dynamics from data.
We use low-fidelity physical priors to regularize the training of neural network models.
We demonstrate the effectiveness of SIM-FSVGD in bridging the sim-to-real gap on a high-performance RC racecar system.
arXiv Detail & Related papers (2024-03-25T11:29:32Z) - Grounding Graph Network Simulators using Physical Sensor Observations [12.017054986629846]
We integrate sensory information to ground Graph Network Simulators on real world observations.
We predict the mesh state of deformable objects by utilizing point cloud data.
arXiv Detail & Related papers (2023-02-23T09:06:42Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - Differentiable Physics-based Greenhouse Simulation [4.420086316176459]
The model is fully interpretable and is able to do state prediction for both climate and crop dynamics in the greenhouse over a long time horizon.
We propose a procedure to solve the differential equations, handle the problem of missing unobservable states in the data, and train the model efficiently.
arXiv Detail & Related papers (2022-11-21T14:37:25Z) - Learning to Simulate Realistic LiDARs [66.7519667383175]
We introduce a pipeline for data-driven simulation of a realistic LiDAR sensor.
We show that our model can learn to encode realistic effects such as dropped points on transparent surfaces.
We use our technique to learn models of two distinct LiDAR sensors and use them to improve simulated LiDAR data accordingly.
arXiv Detail & Related papers (2022-09-22T13:12:54Z) - Long-term stability and generalization of observationally-constrained
stochastic data-driven models for geophysical turbulence [0.19686770963118383]
Deep learning models can mitigate certain biases in current state-of-the-art weather models.
Data-driven models require a lot of training data which may not be available from reanalysis (observational data) products.
deterministic data-driven forecasting models suffer from issues with long-term stability and unphysical climate drift.
We propose a convolutional variational autoencoder-based data-driven model that is pre-trained on an imperfect climate model simulation.
arXiv Detail & Related papers (2022-05-09T23:52:37Z) - Likelihood-Free Inference in State-Space Models with Unknown Dynamics [71.94716503075645]
We introduce a method for inferring and predicting latent states in state-space models where observations can only be simulated, and transition dynamics are unknown.
We propose a way of doing likelihood-free inference (LFI) of states and state prediction with a limited number of simulations.
arXiv Detail & Related papers (2021-11-02T12:33:42Z) - Lidar Light Scattering Augmentation (LISA): Physics-based Simulation of
Adverse Weather Conditions for 3D Object Detection [60.89616629421904]
Lidar-based object detectors are critical parts of the 3D perception pipeline in autonomous navigation systems such as self-driving cars.
They are sensitive to adverse weather conditions such as rain, snow and fog due to reduced signal-to-noise ratio (SNR) and signal-to-background ratio (SBR)
arXiv Detail & Related papers (2021-07-14T21:10:47Z) - Data-driven geophysical forecasting: Simple, low-cost, and accurate
baselines with kernel methods [0.6875312133832078]
We show that when the kernel of these emulators is also learned from data, the resulting data-driven models are faster than equation-based models.
We see significant improvements over climatology and persistence based forecast techniques.
arXiv Detail & Related papers (2021-02-13T19:57:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.