Modeling Atmospheric Data and Identifying Dynamics: Temporal Data-Driven
Modeling of Air Pollutants
- URL: http://arxiv.org/abs/2010.06538v2
- Date: Tue, 6 Jul 2021 20:29:14 GMT
- Title: Modeling Atmospheric Data and Identifying Dynamics: Temporal Data-Driven
Modeling of Air Pollutants
- Authors: Javier Rubio-Herrero, Carlos Ortiz Marrero, Wai-Tong Louis Fan
- Abstract summary: We present an empirical approach using data-driven techniques to study air quality in Madrid.
We find parsimonious systems of ordinary differential equations that model the concentration of pollutants and their changes over time.
Our results show that Akaike's Information Criterion can work well in conjunction with best subset regression as to find an equilibrium between sparsity and goodness of fit.
- Score: 2.578242050187029
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Atmospheric modeling has recently experienced a surge with the advent of deep
learning. Most of these models, however, predict concentrations of pollutants
following a data-driven approach in which the physical laws that govern their
behaviors and relationships remain hidden. With the aid of real-world air
quality data collected hourly in different stations throughout Madrid, we
present an empirical approach using data-driven techniques with the following
goals: (1) Find parsimonious systems of ordinary differential equations via
sparse identification of nonlinear dynamics (SINDy) that model the
concentration of pollutants and their changes over time; (2) assess the
performance and limitations of our models using stability analysis; (3)
reconstruct the time series of chemical pollutants not measured in certain
stations using delay coordinate embedding results. Our results show that
Akaike's Information Criterion can work well in conjunction with best subset
regression as to find an equilibrium between sparsity and goodness of fit. We
also find that, due to the complexity of the chemical system under study,
identifying the dynamics of this system over longer periods of time require
higher levels of data filtering and smoothing. Stability analysis for the
reconstructed ordinary differential equations (ODEs) reveals that more than
half of the physically relevant critical points are saddle points, suggesting
that the system is unstable even under the idealized assumption that all
environmental conditions are constant over time.
Related papers
- Air Quality Prediction with Physics-Informed Dual Neural ODEs in Open Systems [26.70737906860735]
Air pollution significantly threatens human health and ecosystems, necessitating effective air quality prediction to inform public policy.
Traditional approaches are generally categorized into physics-based and data-driven models.
We propose AirDualODE, a novel physics-informed approach that integrates dual branches of Neural temporalODE.
arXiv Detail & Related papers (2024-10-25T13:56:13Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - A Temporally Disentangled Contrastive Diffusion Model for Spatiotemporal Imputation [35.46631415365955]
We introduce a conditional diffusion framework called C$2$TSD, which incorporates disentangled temporal (trend and seasonality) representations as conditional information.
Our experiments on three real-world datasets demonstrate the superior performance of our approach compared to a number of state-of-the-art baselines.
arXiv Detail & Related papers (2024-02-18T11:59:04Z) - Long-term instabilities of deep learning-based digital twins of the
climate system: The cause and a solution [0.0]
Long-term stability is a critical property for deep learning-based data-driven digital twins of the Earth system.
We show how turbulence physics and the absence of convergence in deep learning-based time-integrators amplify this bias leading to unstable error propagation.
We develop long-term stable data-driven digital twins for the climate system and demonstrate accurate short-term forecasts, and hundreds of years of long-term stable time-integration with accurate mean and variability.
arXiv Detail & Related papers (2023-04-14T09:49:11Z) - Continuous-Time Modeling of Counterfactual Outcomes Using Neural
Controlled Differential Equations [84.42837346400151]
Estimating counterfactual outcomes over time has the potential to unlock personalized healthcare.
Existing causal inference approaches consider regular, discrete-time intervals between observations and treatment decisions.
We propose a controllable simulation environment based on a model of tumor growth for a range of scenarios.
arXiv Detail & Related papers (2022-06-16T17:15:15Z) - A Priori Denoising Strategies for Sparse Identification of Nonlinear
Dynamical Systems: A Comparative Study [68.8204255655161]
We investigate and compare the performance of several local and global smoothing techniques to a priori denoise the state measurements.
We show that, in general, global methods, which use the entire measurement data set, outperform local methods, which employ a neighboring data subset around a local point.
arXiv Detail & Related papers (2022-01-29T23:31:25Z) - Leveraging the structure of dynamical systems for data-driven modeling [111.45324708884813]
We consider the impact of the training set and its structure on the quality of the long-term prediction.
We show how an informed design of the training set, based on invariants of the system and the structure of the underlying attractor, significantly improves the resulting models.
arXiv Detail & Related papers (2021-12-15T20:09:20Z) - Chaos as an interpretable benchmark for forecasting and data-driven
modelling [7.6146285961466]
Chaotic systems pose a unique challenge to modern statistical learning techniques.
We present a database currently comprising 131 known chaotic dynamical systems spanning fields such as astrophysics, climatology, and biochemistry.
arXiv Detail & Related papers (2021-10-11T13:39:41Z) - Integrating Domain Knowledge in Data-driven Earth Observation with
Process Convolutions [13.13700072257046]
We argue that hybrid learning schemes that combine both approaches can address all these issues efficiently.
We specifically propose the use of a class of GP convolution models called latent force models (LFMs) for time series modelling.
We consider time series of soil moisture from active (ASCAT) and passive (SMOS, AMSR2) microwave satellites.
arXiv Detail & Related papers (2021-04-16T14:30:40Z) - Using Data Assimilation to Train a Hybrid Forecast System that Combines
Machine-Learning and Knowledge-Based Components [52.77024349608834]
We consider the problem of data-assisted forecasting of chaotic dynamical systems when the available data is noisy partial measurements.
We show that by using partial measurements of the state of the dynamical system, we can train a machine learning model to improve predictions made by an imperfect knowledge-based model.
arXiv Detail & Related papers (2021-02-15T19:56:48Z) - Stochastically forced ensemble dynamic mode decomposition for
forecasting and analysis of near-periodic systems [65.44033635330604]
We introduce a novel load forecasting method in which observed dynamics are modeled as a forced linear system.
We show that its use of intrinsic linear dynamics offers a number of desirable properties in terms of interpretability and parsimony.
Results are presented for a test case using load data from an electrical grid.
arXiv Detail & Related papers (2020-10-08T20:25:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.