ClimateLearn: Benchmarking Machine Learning for Weather and Climate
Modeling
- URL: http://arxiv.org/abs/2307.01909v1
- Date: Tue, 4 Jul 2023 20:36:01 GMT
- Title: ClimateLearn: Benchmarking Machine Learning for Weather and Climate
Modeling
- Authors: Tung Nguyen, Jason Jewik, Hritik Bansal, Prakhar Sharma, Aditya Grover
- Abstract summary: ClimateLearn is an open-source library that vastly simplifies the training and evaluation of machine learning models for data-driven climate science.
It is the first large-scale, open-source effort for bridging research in weather and climate modeling with modern machine learning systems.
- Score: 20.63843548201849
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Modeling weather and climate is an essential endeavor to understand the near-
and long-term impacts of climate change, as well as inform technology and
policymaking for adaptation and mitigation efforts. In recent years, there has
been a surging interest in applying data-driven methods based on machine
learning for solving core problems such as weather forecasting and climate
downscaling. Despite promising results, much of this progress has been impaired
due to the lack of large-scale, open-source efforts for reproducibility,
resulting in the use of inconsistent or underspecified datasets, training
setups, and evaluations by both domain scientists and artificial intelligence
researchers. We introduce ClimateLearn, an open-source PyTorch library that
vastly simplifies the training and evaluation of machine learning models for
data-driven climate science. ClimateLearn consists of holistic pipelines for
dataset processing (e.g., ERA5, CMIP6, PRISM), implementation of
state-of-the-art deep learning models (e.g., Transformers, ResNets), and
quantitative and qualitative evaluation for standard weather and climate
modeling tasks. We supplement these functionalities with extensive
documentation, contribution guides, and quickstart tutorials to expand access
and promote community growth. We have also performed comprehensive forecasting
and downscaling experiments to showcase the capabilities and key features of
our library. To our knowledge, ClimateLearn is the first large-scale,
open-source effort for bridging research in weather and climate modeling with
modern machine learning systems. Our library is available publicly at
https://github.com/aditya-grover/climate-learn.
Related papers
- Modeling chaotic Lorenz ODE System using Scientific Machine Learning [1.4633779950109127]
In this paper, we have integrated Scientific Machine Learning (SciML) methods into foundational weather models.
By combining the interpretability of physical climate models with the computational power of neural networks, SciML models can prove to be a reliable tool for modeling climate.
arXiv Detail & Related papers (2024-10-09T01:17:06Z) - Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region [62.09891513612252]
We focus on limited-area modeling and train our model specifically for localized region-level downstream tasks.
We consider the MENA region due to its unique climatic challenges, where accurate localized weather forecasting is crucial for managing water resources, agriculture and mitigating the impacts of extreme weather events.
Our study aims to validate the effectiveness of integrating parameter-efficient fine-tuning (PEFT) methodologies, specifically Low-Rank Adaptation (LoRA) and its variants, to enhance forecast accuracy, as well as training speed, computational resource utilization, and memory efficiency in weather and climate modeling for specific regions.
arXiv Detail & Related papers (2024-09-11T19:31:56Z) - Aurora: A Foundation Model of the Atmosphere [56.97266186291677]
We introduce Aurora, a large-scale foundation model of the atmosphere trained on over a million hours of diverse weather and climate data.
In under a minute, Aurora produces 5-day global air pollution predictions and 10-day high-resolution weather forecasts.
arXiv Detail & Related papers (2024-05-20T14:45:18Z) - Foundation Models for Weather and Climate Data Understanding: A
Comprehensive Survey [39.08108001903514]
We offer an exhaustive, timely overview of state-of-the-art AI methodologies specifically engineered for weather and climate data.
Our primary coverage encompasses four critical aspects: types of weather and climate data, principal model, model scopes and applications, and datasets for weather and climate.
arXiv Detail & Related papers (2023-12-05T01:10:54Z) - ClimateSet: A Large-Scale Climate Model Dataset for Machine Learning [26.151056828513962]
Climate models have been key for assessing the impact of climate change and simulating future climate scenarios.
The machine learning (ML) community has taken an increased interest in supporting climate scientists' efforts on various tasks such as climate model emulation, downscaling, and prediction tasks.
Here, we introduce ClimateSet, a dataset containing the inputs and outputs of 36 climate models from the Input4MIPs and CMIP6 archives.
arXiv Detail & Related papers (2023-11-07T04:55:36Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - Measuring the Carbon Intensity of AI in Cloud Instances [91.28501520271972]
We provide a framework for measuring software carbon intensity, and propose to measure operational carbon emissions.
We evaluate a suite of approaches for reducing emissions on the Microsoft Azure cloud compute platform.
arXiv Detail & Related papers (2022-06-10T17:04:04Z) - SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines.
This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z) - Climate-Invariant Machine Learning [0.8831201550856289]
Current climate models require representations of processes that occur at scales smaller than model grid size.
Recent machine learning (ML) algorithms hold promise to improve such process representations, but tend to extrapolate poorly to climate regimes they were not trained on.
We propose a new framework - termed "climate-invariant" ML - incorporating knowledge of climate processes into ML algorithms.
arXiv Detail & Related papers (2021-12-14T07:02:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.