DSAF: A Dual-Stage Adaptive Framework for Numerical Weather Prediction
Downscaling
- URL: http://arxiv.org/abs/2312.12476v1
- Date: Tue, 19 Dec 2023 13:13:17 GMT
- Title: DSAF: A Dual-Stage Adaptive Framework for Numerical Weather Prediction
Downscaling
- Authors: Pengwei Liu, Wenwei Wang, Bingqing Peng, Binqing Wu and Liang Sun
- Abstract summary: We propose a novel framework to address regional NWP downscaling and bias correction tasks.
Dual-Stage Adaptive Framework (DSAF) incorporates adaptive elements in its design to ensure a flexible response to evolving weather conditions.
- Score: 6.990912650604992
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: While widely recognized as one of the most substantial weather forecasting
methodologies, Numerical Weather Prediction (NWP) usually suffers from
relatively coarse resolution and inevitable bias due to tempo-spatial
discretization, physical parametrization process, and computation limitation.
With the roaring growth of deep learning-based techniques, we propose the
Dual-Stage Adaptive Framework (DSAF), a novel framework to address regional NWP
downscaling and bias correction tasks. DSAF uniquely incorporates adaptive
elements in its design to ensure a flexible response to evolving weather
conditions. Specifically, NWP downscaling and correction are well-decoupled in
the framework and can be applied independently, which strategically guides the
optimization trajectory of the model. Utilizing a multi-task learning mechanism
and an uncertainty-weighted loss function, DSAF facilitates balanced training
across various weather factors. Additionally, our specifically designed
attention-centric learnable module effectively integrates geographic
information, proficiently managing complex interrelationships. Experimental
validation on the ECMWF operational forecast (HRES) and reanalysis (ERA5)
archive demonstrates DSAF's superior performance over existing state-of-the-art
models and shows substantial improvements when existing models are augmented
using our proposed modules. Code is publicly available at
https://github.com/pengwei07/DSAF.
Related papers
- Demystifying Data-Driven Probabilistic Medium-Range Weather Forecasting [63.8116386935854]
We demonstrate that state-of-the-art probabilistic skill requires neither intricate architectural constraints nor specialized trainings.<n>We introduce a scalable framework for learning multi-scale atmospheric dynamics by combining a directly downsampled latent space with a history-conditioned local projector.<n>We find that our framework design is robust to the choice of probabilistic estimators, seamlessly supporting interpolants, diffusion models, and CRPS-based ensemble training.
arXiv Detail & Related papers (2026-01-26T03:52:16Z) - Steering Vision-Language Pre-trained Models for Incremental Face Presentation Attack Detection [62.89126207012712]
Face Presentation Attack Detection (PAD) demands incremental learning to combat spoofing tactics and domains.<n>Privacy regulations forbid retaining past data, necessitating rehearsal-free learning (RF-IL)
arXiv Detail & Related papers (2025-12-22T04:30:11Z) - Adapformer: Adaptive Channel Management for Multivariate Time Series Forecasting [49.40321003932633]
Adapformer is an advanced Transformer-based framework that merges the benefits of CI and CD methodologies through effective channel management.<n>Adapformer achieves superior performance over existing models, enhancing both predictive accuracy and computational efficiency.
arXiv Detail & Related papers (2025-11-18T16:24:05Z) - Abstain Mask Retain Core: Time Series Prediction by Adaptive Masking Loss with Representation Consistency [4.047219770183742]
Time series forecasting plays a pivotal role in critical domains such as energy management and financial markets.<n>This study reveals a counterintuitive phenomenon: appropriately truncating historical data can enhance prediction accuracy.<n>We propose an innovative solution termed Adaptive Masking Loss with Representation Consistency.
arXiv Detail & Related papers (2025-10-22T19:23:53Z) - Task-Adaptive Parameter-Efficient Fine-Tuning for Weather Foundation Models [28.79157031663951]
We introduce WeatherPEFT, a novel PEFT framework for Weather Foundation Models (WFMs)<n>Task-Tu Dynamic Prompting (TTu) injects the embedding weights within the encoder to the input tokens of the backbone of the pre-trained via internal and external pattern extraction.<n>Fisher-Guided Adaptive Selection (SFAS) identifies the most task-critical randomness, preserving in pre-trained knowledge, but also stabilizing the selection.<n>We demonstrate the effectiveness and efficiency of WeatherPEFT on three downstream tasks, where existing PEFT methods show significant gaps versus Full-Tuning, and WeatherPEFT performance parity
arXiv Detail & Related papers (2025-09-26T07:54:05Z) - Sycophancy Mitigation Through Reinforcement Learning with Uncertainty-Aware Adaptive Reasoning Trajectories [58.988535279557546]
We introduce textbf sycophancy Mitigation through Adaptive Reasoning Trajectories.<n>We show that SMART significantly reduces sycophantic behavior while preserving strong performance on out-of-distribution inputs.
arXiv Detail & Related papers (2025-09-20T17:09:14Z) - ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation [33.632587382356824]
We propose weather-generalized depth estimation using only a small amount of high-visibility (normal) data.<n>We introduce the Selecting-Tuning-Maintaining (STM) strategy, which structurally decomposes the pretrained weights of Vision Foundation Models.<n>In the tuning phase, we adaptively select the proper rank number as well as the task-aware singular directions.<n>While in the maintaining stage, we enforce a principal direction regularization based on the stable-rank.
arXiv Detail & Related papers (2025-08-31T02:24:00Z) - PI-WAN: A Physics-Informed Wind-Adaptive Network for Quadrotor Dynamics Prediction in Unknown Environments [3.4802474792943805]
We introduce the Physics-Informed Wind-Adaptive Network (PI-WAN), which embeds physical constraints directly into the training process for robust quadrotor dynamics learning.<n>Specifically, PI-WAN employs a Temporal Convolutional Network (TCN) architecture that efficiently captures temporal dependencies from historical flight data.<n>By incorporating real-time prediction results into a model predictive control (MPC) framework, we achieve improvements in closed-loop tracking performance.
arXiv Detail & Related papers (2025-07-01T14:48:22Z) - Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting [52.6508222408558]
We introduce Elucidated Rolling Diffusion Models (ERDM)<n>ERDM is the first framework to unify a rolling forecast structure with the principled, performant design of Elucidated Diffusion Models (EDM)<n>On 2D Navier-Stokes simulations and ERA5 global weather forecasting at 1.5circ resolution, ERDM consistently outperforms key diffusion-based baselines.
arXiv Detail & Related papers (2025-06-24T21:44:31Z) - Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections [65.36449542323277]
We present a unified theoretical framework bridgingSupervised Fine-Tuning (SFT) and preference learning in Large Language Model (LLM) post-training.<n>We propose a simple yet effective learning rate reduction approach that yields significant performance improvements.
arXiv Detail & Related papers (2025-06-15T05:42:29Z) - World Models as Reference Trajectories for Rapid Motor Adaptation [0.0]
Reflexive World Models (RWM) is a dual control framework that uses world model predictions as implicit reference trajectories for rapid adaptation.<n>Our method separates the control problem into long-term reward through reinforcement learning and robust motor execution.
arXiv Detail & Related papers (2025-05-21T14:46:41Z) - DRAN: A Distribution and Relation Adaptive Network for Spatio-temporal Forecasting [19.064628208136273]
We propose a Relation and Adaptive Network Distribution (DRAN) capable of adapting to and distribution changes over time.
We show that our SFL efficiently preserves spatial relationships across various temporal operations.
Our approach outperforms state of the art methods in weather prediction and traffic forecasting flows.
arXiv Detail & Related papers (2025-04-02T09:18:43Z) - ClimateLLM: Efficient Weather Forecasting via Frequency-Aware Large Language Models [13.740208247043258]
We propose ClimateLLM, a foundation model for weather forecasting.
It captures temporal dependencies via a cross-temporal and cross-spatial collaborative framework.
It integrates frequency decomposition with Large Language Models to strengthen spatial and temporal modeling.
arXiv Detail & Related papers (2025-02-16T09:57:50Z) - Context-Conditioned Spatio-Temporal Predictive Learning for Reliable V2V Channel Prediction [25.688521281119037]
Vehicle-to-Vehicle (V2V) channel state information (CSI) prediction is challenging and crucial for optimizing downstream tasks.
Traditional prediction approaches focus on four-dimensional (4D) CSI, which includes predictions over time, bandwidth, and antenna (TX and RX) space.
We propose a novel context-conditionedtemporal predictive learning method to capture dependencies within 4D CSI data.
arXiv Detail & Related papers (2024-09-16T04:15:36Z) - Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region [62.09891513612252]
We focus on limited-area modeling and train our model specifically for localized region-level downstream tasks.
We consider the MENA region due to its unique climatic challenges, where accurate localized weather forecasting is crucial for managing water resources, agriculture and mitigating the impacts of extreme weather events.
Our study aims to validate the effectiveness of integrating parameter-efficient fine-tuning (PEFT) methodologies, specifically Low-Rank Adaptation (LoRA) and its variants, to enhance forecast accuracy, as well as training speed, computational resource utilization, and memory efficiency in weather and climate modeling for specific regions.
arXiv Detail & Related papers (2024-09-11T19:31:56Z) - A Novel Denoising Technique and Deep Learning Based Hybrid Wind Speed Forecasting Model for Variable Terrain Conditions [2.531156266686649]
This paper presents a novel and adaptive model for short-term forecasting of wind speed (WS)
It has achieved the lowest variance in terms of forecasting accuracy between simple and complex terrain conditions 0.70%.
arXiv Detail & Related papers (2024-08-28T06:07:58Z) - SFANet: Spatial-Frequency Attention Network for Weather Forecasting [54.470205739015434]
Weather forecasting plays a critical role in various sectors, driving decision-making and risk management.
Traditional methods often struggle to capture the complex dynamics of meteorological systems.
We propose a novel framework designed to address these challenges and enhance the accuracy of weather prediction.
arXiv Detail & Related papers (2024-05-29T08:00:15Z) - UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather [55.95708988160047]
LiDAR semantic segmentation (LSS) is a critical task in autonomous driving.
Prior LSS methods are investigated and evaluated on datasets within the same domain in clear weather.
We propose UniMix, a universal method that enhances the adaptability and generalizability of LSS models.
arXiv Detail & Related papers (2024-04-08T02:02:15Z) - Unleashing Network Potentials for Semantic Scene Completion [50.95486458217653]
This paper proposes a novel SSC framework - Adrial Modality Modulation Network (AMMNet)
AMMNet introduces two core modules: a cross-modal modulation enabling the interdependence of gradient flows between modalities, and a customized adversarial training scheme leveraging dynamic gradient competition.
Extensive experimental results demonstrate that AMMNet outperforms state-of-the-art SSC methods by a large margin.
arXiv Detail & Related papers (2024-03-12T11:48:49Z) - InferAligner: Inference-Time Alignment for Harmlessness through
Cross-Model Guidance [56.184255657175335]
We develop textbfInferAligner, a novel inference-time alignment method that utilizes cross-model guidance for harmlessness alignment.
Experimental results show that our method can be very effectively applied to domain-specific models in finance, medicine, and mathematics.
It significantly diminishes the Attack Success Rate (ASR) of both harmful instructions and jailbreak attacks, while maintaining almost unchanged performance in downstream tasks.
arXiv Detail & Related papers (2024-01-20T10:41:03Z) - Learn from the Past: A Proxy Guided Adversarial Defense Framework with
Self Distillation Regularization [53.04697800214848]
Adversarial Training (AT) is pivotal in fortifying the robustness of deep learning models.
AT methods, relying on direct iterative updates for target model's defense, frequently encounter obstacles such as unstable training and catastrophic overfitting.
We present a general proxy guided defense framework, LAST' (bf Learn from the Pbf ast)
arXiv Detail & Related papers (2023-10-19T13:13:41Z) - DA-LSTM: A Dynamic Drift-Adaptive Learning Framework for Interval Load
Forecasting with LSTM Networks [1.3342521220589318]
A drift magnitude threshold should be defined to design change detection methods to identify drifts.
We propose a dynamic drift-adaptive Long Short-Term Memory (DA-LSTM) framework that can improve the performance of load forecasting models.
arXiv Detail & Related papers (2023-05-15T16:26:03Z) - Optimized ensemble deep learning framework for scalable forecasting of
dynamics containing extreme events [0.0]
Two machine learning techniques are jointly used to achieve synergistic improvements in model accuracy, stability, scalability, and prompting a new wave of applications in the forecasting of dynamics.
The proposed OEDL model based on a best convex combination of feed-forward neural networks, reservoir computing, and long short-term memory can play a key role in advancing predictions of dynamics consisting of extreme events.
arXiv Detail & Related papers (2021-06-09T10:59:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.