JaxWildfire: A GPU-Accelerated Wildfire Simulator for Reinforcement Learning
- URL: http://arxiv.org/abs/2512.06102v1
- Date: Fri, 05 Dec 2025 19:06:07 GMT
- Title: JaxWildfire: A GPU-Accelerated Wildfire Simulator for Reinforcement Learning
- Authors: Ufuk Çakır, Victor-Alexandru Darvariu, Bruno Lacerda, Nick Hawes,
- Abstract summary: We introduce $textttJaxWildfire$, a simulator underpinned by a principled probabilistic fire spread model based on cellular automata.<n>We demonstrate that $textttJaxWildfire$ achieves 6-35x speedup over existing software and enables gradient-based optimization of simulator parameters.<n>Our work is an important step towards enabling the advancement of RL techniques for managing natural hazards.
- Score: 17.674265727888063
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Artificial intelligence methods are increasingly being explored for managing wildfires and other natural hazards. In particular, reinforcement learning (RL) is a promising path towards improving outcomes in such uncertain decision-making scenarios and moving beyond reactive strategies. However, training RL agents requires many environment interactions, and the speed of existing wildfire simulators is a severely limiting factor. We introduce $\texttt{JaxWildfire}$, a simulator underpinned by a principled probabilistic fire spread model based on cellular automata. It is implemented in JAX and enables vectorized simulations using $\texttt{vmap}$, allowing high throughput of simulations on GPUs. We demonstrate that $\texttt{JaxWildfire}$ achieves 6-35x speedup over existing software and enables gradient-based optimization of simulator parameters. Furthermore, we show that $\texttt{JaxWildfire}$ can be used to train RL agents to learn wildfire suppression policies. Our work is an important step towards enabling the advancement of RL techniques for managing natural hazards.
Related papers
- Spatiotemporal Wildfire Prediction and Reinforcement Learning for Helitack Suppression [0.7734713569509623]
Wildfire are growing in frequency and intensity, causing billions of dollars in suppression costs and devastating economic damage annually in the U.S.<n>Traditional wildfire management is mostly reactive, addressing fires only after they are detected.<n>We introduce textitFireCastRL, a proactive AI framework that combines wildfire forecasting with intelligent suppression strategies.
arXiv Detail & Related papers (2026-01-20T18:50:12Z) - Autonomous Vehicle Path Planning by Searching With Differentiable Simulation [55.46735086899153]
Planning allows an agent to safely refine its actions before executing them in the real world.<n>In autonomous driving, this is crucial to avoid collisions and navigate in complex, dense traffic scenarios.<n>Here we propose Differentiable Simulation for Search (DSS), a framework that leverages the differentiable simulator Waymax as both a next state predictor and a critic.
arXiv Detail & Related papers (2025-11-14T07:56:34Z) - Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning [49.48615590763914]
We propose a black-box attack algorithm named LCBT, which uses the Monte Carlo tree search method for efficient action searching and manipulation.
We conduct our proposed attack methods on three aggressive algorithms: DDPG, PPO, and TD3 in continuous settings, which show a promising attack performance.
arXiv Detail & Related papers (2024-11-20T08:20:29Z) - Autonomous Vehicle Controllers From End-to-End Differentiable Simulation [57.278726604424556]
We propose a differentiable simulator and design an analytic policy gradients (APG) approach to training AV controllers.<n>Our proposed framework brings the differentiable simulator into an end-to-end training loop, where gradients of environment dynamics serve as a useful prior to help the agent learn a more grounded policy.<n>We find significant improvements in performance and robustness to noise in the dynamics, as well as overall more intuitive human-like handling.
arXiv Detail & Related papers (2024-09-12T11:50:06Z) - ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable [88.08120417169971]
Machine learning based autonomous driving systems often face challenges with safety-critical scenarios that are rare in real-world data.
This work explores generating safety-critical driving scenarios by modifying complex real-world regular scenarios through trajectory optimization.
Our approach addresses unrealistic diverging trajectories and unavoidable collision scenarios that are not useful for training robust planner.
arXiv Detail & Related papers (2024-09-12T08:26:33Z) - The Power of Resets in Online Reinforcement Learning [73.64852266145387]
We explore the power of simulators through online reinforcement learning with local simulator access (or, local planning)
We show that MDPs with low coverability can be learned in a sample-efficient fashion with only $Qstar$-realizability.
We show that the notorious Exogenous Block MDP problem is tractable under local simulator access.
arXiv Detail & Related papers (2024-04-23T18:09:53Z) - GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes
and Minimalist Workflow [24.789118651720045]
We introduce an autonomous driving simulator with photorealistic scenes.
The simulator is able to communicate with external algorithms through ROS2 or Socket.IO.
We implement a highly accurate vehicle dynamics model within the simulator to enhance the realism of the vehicle's physical effects.
arXiv Detail & Related papers (2024-01-28T23:26:15Z) - Reinforcement Learning for Wildfire Mitigation in Simulated Disaster
Environments [39.014859667729375]
Wildfires pose a threat to life, property, ecology, cultural heritage, and critical infrastructure.
SimFire is a versatile wildland fire projection simulator designed to generate realistic wildfire scenarios.
SimHarness is a modular agent-based machine learning wrapper capable of automatically generating land management strategies.
arXiv Detail & Related papers (2023-11-27T15:37:05Z) - Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous
Driving Research [76.93956925360638]
Waymax is a new data-driven simulator for autonomous driving in multi-agent scenes.
It runs entirely on hardware accelerators such as TPUs/GPUs and supports in-graph simulation for training.
We benchmark a suite of popular imitation and reinforcement learning algorithms with ablation studies on different design decisions.
arXiv Detail & Related papers (2023-10-12T20:49:15Z) - JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large
scale reinforcement learning for trading [8.884142720013081]
Financial exchanges use limit order books (LOBs) to process orders and match trades.
For many applications, there is a requirement for processing multiple books, either for the calibration of ABMs or for the training of RL agents.
We showcase the first GPU-enabled LOB simulator designed to process thousands of books in parallel, with a notably reduced per-message processing time.
arXiv Detail & Related papers (2023-08-25T10:26:43Z) - FORFIS: A forest fire firefighting simulation tool for education and
research [90.40304110009733]
We present a forest fire firefighting simulation tool named FORFIS that is implemented in Python.
Our tool is published underv3 license and comes with a GUI as well as additional output functionality.
arXiv Detail & Related papers (2023-05-29T09:14:38Z) - Uncertainty Aware Wildfire Management [6.997483623023005]
Recent wildfires in the United States have resulted in loss of life and billions of dollars.
There are limited resources to be deployed over a massive area and the spread of the fire is challenging to predict.
This paper proposes a decision-theoretic approach to combat wildfires.
arXiv Detail & Related papers (2020-10-15T17:47:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.