California Wildfire Inventory (CAWFI): An Extensive Dataset for Predictive Techniques based on Artificial Intelligence
- URL: http://arxiv.org/abs/2509.11015v1
- Date: Sun, 14 Sep 2025 00:06:01 GMT
- Title: California Wildfire Inventory (CAWFI): An Extensive Dataset for Predictive Techniques based on Artificial Intelligence
- Authors: Rohan Tan Bhowmik, Youn Soo Jung, Juan Aguilera, Mary Prunicki, Kari Nadeau,
- Abstract summary: California Wildfire Inventory (CAWFI) is a wildfire database of over 37 million data points for building and training wildfire prediction solutions.<n>This dataset is intended to enable wildfire prediction research and solutions as well as set a precedent for future wildfire databases in other regions.
- Score: 0.05219568203653522
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Due to climate change and the disruption of ecosystems worldwide, wildfires are increasingly impacting environment, infrastructure, and human lives globally. Additionally, an exacerbating climate crisis means that these losses would continue to grow if preventative measures are not implemented. Though recent advancements in artificial intelligence enable wildfire management techniques, most deployed solutions focus on detecting wildfires after ignition. The development of predictive techniques with high accuracy requires extensive datasets to train machine learning models. This paper presents the California Wildfire Inventory (CAWFI), a wildfire database of over 37 million data points for building and training wildfire prediction solutions, thereby potentially preventing megafires and flash fires by addressing them before they spark. The dataset compiles daily historical California wildfire data from 2012 to 2018 and indicator data from 2012 to 2022. The indicator data consists of leading indicators (meteorological data correlating to wildfire-prone conditions), trailing indicators (environmental data correlating to prior and early wildfire activity), and geological indicators (vegetation and elevation data dictating wildfire risk and spread patterns). CAWFI has already demonstrated success when used to train a spatio-temporal artificial intelligence model, predicting 85.7% of future wildfires larger than 300,000 acres when trained on 2012-2017 indicator data. This dataset is intended to enable wildfire prediction research and solutions as well as set a precedent for future wildfire databases in other regions.
Related papers
- Modular Deep-Learning-Based Early Warning System for Deadly Heatwave Prediction [53.09098740555834]
DeepTherm is a modular early warning system for deadly heatwave prediction without requiring heat-related mortality history.<n>By highlighting the flexibility of deep learning, DeepTherm employs a dual-prediction pipeline, disentangling baseline mortality in the absence of heatwaves.<n>Results demonstrate consistent, robust, and accurate performance across diverse regions, time periods, and population groups while allowing trade-off between missed alarms and false alarms.
arXiv Detail & Related papers (2025-12-09T19:37:49Z) - FireScope: Wildfire Risk Prediction with a Chain-of-Thought Oracle [69.84129020970477]
Existing methods lack the causal reasoning and understanding required for reliable generalization.<n>We introduce $textbfFireScope-Bench, a dataset and benchmark that couples Sentinel-2 imagery and climate data with expert-defined risks.<n>When trained in the USA and tested in Europe, $textbfFireScope$ achieves substantial performance gains.<n>Our findings demonstrate that reasoning can ground prediction models, improving both generalization and interpretability.
arXiv Detail & Related papers (2025-11-21T11:45:22Z) - Deep Autoencoders for Unsupervised Anomaly Detection in Wildfire Prediction [42.447827727628734]
Wildfires pose a significantly increasing hazard to global ecosystems due to the climate crisis.
Due to its complex nature, there is an urgent need for innovative approaches to wildfire prediction, such as machine learning.
This research took a unique approach, differentiating from classical supervised learning, and addressed the gap in unsupervised wildfire prediction.
arXiv Detail & Related papers (2024-11-14T23:19:55Z) - Back to the Future: GNN-based NO$_2$ Forecasting via Future Covariates [49.93577170464313]
We deal with air quality observations in a city-wide network of ground monitoring stations.
We propose a conditioning block that embeds past and future covariates into the current observations.
We find that conditioning on future weather information has a greater impact than considering past traffic conditions.
arXiv Detail & Related papers (2024-04-08T09:13:16Z) - A Synergistic Approach to Wildfire Prevention and Management Using AI, ML, and 5G Technology in the United States [44.99833362998488]
This research investigates proactive methods for detecting and handling wildfires in the United States.
The specific objective of this research covers proactive detection and prevention of wildfires using advanced technology.
Various methods, such as AI-enabled remote sensing and 5G-based active monitoring, can enhance proactive wildfire detection and management.
arXiv Detail & Related papers (2024-02-27T04:09:30Z) - AI-based Wildfire Prevention, Detection and Suppression System [0.0]
Wildfire Prevention, Detection and Suppression System (WPDSS) is a novel, fully automated, end to end, AI based solution to effectively predict hotspots and detect wildfires.
WPDSS will reduce the impacts of climate change, protect ecosystems and biodiversity, avert huge economic losses, and save human lives.
The power of WPDSS developed can be applied to any location globally to prevent and suppress wildfires, reducing climate change.
arXiv Detail & Related papers (2023-12-12T05:18:23Z) - Generative Algorithms for Fusion of Physics-Based Wildfire Spread Models
with Satellite Data for Initializing Wildfire Forecasts [0.0]
Recent progress in using satellites to detect fire locations provides the opportunity to use measurements to improve fire spread forecasts.
This work develops a method for inferring the history of a wildfire from satellite measurements.
arXiv Detail & Related papers (2023-09-05T23:24:34Z) - GraphCast: Learning skillful medium-range global weather forecasting [107.40054095223779]
We introduce a machine learning-based method called "GraphCast", which can be trained directly from reanalysis data.
It predicts hundreds of weather variables, over 10 days at 0.25 degree resolution globally, in under one minute.
We show that GraphCast significantly outperforms the most accurate operational deterministic systems on 90% of 1380 verification targets.
arXiv Detail & Related papers (2022-12-24T18:15:39Z) - Image-Based Fire Detection in Industrial Environments with YOLOv4 [53.180678723280145]
This work looks into the potential of AI to detect and recognize fires and reduce detection time using object detection on an image stream.
To our end, we collected and labeled appropriate data from several public sources, which have been used to train and evaluate several models based on the popular YOLOv4 object detector.
arXiv Detail & Related papers (2022-12-09T11:32:36Z) - Deep Learning for Global Wildfire Forecasting [1.6929753878977016]
We create a global fire dataset and demonstrate a prototype for predicting the presence of global burned areas on a sub-seasonal scale.
We present an open-access global analysis-ready datacube, which contains a variety of variables related to the seasonal and sub-seasonal fire drivers.
We train a deep learning model, which treats global wildfire forecasting as an image segmentation task and skillfully predicts the presence of burned areas 8, 16, 32 and 64 days ahead of time.
arXiv Detail & Related papers (2022-11-01T15:39:01Z) - A Multi-Modal Wildfire Prediction and Personalized Early-Warning System
Based on a Novel Machine Learning Framework [0.0]
California's 2018 wildfire season caused damages of $148.5 billion.
Among millions impacted people, those living with disabilities are disproportionately impacted due to inadequate means of alerts.
In this project, a multi-modal wildfire prediction and personalized early warning system has been developed.
arXiv Detail & Related papers (2022-08-18T22:03:32Z) - Preliminary Wildfire Detection Using State-of-the-art PTZ (Pan, Tilt,
Zoom) Camera Technology and Convolutional Neural Networks [0.0]
Wildfires are uncontrolled fires in the environment that can be caused by humans or nature.
In 2020 alone, wildfires in California have burned 4.2 million acres, damaged 10,500 buildings or structures, and killed more than 31 people.
The objective of the research is to detect forest fires in their earlier stages to prevent them from spreading.
arXiv Detail & Related papers (2021-09-10T19:30:37Z) - From Static to Dynamic Prediction: Wildfire Risk Assessment Based on
Multiple Environmental Factors [69.9674326582747]
Wildfire is one of the biggest disasters that frequently occurs on the west coast of the United States.
We propose static and dynamic prediction models to analyze and assess the areas with high wildfire risks in California.
arXiv Detail & Related papers (2021-03-14T17:56:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.