Related papers: FireScope: Wildfire Risk Prediction with a Chain-of-Thought Oracle

FireScope: Wildfire Risk Prediction with a Chain-of-Thought Oracle

URL: http://arxiv.org/abs/2511.17171v1
Date: Fri, 21 Nov 2025 11:45:22 GMT
Title: FireScope: Wildfire Risk Prediction with a Chain-of-Thought Oracle
Authors: Mario Markov, Stefan Maria Ailuro, Luc Van Gool, Konrad Schindler, Danda Pani Paudel,
Abstract summary: Existing methods lack the causal reasoning and understanding required for reliable generalization.<n>We introduce $textbfFireScope-Bench, a dataset and benchmark that couples Sentinel-2 imagery and climate data with expert-defined risks.<n>When trained in the USA and tested in Europe, $textbfFireScope$ achieves substantial performance gains.<n>Our findings demonstrate that reasoning can ground prediction models, improving both generalization and interpretability.
Score: 69.84129020970477
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Predicting wildfire risk is a reasoning-intensive spatial problem that requires the integration of visual, climatic, and geographic factors to infer continuous risk maps. Existing methods lack the causal reasoning and multimodal understanding required for reliable generalization. We introduce $\textbf{FireScope-Bench}$, a large-scale dataset and benchmark that couples Sentinel-2 imagery and climate data with expert-defined risk rasters across the USA, and real wildfire events in Europe for cross-continental evaluation. Building on this dataset, we propose $\textbf{FireScope}$, a VLM-based reasoning-to-generation framework that learns from both reinforcement learning and visual supervision to predict risk rasters with complementary reasoning traces. When trained in the USA and tested in Europe, $\textbf{FireScope}$ achieves substantial performance gains, while expert feedback and automated analysis confirm that its reasoning traces are faithful and semantically meaningful. Our findings demonstrate that reasoning can ground raster prediction models, improving both generalization and interpretability. To our knowledge, this is the first framework to (1) demonstrate that language-based reasoning can improve generalization in visual generation, (2) propose a high-resolution wildfire risk model that can be applied across continents, and (3) enable systematic studies of robust cross-continental generalization for multimodal fire risk models. We believe that $\textbf{FireScope-Bench}$ has the potential to serve as a foundation for advancing reasoning-driven, interpretable and generalizable spatial modeling. Data and source code will be made publicly available.

Related papers

Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation [50.87199039334856]
Retrieval-Augmented Generation (RAG) has become a cornerstone of knowledge-intensive applications.<n>Recent studies show that knowledge-extraction attacks can recover sensitive knowledge-base content through maliciously crafted queries.<n>We introduce the first systematic benchmark for knowledge-extraction attacks on RAG systems.
arXiv Detail & Related papers (2026-02-10T01:27:46Z)
WildfireVLM: AI-powered Analysis for Early Wildfire Detection and Risk Assessment Using Satellite Imagery [1.0907929392898186]
Wildfires are a growing threat to ecosystems, human lives, and infrastructure.<n>We introduce WildfireVLM, an AI framework that combines satellite imagery wildfire detection with language-driven risk assessment.
arXiv Detail & Related papers (2026-02-09T19:40:50Z)
Semantically Aware UAV Landing Site Assessment from Remote Sensing Imagery via Multimodal Large Language Models [5.987458168544856]
Safe UAV emergency landing requires understanding complex semantic risks invisible to traditional geometric sensors.<n>We propose a novel framework leveraging Remote Sensing (RS) imagery and Multimodal Large Language Models (MLLMs) for context-aware landing site assessment.
arXiv Detail & Related papers (2026-02-01T11:30:03Z)
Demystifying Data-Driven Probabilistic Medium-Range Weather Forecasting [63.8116386935854]
We demonstrate that state-of-the-art probabilistic skill requires neither intricate architectural constraints nor specialized trainings.<n>We introduce a scalable framework for learning multi-scale atmospheric dynamics by combining a directly downsampled latent space with a history-conditioned local projector.<n>We find that our framework design is robust to the choice of probabilistic estimators, seamlessly supporting interpolants, diffusion models, and CRPS-based ensemble training.
arXiv Detail & Related papers (2026-01-26T03:52:16Z)
Spatiotemporal Wildfire Prediction and Reinforcement Learning for Helitack Suppression [0.7734713569509623]
Wildfire are growing in frequency and intensity, causing billions of dollars in suppression costs and devastating economic damage annually in the U.S.<n>Traditional wildfire management is mostly reactive, addressing fires only after they are detected.<n>We introduce textitFireCastRL, a proactive AI framework that combines wildfire forecasting with intelligent suppression strategies.
arXiv Detail & Related papers (2026-01-20T18:50:12Z)
Proof of Concept: Multi-Target Wildfire Risk Prediction and Large Language Model Synthesis [2.2049183478692593]
Current state-of-the-art approaches to wildfire risk assessment often overlook operational needs, limiting their practical value for first responders and firefighting services.<n>We propose the development of a hybrid framework that combines predictive models for each risk dimension with large language models (LLMs) to synthesize heterogeneous outputs into structured, actionable reports.
arXiv Detail & Related papers (2026-01-16T10:47:13Z)
VLM2GeoVec: Toward Universal Multimodal Embeddings for Remote Sensing [59.73939718087177]
Single-encoder vision-language model trained contrastively to embed interleaved inputs in a unified vector space.<n>VLM2GeoVec unifies scalable retrieval with region-level spatial reasoning, enabling cohesive multimodal analysis in remote sensing.
arXiv Detail & Related papers (2025-12-12T11:39:35Z)
That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation [55.78914774437411]
Large language models (LLMs) behave when faced with discrepancies between their parametric knowledge and conflicting information contained in a prompt.<n>We propose a domain-agnostic framework for constructing and interpreting such conflicts.<n>We show that activation-level steering can achieve up to a textbf12.6% improvement in steering success over a random baseline.
arXiv Detail & Related papers (2025-10-21T22:27:56Z)
Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response [9.801192259936888]
Existing statistical approaches often lack semantic context, generalize poorly across events, and offer limited interpretability.<n>We introduce a Geospatial Awareness Layer (GAL) that grounds LLM agents in structured earth data.<n>GAL automatically retrieves and integrates infrastructure, demographic, terrain, and weather information from external geo databases.<n>This enriched context enables agents to produce evidence-based resource-allocation recommendations.
arXiv Detail & Related papers (2025-10-14T01:59:02Z)
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning [62.09195763860549]
Reinforcement learning with verifiable rewards (RLVR) improves reasoning in large language models (LLMs) but struggles with exploration.<n>We introduce $textbfVOGUE (Visual Uncertainty Guided Exploration)$, a novel method that shifts exploration from the output (text) to the input (visual) space.<n>Our work shows that grounding exploration in the inherent uncertainty of visual inputs is an effective strategy for improving multimodal reasoning.
arXiv Detail & Related papers (2025-10-01T20:32:08Z)
Diversity-Incentivized Exploration for Versatile Reasoning [63.653348177250756]
We propose textbfDIVER (textbfDi-textbfIncentivized Exploration for textbfVersatiltextbfE textbfReasoning), an innovative framework that highlights the pivotal role of global sequence-level diversity to incentivize deep exploration for versatile reasoning.
arXiv Detail & Related papers (2025-09-30T13:11:46Z)
mSCoRe: a $M$ultilingual and Scalable Benchmark for $S$kill-based $Co$mmonsense $Re$asoning [74.97363626515236]
We propose a textbfMultilingual and Scalable Benchmark for textbfSkill-based textbfCommonsense textbfReasoning (textbfmSCoRe)<n>Our benchmark incorporates three key components that are designed to systematically evaluate LLM's reasoning capabilities.<n>Our results reveal the limitations of such reasoning-reinforced models when confronted with nuanced multilingual general and cultural commonsense.
arXiv Detail & Related papers (2025-08-13T18:59:02Z)
CanadaFireSat: Toward high-resolution wildfire forecasting with multiple modalities [13.151866497852211]
Canada experienced one of the most severe wildfire seasons in recent history in 2023.<n>This extreme wildfire season is symptomatic of a climate-change-induced increase in the length and severity of the fire season that affects the boreal ecosystem.<n>It is critical to empower wildfire management in boreal communities with better mitigation solutions.
arXiv Detail & Related papers (2025-06-10T10:58:43Z)
OneForecast: A Universal Framework for Global and Regional Weather Forecasting [67.61381313555091]
We propose a global-regional nested weather forecasting framework (OneForecast) based on graph neural networks.<n>By combining a dynamic system perspective with multi-grid theory, we construct a multi-scale graph structure and densify the target region.<n>We introduce an adaptive messaging mechanism, using dynamic gating units, to deeply integrate node and edge features for more accurate extreme event forecasting.
arXiv Detail & Related papers (2025-02-01T06:49:16Z)
Advanced Wildfire Prediction in Morocco: Developing a Deep Learning Dataset from Multisource Observations [0.0]
This study introduces a novel and comprehensive dataset specifically designed for wildfire prediction in Morocco. We compile essential environmental indicators such as vegetation health (NDVI), population density, soil moisture levels, and meteorological data. Preliminary results show that models using this dataset achieve an accuracy of up to 90%, significantly improving prediction capabilities.
arXiv Detail & Related papers (2024-11-09T15:01:12Z)
Explainable Global Wildfire Prediction Models using Graph Neural Networks [2.2389592950633705]
We introduce an innovative Graph Neural Network (GNN)-based model for global wildfire prediction. Our approach transforms global climate and wildfire data into a graph representation, addressing challenges such as null oceanic data locations.
arXiv Detail & Related papers (2024-02-11T10:44:41Z)

This list is automatically generated from the titles and abstracts of the papers in this site.