Assessing Building Heat Resilience Using UAV and Street-View Imagery with Coupled Global Context Vision Transformer
- URL: http://arxiv.org/abs/2601.11357v1
- Date: Fri, 16 Jan 2026 15:13:30 GMT
- Title: Assessing Building Heat Resilience Using UAV and Street-View Imagery with Coupled Global Context Vision Transformer
- Authors: Steffen Knoblauch, Ram Kumar Muthusamy, Hao Li, Iddy Chazua, Benedcto Adamu, Innocent Maholi, Alexander Zipf,
- Abstract summary: Climate change is intensifying human heat exposure, particularly in densely built urban centers of the Global South.<n>We propose a machine learning framework that fuses openly available unmanned aerial vehicle (UAV) and street-view (SV) imagery.<n>Our results point to the critical role of localized, data-driven risk assessment in shaping climate adaptation strategies.
- Score: 37.356470410535366
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Climate change is intensifying human heat exposure, particularly in densely built urban centers of the Global South. Low-cost construction materials and high thermal-mass surfaces further exacerbate this risk. Yet scalable methods for assessing such heat-relevant building attributes remain scarce. We propose a machine learning framework that fuses openly available unmanned aerial vehicle (UAV) and street-view (SV) imagery via a coupled global context vision transformer (CGCViT) to learn heat-relevant representations of urban structures. Thermal infrared (TIR) measurements from HotSat-1 are used to quantify the relationship between building attributes and heat-associated health risks. Our dual-modality cross-view learning approach outperforms the best single-modality models by up to $9.3\%$, demonstrating that UAV and SV imagery provide valuable complementary perspectives on urban structures. The presence of vegetation surrounding buildings (versus no vegetation), brighter roofing (versus darker roofing), and roofing made of concrete, clay, or wood (versus metal or tarpaulin) are all significantly associated with lower HotSat-1 TIR values. Deployed across the city of Dar es Salaam, Tanzania, the proposed framework illustrates how household-level inequalities in heat exposure - often linked to socio-economic disadvantage and reflected in building materials - can be identified and addressed using machine learning. Our results point to the critical role of localized, data-driven risk assessment in shaping climate adaptation strategies that deliver equitable outcomes.
Related papers
- LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions [80.70675855203154]
Inverse rendering in urban scenes is pivotal for applications like autonomous driving and digital twins.<n>Yet, it faces significant challenges due to complex illumination conditions, including multi-illumination and indirect light and shadow effects.<n>We present LightCity, a novel high-quality synthetic urban dataset featuring diverse illumination conditions with realistic indirect light and shadow effects.
arXiv Detail & Related papers (2026-02-01T09:37:00Z) - HeatMat: Simulation of City Material Impact on Urban Heat Island Effect [5.9791504486574425]
The Urban Heat Island (UHI) effect is a significant increase in temperature in urban environments compared to surrounding areas.<n>Among the factors contributing to this effect are the properties of urban materials, which differ from those in rural areas.<n>We propose HeatMat, an approach to analyze at high resolution the individual impact of urban materials on the UHI effect in a real city.
arXiv Detail & Related papers (2026-01-30T10:20:47Z) - FireScope: Wildfire Risk Prediction with a Chain-of-Thought Oracle [69.84129020970477]
Existing methods lack the causal reasoning and understanding required for reliable generalization.<n>We introduce $textbfFireScope-Bench, a dataset and benchmark that couples Sentinel-2 imagery and climate data with expert-defined risks.<n>When trained in the USA and tested in Europe, $textbfFireScope$ achieves substantial performance gains.<n>Our findings demonstrate that reasoning can ground prediction models, improving both generalization and interpretability.
arXiv Detail & Related papers (2025-11-21T11:45:22Z) - Solar PV Installation Potential Assessment on Building Facades Based on Vision and Language Foundation Models [11.037550898765502]
This study introduces SF-SPA (Semantic Facade Solar-PV Assessment), an automated framework that transforms street-view photographs into quantitative PV deployment assessments.<n>The approach combines com puter vision and artificial intelligence techniques to address three key challenges: perspective distortion correction, semantic understanding of facade elements, and spatial reasoning for PV layout optimization.
arXiv Detail & Related papers (2025-10-01T11:51:28Z) - RoSe: Robust Self-supervised Stereo Matching under Adverse Weather Conditions [58.37558408672509]
We propose a robust self-supervised training paradigm, consisting of two key steps: robust self-supervised scene correspondence learning and adverse weather distillation.<n>Experiments demonstrate the effectiveness and versatility of our proposed solution, which outperforms existing state-of-the-art self-supervised methods.
arXiv Detail & Related papers (2025-09-23T15:41:40Z) - Adopting Explainable-AI to investigate the impact of urban morphology design on energy and environmental performance in dry-arid climates [0.0]
This study advances urban morphology evaluation by combining Urban Building Energy Modeling (UBEM) with machine learning methods (ML) and Explainable AI techniques.<n>Using Tehran's dense urban landscape as a case study, this research assesses and ranks the impact of 30 morphology parameters on key energy metrics.<n>Findings reveal that building shape, window-to-wall ratio, and commercial ratio are the most critical parameters affecting energy efficiency.
arXiv Detail & Related papers (2024-12-13T09:19:49Z) - ReCap: Better Gaussian Relighting with Cross-Environment Captures [51.2614945509044]
We present ReCap, a multi-task system for accurate 3D object relighting in unseen environments.<n>Specifically, ReCap jointly optimize multiple lighting representations that share a common set of material attributes.<n>This naturally harmonizes a coherent set of lighting representations around the mutual material attributes, exploiting commonalities and differences across varied object appearances.<n>Together with a streamlined shading function and effective post-processing, ReCap outperforms all leading competitors on an expanded relighting benchmark.
arXiv Detail & Related papers (2024-12-10T14:15:32Z) - Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal Datasets [24.770137545969312]
This research tackles the challenges of estimating Building-Integrated Photovoltaics (BIPV) potential across various temporal and spatial scales.<n>We introduce a holistic methodology for evaluating BIPV potential, integrating 3D building footprint models with diverse meteorological data sources.<n>We highlight the importance of 3D building forms, cityscape morphology, and geographic positioning in measuring BIPV potential at various levels.
arXiv Detail & Related papers (2024-12-02T09:04:16Z) - Estimating Building Energy Efficiency From Street View Imagery, Aerial
Imagery, and Land Surface Temperature Data [0.0]
This work proposes a new method which can estimate a building's energy efficiency using purely remotely sensed data.
We find that in the binary setting of distinguishing efficient from inefficient buildings, our end-to-end deep learning model achieves a macro-averaged F1-score of 62.06%.
arXiv Detail & Related papers (2022-06-05T21:04:20Z) - Maximizing Self-supervision from Thermal Image for Effective
Self-supervised Learning of Depth and Ego-motion [78.19156040783061]
Self-supervised learning of depth and ego-motion from thermal images shows strong robustness and reliability under challenging scenarios.
The inherent thermal image properties such as weak contrast, blurry edges, and noise hinder to generate effective self-supervision from thermal images.
We propose an effective thermal image mapping method that significantly increases image information, such as overall structure, contrast, and details, while preserving temporal consistency.
arXiv Detail & Related papers (2022-01-12T09:49:24Z) - Urban Heat Islands: Beating the Heat with Multi-Modal Spatial Analysis [0.3121997724420106]
Excessive levels of heat stress leads to problems at various levels, ranging from the individual to the world.
At the world level, UHI potentially contributes to global warming and adversely affects the environment.
We propose a framework for investigating how UHI is affected by a city's urban form characteristics through the use of statistical modelling.
arXiv Detail & Related papers (2020-12-05T15:18:22Z) - Dynamical Landscape and Multistability of a Climate Model [64.467612647225]
We find a third intermediate stable state in one of the two climate models we consider.
The combination of our approaches allows to identify how the negative feedback of ocean heat transport and entropy production drastically change the topography of Earth's climate.
arXiv Detail & Related papers (2020-10-20T15:31:38Z) - RescueNet: Joint Building Segmentation and Damage Assessment from
Satellite Imagery [83.49145695899388]
RescueNet is a unified model that can simultaneously segment buildings and assess the damage levels to individual buildings and can be trained end-to-end.
RescueNet is tested on the large scale and diverse xBD dataset and achieves significantly better building segmentation and damage classification performance than previous methods.
arXiv Detail & Related papers (2020-04-15T19:52:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.