Related papers: Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method

URL: http://arxiv.org/abs/2505.18021v1
Date: Fri, 23 May 2025 15:27:46 GMT
Title: Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method
Authors: Yao Sun, Sining Chen, Yifan Tian, Xiao Xiang Zhu
Abstract summary: Large-scale floor-count data are rarely available in cadastral and 3D city databases. This study proposes an end-to-end deep learning framework that infers floor numbers directly from street-level imagery. The proposed classification-regression network attains 81.2% exact accuracy and predicts 97.9% of buildings within +/-1 floor.
Score: 17.492721759864505
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Accurate information on the number of building floors, or above-ground storeys, is essential for household estimation, utility provision, risk assessment, evacuation planning, and energy modeling. Yet large-scale floor-count data are rarely available in cadastral and 3D city databases. This study proposes an end-to-end deep learning framework that infers floor numbers directly from unrestricted, crowdsourced street-level imagery, avoiding hand-crafted features and generalizing across diverse facade styles. To enable benchmarking, we release the Munich Building Floor Dataset, a public set of over 6800 geo-tagged images collected from Mapillary and targeted field photography, each paired with a verified storey label. On this dataset, the proposed classification-regression network attains 81.2% exact accuracy and predicts 97.9% of buildings within +/-1 floor. The method and dataset together offer a scalable route to enrich 3D city models with vertical information and lay a foundation for future work in urban informatics, remote sensing, and geographic information science. Source code and data will be released under an open license at https://github.com/ya0-sun/Munich-SVI-Floor-Benchmark.
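Note on the reported metrics: the abstract gives two figures, exact accuracy (predicted floor count equals the label) and accuracy within +/-1 floor. The sketch below shows one plausible way to compute both from integer floor labels. It is illustrative only and is not taken from the paper's released code; the function name `floor_metrics`, the `tolerance` parameter, and the sample labels are hypothetical.

```python
from typing import Dict, Sequence


def floor_metrics(
    y_true: Sequence[int], y_pred: Sequence[int], tolerance: int = 1
) -> Dict[str, float]:
    """Return exact accuracy and accuracy within +/- `tolerance` floors.

    Hypothetical helper for illustration; not the authors' evaluation code.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    n = len(y_true)
    # Exact accuracy: predicted floor count matches the label.
    exact = sum(t == p for t, p in zip(y_true, y_pred)) / n
    # Tolerant accuracy: prediction is off by at most `tolerance` floors.
    within = sum(abs(t - p) <= tolerance for t, p in zip(y_true, y_pred)) / n
    return {"exact_accuracy": exact, "within_tolerance": within}


# Made-up labels and predictions, for demonstration only.
labels = [3, 4, 5, 2, 6]
preds = [3, 5, 5, 2, 8]
metrics = floor_metrics(labels, preds)
print(metrics)
```

By construction the tolerant figure is never lower than the exact one, since every exact match also falls within the tolerance.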

Related papers

Population estimation using 3D city modelling and Carto2S datasets -- A case study [0.0]
With the launch of the Carto2S series of satellites, high-resolution images (0.6-1.0 meters) are acquired and available for use. High-resolution Digital Elevation Models (DEMs) with better accuracy can be generated using C2S multi-view and multi-date datasets. DEMs are further used to derive Digital Terrain Models (DTMs) and to extract accurate heights of objects (buildings and trees) on the surface of the Earth.
arXiv Detail & Related papers (2024-11-07T10:52:57Z)
Extracting the U.S. building types from OpenStreetMap data [0.16060719742433224]
This work creates a comprehensive dataset by providing residential/non-residential building classification covering the entire United States. We propose and utilize an unsupervised machine learning method to classify building types based on building footprints and available OpenStreetMap information. The validation shows a high precision for non-residential building classification and a high recall for residential buildings.
arXiv Detail & Related papers (2024-09-09T15:05:27Z)
Identifying every building's function in large-scale urban areas with multi-modality remote-sensing data [5.18540804614798]
This study proposes a semi-supervised framework to identify every building's function in large-scale urban areas. Optical images, building height, and nighttime-light data are collected to describe the morphological attributes of buildings. Results are evaluated by 20,000 validation points and statistical survey reports from the government.
arXiv Detail & Related papers (2024-05-08T15:32:20Z)
Semi-supervised Learning from Street-View Images and OpenStreetMap for Automatic Building Height Estimation [59.6553058160943]
We propose a semi-supervised learning (SSL) method of automatically estimating building height from Mapillary SVI and OpenStreetMap data. The proposed method leads to a clear performance boost in estimating building heights, with a Mean Absolute Error (MAE) of around 2.1 meters. The preliminary result is promising and motivates our future work in scaling up the proposed method based on low-cost VGI data.
arXiv Detail & Related papers (2023-07-05T18:16:30Z)
Building Floorspace in China: A Dataset and Learning Pipeline [0.32228025627337864]
This paper provides a first milestone in measuring the floorspace of buildings in 40 major Chinese cities. We use Sentinel-1 and -2 satellite images as our main data source. We provide a detailed description of our data, algorithms, and evaluations.
arXiv Detail & Related papers (2023-03-03T21:45:36Z)
A Large Scale Homography Benchmark [52.55694707744518]
We present a large-scale dataset of Planes in 3D, Pi3D, of roughly 1000 planes observed in 10 000 images from the 1DSfM dataset. We also present HEB, a large-scale homography estimation benchmark leveraging Pi3D.
arXiv Detail & Related papers (2023-02-20T14:18:09Z)
SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds [52.624157840253204]
We introduce SensatUrban, an urban-scale UAV photogrammetry point cloud dataset consisting of nearly three billion points collected from three UK cities, covering 7.6 km². Each point in the dataset has been labelled with fine-grained semantic annotations, resulting in a dataset that is three times the size of the previous largest photogrammetric point cloud dataset.
arXiv Detail & Related papers (2022-01-12T14:48:11Z)
FloorLevel-Net: Recognizing Floor-Level Lines with Height-Attention-Guided Multi-task Learning [49.30194762653723]
This work tackles the problem of locating floor-level lines in street-view images, using a supervised deep learning approach. We first compile a new dataset and develop a new data augmentation scheme to synthesize training samples. Next, we design FloorLevel-Net, a multi-task learning network that associates explicit features of building facades and implicit floor-level lines.
arXiv Detail & Related papers (2021-07-06T08:17:59Z)
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges [52.624157840253204]
We present an urban-scale photogrammetric point cloud dataset with nearly three billion richly annotated points. Our dataset consists of large areas from three UK cities, covering about 7.6 km² of the city landscape. We evaluate the performance of state-of-the-art algorithms on our dataset and provide a comprehensive analysis of the results.
arXiv Detail & Related papers (2020-09-07T14:47:07Z)
Hidden Footprints: Learning Contextual Walkability from 3D Human Trails [70.01257397390361]
Current datasets only tell you where people are, not where they could be. We first augment the set of valid, labeled walkable regions by propagating person observations between images, utilizing 3D information to create what we call hidden footprints. We devise a training strategy designed for such sparse labels, combining a class-balanced classification loss with a contextual adversarial loss.
arXiv Detail & Related papers (2020-08-19T23:19:08Z)

This list is automatically generated from the titles and abstracts of the papers on this site.