FOTBCD: A Large-Scale Building Change Detection Benchmark from French Orthophotos and Topographic Data
- URL: http://arxiv.org/abs/2601.22596v1
- Date: Fri, 30 Jan 2026 05:42:42 GMT
- Title: FOTBCD: A Large-Scale Building Change Detection Benchmark from French Orthophotos and Topographic Data
- Authors: Abdelrrahman Moubane,
- Abstract summary: FOTBCD is a large-scale building change detection dataset derived from authoritative French orthophotos and topographic building data provided by IGN France.<n>It spans 28 departments across mainland France, with 25 used for training and three geographically disjoint departments held out for evaluation.<n>The dataset covers diverse urban, suburban, and rural environments at 0.2m/pixel resolution.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We introduce FOTBCD, a large-scale building change detection dataset derived from authoritative French orthophotos and topographic building data provided by IGN France. Unlike existing benchmarks that are geographically constrained to single cities or limited regions, FOTBCD spans 28 departments across mainland France, with 25 used for training and three geographically disjoint departments held out for evaluation. The dataset covers diverse urban, suburban, and rural environments at 0.2m/pixel resolution. We publicly release FOTBCD-Binary, a dataset comprising approximately 28,000 before/after image pairs with pixel-wise binary building change masks, each associated with patch-level spatial metadata. The dataset is designed for large-scale benchmarking and evaluation under geographic domain shift, with validation and test samples drawn from held-out departments and manually verified to ensure label quality. In addition, we publicly release FOTBCD-Instances, a publicly available instance-level annotated subset comprising several thousand image pairs, which illustrates the complete annotation schema used in the full instance-level version of FOTBCD. Using a fixed reference baseline, we benchmark FOTBCD-Binary against LEVIR-CD+ and WHU-CD, providing strong empirical evidence that geographic diversity at the dataset level is associated with improved cross-domain generalization in building change detection.
Related papers
- NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway [36.2419347658476]
We present NordFKB, a fine-grained benchmark dataset for geospatial AI in Norway.<n>The dataset contains high-resolution orthophotos paired with detailed annotations for 36 semantic classes.<n> NordFKB provides a robust foundation for advancing AI methods in mapping, land administration, and spatial planning.
arXiv Detail & Related papers (2025-12-10T18:47:25Z) - ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization [54.64604722756443]
ForensicHub is the first unified benchmark for all-domain fake image detection and localization.<n>It decomposes forensic pipelines into interchangeable components across datasets, transforms, models, and evaluators.<n>It offers 8 key actionable insights into FIDL model architecture, dataset characteristics, and evaluation standards.
arXiv Detail & Related papers (2025-05-16T08:49:59Z) - Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach [69.01456182499486]
textbfBR-Gen is a large-scale dataset of 150,000 locally forged images with diverse scene-aware annotations.<n>textbfNFA-ViT is a Noise-guided Forgery Amplification Vision Transformer that enhances the detection of localized forgeries.
arXiv Detail & Related papers (2025-04-16T09:57:23Z) - OAM-TCD: A globally diverse dataset of high-resolution tree cover maps [8.336960607169175]
We present a novel open-access dataset for individual tree crown delineation (TCD) in high-resolution aerial imagery sourced from OpenMap (OAM)
Our dataset, OAM-TCD, comprises 5072 2048x2048px images at 10 cm/px resolution with associated human-labeled instance masks for over 280k individual and 56k groups of trees.
Using our dataset, we train reference instance and semantic segmentation models that compare favorably to existing state-of-the-art models.
arXiv Detail & Related papers (2024-07-16T14:11:29Z) - AGBD: A Global-scale Biomass Dataset [18.976975819550173]
Existing datasets for Above Ground Biomass estimation from satellite imagery are limited.<n>This dataset combines AGB reference data from the GEDI mission with data from Sentinel-2 and PALSAR-2 imagery.<n>It includes pre-processed high-level features such as a dense canopy height map, an elevation map, and a land-cover classification map.<n>It can be easily accessed using a single line of code, offering a solid basis for efforts towards global AGB estimation.
arXiv Detail & Related papers (2024-06-07T13:34:17Z) - SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection [79.23689506129733]
We establish a new benchmark dataset and an open-source method for large-scale SAR object detection.<n>Our dataset, SARDet-100K, is a result of intense surveying, collecting, and standardizing 10 existing SAR detection datasets.<n>To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
arXiv Detail & Related papers (2024-03-11T09:20:40Z) - FLAIR: a Country-Scale Land Cover Semantic Segmentation Dataset From
Multi-Source Optical Imagery [4.9687851703152806]
We introduce the French Land cover from Aerospace ImageRy (FLAIR), an extensive dataset from the French National Institute of Geographical and Forest Information (IGN)
FLAIR contains high-resolution aerial imagery with a ground sample distance of 20 cm and over 20 billion individually labeled pixels for precise land-cover classification.
The dataset also integrates temporal and spectral data from optical satellite time series.
arXiv Detail & Related papers (2023-10-20T07:55:12Z) - A Large Scale Homography Benchmark [52.55694707744518]
We present a large-scale dataset of Planes in 3D, Pi3D, of roughly 1000 planes observed in 10 000 images from the 1DSfM dataset.
We also present HEB, a large-scale homography estimation benchmark leveraging Pi3D.
arXiv Detail & Related papers (2023-02-20T14:18:09Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation
for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - CalCROP21: A Georeferenced multi-spectral dataset of Satellite Imagery
and Crop Labels [20.951184753721503]
The U.S. Department of Agriculture (USDA) annually releases the Cropland Data Layer (CDL) which contains crop labels at 30m resolution for the entire U.S.
We create a new semantic segmentation benchmark dataset, which we call CalCROP21, for the diverse crops in the Central Valley region of California at 10m spatial resolution.
arXiv Detail & Related papers (2021-07-26T22:20:16Z) - Adversarial Dual Distinct Classifiers for Unsupervised Domain Adaptation [67.83872616307008]
Unversarial Domain adaptation (UDA) attempts to recognize the unlabeled target samples by building a learning model from a differently-distributed labeled source domain.
In this paper, we propose a novel Adrial Dual Distincts Network (AD$2$CN) to align the source and target domain data distribution simultaneously with matching task-specific category boundaries.
To be specific, a domain-invariant feature generator is exploited to embed the source and target data into a latent common space with the guidance of discriminative cross-domain alignment.
arXiv Detail & Related papers (2020-08-27T01:29:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.