NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway
- URL: http://arxiv.org/abs/2512.09913v1
- Date: Wed, 10 Dec 2025 18:47:25 GMT
- Title: NordFKB: a fine-grained benchmark dataset for geospatial AI in Norway
- Authors: Sander Riisøen Jyhne, Aditya Gupta, Ben Worsley, Marianne Andersen, Ivar Oveland, Alexander Salveson Nossum,
- Abstract summary: We present NordFKB, a fine-grained benchmark dataset for geospatial AI in Norway.<n>The dataset contains high-resolution orthophotos paired with detailed annotations for 36 semantic classes.<n> NordFKB provides a robust foundation for advancing AI methods in mapping, land administration, and spatial planning.
- Score: 36.2419347658476
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present NordFKB, a fine-grained benchmark dataset for geospatial AI in Norway, derived from the authoritative, highly accurate, national Felles KartdataBase (FKB). The dataset contains high-resolution orthophotos paired with detailed annotations for 36 semantic classes, including both per-class binary segmentation masks in GeoTIFF format and COCO-style bounding box annotations. Data is collected from seven geographically diverse areas, ensuring variation in climate, topography, and urbanization. Only tiles containing at least one annotated object are included, and training/validation splits are created through random sampling across areas to ensure representative class and context distributions. Human expert review and quality control ensures high annotation accuracy. Alongside the dataset, we release a benchmarking repository with standardized evaluation protocols and tools for semantic segmentation and object detection, enabling reproducible and comparable research. NordFKB provides a robust foundation for advancing AI methods in mapping, land administration, and spatial planning, and paves the way for future expansions in coverage, temporal scope, and data modalities.
Related papers
- FOTBCD: A Large-Scale Building Change Detection Benchmark from French Orthophotos and Topographic Data [0.0]
FOTBCD is a large-scale building change detection dataset derived from authoritative French orthophotos and topographic building data provided by IGN France.<n>It spans 28 departments across mainland France, with 25 used for training and three geographically disjoint departments held out for evaluation.<n>The dataset covers diverse urban, suburban, and rural environments at 0.2m/pixel resolution.
arXiv Detail & Related papers (2026-01-30T05:42:42Z) - LC-SLab -- An Object-based Deep Learning Framework for Large-scale Land Cover Classification from Satellite Imagery and Sparse In-situ Labels [25.42215602005236]
We propose LC-SLab, a framework for exploring object-based deep learning methods for large-scale land cover classification under sparse supervision.<n> LC-SLab supports both input-level aggregation via graph neural networks, and output-level aggregation by postprocessing results.<n>Our results show that object-based methods can match or exceed the accuracy of common pixel-wise models while producing substantially more coherent maps.
arXiv Detail & Related papers (2025-09-19T11:08:24Z) - HyBiomass: Global Hyperspectral Imagery Benchmark Dataset for Evaluating Geospatial Foundation Models in Forest Aboveground Biomass Estimation [1.0408909053766147]
We introduce a globally distributed benchmark dataset for forest aboveground biomass (AGB) estimation.<n>This benchmark dataset combines co-located hyperspectral imagery (HSI) from the Environmental Mapping and Analysis Program (EnMAP) satellite and predictions of AGB density estimates.<n>Our experimental results on this dataset demonstrate that the evaluated Geo-FMs can match or, in some cases, surpass the performance of a baseline U-Net.
arXiv Detail & Related papers (2025-06-12T21:29:20Z) - Performance and Generalizability Impacts of Incorporating Location Encoders into Deep Learning for Dynamic PM2.5 Estimation [0.39762912548964846]
We study the effect of geolocation into deep learning for a dynamic and spatially heterogeneous application.<n>We estimate daily surface-level PM2.5 across the contiguous United States using satellite and ground-based observations.<n>Results show that raw coordinates can improve performance within regions by supporting spatial generalization, but can reduce generalizability across regions.
arXiv Detail & Related papers (2025-05-24T02:00:34Z) - A Dataset for Semantic Segmentation in the Presence of Unknowns [49.795683850385956]
Existing datasets allow evaluation of only knowns or unknowns - but not both.<n>We propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments.<n>The dataset is twice larger than existing anomaly segmentation datasets.
arXiv Detail & Related papers (2025-03-28T10:31:01Z) - Downstream-Pretext Domain Knowledge Traceback for Active Learning [138.02530777915362]
We propose a downstream-pretext domain knowledge traceback (DOKT) method that traces the data interactions of downstream knowledge and pre-training guidance.
DOKT consists of a traceback diversity indicator and a domain-based uncertainty estimator.
Experiments conducted on ten datasets show that our model outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2024-07-20T01:34:13Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation
for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - Divide and Contrast: Source-free Domain Adaptation via Adaptive
Contrastive Learning [122.62311703151215]
Divide and Contrast (DaC) aims to connect the good ends of both worlds while bypassing their limitations.
DaC divides the target data into source-like and target-specific samples, where either group of samples is treated with tailored goals.
We further align the source-like domain with the target-specific samples using a memory bank-based Maximum Mean Discrepancy (MMD) loss to reduce the distribution mismatch.
arXiv Detail & Related papers (2022-11-12T09:21:49Z) - Nearest Neighborhood-Based Deep Clustering for Source Data-absent
Unsupervised Domain Adaptation [33.394228127643494]
In the classic setting of unsupervised domain adaptation (UDA), the labeled source data are available in the training phase.
In many real-world scenarios, the source data is inaccessible, and only a model trained on the source domain is available.
This paper proposes a novel deep clustering method for this challenging task.
arXiv Detail & Related papers (2021-07-27T04:13:59Z) - A hierarchical deep learning framework for the consistent classification
of land use objects in geospatial databases [8.703408520845645]
In this paper, a hierarchical deep learning framework is proposed to verify the land use information.
A new CNN-based method is proposed aiming to predict land use in multiple levels hierarchically and simultaneously.
Experiments show that the CNN relying on JO outperforms previous results, achieving an overall accuracy up to 92.5%.
arXiv Detail & Related papers (2021-04-14T17:16:35Z) - BREEDS: Benchmarks for Subpopulation Shift [98.90314444545204]
We develop a methodology for assessing the robustness of models to subpopulation shift.
We leverage the class structure underlying existing datasets to control the data subpopulations that comprise the training and test distributions.
Applying this methodology to the ImageNet dataset, we create a suite of subpopulation shift benchmarks of varying granularity.
arXiv Detail & Related papers (2020-08-11T17:04:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.