Related papers: Benchmarking individual tree segmentation using multispectral airborne laser scanning data: the FGI-EMIT dataset

Benchmarking individual tree segmentation using multispectral airborne laser scanning data: the FGI-EMIT dataset

URL: http://arxiv.org/abs/2511.00653v1
Date: Sat, 01 Nov 2025 18:31:18 GMT
Title: Benchmarking individual tree segmentation using multispectral airborne laser scanning data: the FGI-EMIT dataset
Authors: Lassi Ruoppa, Tarmo Hietala, Verneri Seppänen, Josef Taher, Teemu Hakala, Xiaowei Yu, Antero Kukko, Harri Kaartinen, Juha Hyyppä,
Abstract summary: This study introduces FGI-EMIT, the first large-scale airborne laser scanning benchmark dataset for individual tree segmentation.<n>The dataset consists of 1,561 manually annotated trees, with a particular focus on small understory trees.
Score: 4.560913422651555
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Individual tree segmentation (ITS) from LiDAR point clouds is fundamental for applications such as forest inventory, carbon monitoring and biodiversity assessment. Traditionally, ITS has been achieved with unsupervised geometry-based algorithms, while more recent advances have shifted toward supervised deep learning (DL). In the past, progress in method development was hindered by the lack of large-scale benchmark datasets, and the availability of novel data formats, particularly multispectral (MS) LiDAR, remains limited to this day, despite evidence that MS reflectance can improve the accuracy of ITS. This study introduces FGI-EMIT, the first large-scale MS airborne laser scanning benchmark dataset for ITS. Captured at wavelengths 532, 905, and 1,550 nm, the dataset consists of 1,561 manually annotated trees, with a particular focus on small understory trees. Using FGI-EMIT, we comprehensively benchmarked four conventional unsupervised algorithms and four supervised DL approaches. Hyperparameters of unsupervised methods were optimized using a Bayesian approach, while DL models were trained from scratch. Among the unsupervised methods, Treeiso achieved the highest test set F1-score of 52.7%. The DL approaches performed significantly better overall, with the best model, ForestFormer3D, attaining an F1-score of 73.3%. The most significant difference was observed in understory trees, where ForestFormer3D exceeded Treeiso by 25.9 percentage points. An ablation study demonstrated that current DL-based approaches generally fail to leverage MS reflectance information when it is provided as additional input features, although single channel reflectance can improve accuracy marginally, especially for understory trees. A performance analysis across point densities further showed that DL methods consistently remain superior to unsupervised algorithms, even at densities as low as 10 points/m$^2$.

Related papers

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs [75.72672339168092]
We introduce ReasonFlux-PRM, a novel trajectory-aware PRM to evaluate trajectory-response type of reasoning traces.<n>ReasonFlux-PRM incorporates both step-level and trajectory-level supervision, enabling fine-grained reward assignment aligned with structured chain-of-thought data.<n>Our derived ReasonFlux-PRM-7B yields consistent performance improvements, achieving average gains of 12.1% in supervised fine-tuning, 4.5% in reinforcement learning, and 6.3% in test-time scaling.
arXiv Detail & Related papers (2025-06-23T17:59:02Z)
EfficientLLM: Efficiency in Large Language Models [64.3537131208038]
Large Language Models (LLMs) have driven significant progress, yet their growing counts and context windows incur prohibitive compute, energy, and monetary costs.<n>We introduce EfficientLLM, a novel benchmark and the first comprehensive empirical study evaluating efficiency techniques for LLMs at scale.
arXiv Detail & Related papers (2025-05-20T02:27:08Z)
Multispectral airborne laser scanning for tree species classification: a benchmark of machine learning and deep learning algorithms [3.9167717582896793]
Multispectral airborne laser scanning (ALS) has shown promise in automated point cloud processing and tree segmentation.<n>This study addresses these gaps by conducting a benchmark of machine learning and deep learning methods for tree species classification.
arXiv Detail & Related papers (2025-04-19T16:03:49Z)
Manual Labelling Artificially Inflates Deep Learning-Based Segmentation Performance on RGB Images of Closed Canopy: Validation Using TLS [0.0]
Traditional methods relying on field-based forest inventories are labor-intensive and limited in spatial coverage.<n>We generate high-fidelity validation labels from co-located Terrestrial Laser Scanning (TLS) data for drone imagery of boreal and Mediterranean forests.<n>We evaluate the performance of two widely used deep learning ITC segmentation models - DeepForest (RetinaNet) and Detectree2 (Mask R-CNN)<n>Both models showed very poor localisation accuracy at stricter IoU thresholds, even when restricted to canopy trees.
arXiv Detail & Related papers (2025-03-18T14:09:00Z)
Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds [1.6633665061166945]
This study proposes a fully unsupervised deep learning method for leaf-wood separation of high-density laser scanning point clouds.<n>GrowSP-ForMS achieved a mean accuracy of 84.3% and a mean intersection over union (mIoU) of 69.6% on our MS test set.
arXiv Detail & Related papers (2025-02-10T07:58:49Z)
Benchmarking tree species classification from proximally-sensed laser scanning data: introducing the FOR-species20K dataset [1.2771525473423657]
FOR-species20K benchmark was created, comprising over 20,000 tree point clouds from 33 species. This dataset enables the benchmarking of DL models for tree species classification. The top model, DetailView, was particularly robust, handling data imbalances well and generalizing effectively across tree sizes.
arXiv Detail & Related papers (2024-08-12T21:47:15Z)
SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [68.18620488664187]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++.<n> Specifically, we observe that objects from aerial images usually have arbitrary orientations, small scales, and dense distribution.<n>Extensive experiments conducted on various oriented object under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z)
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs [54.05511925104712]
We propose a simple, effective, and data-efficient method called Step-DPO. Step-DPO treats individual reasoning steps as units for preference optimization rather than evaluating answers holistically. Our findings demonstrate that as few as 10K preference data pairs and fewer than 500 Step-DPO training steps can yield a nearly 3% gain in accuracy on MATH for models with over 70B parameters.
arXiv Detail & Related papers (2024-06-26T17:43:06Z)
A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques [63.10251271444959]
Large language models are first pre-trained on trillions of tokens and then instruction-tuned or aligned to specific preferences. We conduct an in-depth investigation of the impact of popular choices for three crucial axes. Our setup spanning over 300 experiments reveals consistent trends and unexpected findings.
arXiv Detail & Related papers (2024-06-07T12:25:51Z)
SegmentAnyTree: A sensor and platform agnostic deep learning model for tree segmentation using laser scanning data [15.438892555484616]
This research advances individual tree crown (ITC) segmentation in lidar data, using a deep learning model applicable to various laser scanning types. It addresses the challenge of transferability across different data characteristics in 3D forest scene analysis. The model, based on PointGroup architecture, is a 3D CNN with separate heads for semantic and instance segmentation.
arXiv Detail & Related papers (2024-01-28T19:47:17Z)
Vision Transformers, a new approach for high-resolution and large-scale mapping of canopy heights [50.52704854147297]
We present a new vision transformer (ViT) model optimized with a classification (discrete) and a continuous loss function. This model achieves better accuracy than previously used convolutional based approaches (ConvNets) optimized with only a continuous loss function.
arXiv Detail & Related papers (2023-04-22T22:39:03Z)

This list is automatically generated from the titles and abstracts of the papers in this site.