Instance segmentation of fallen trees in aerial color infrared imagery
using active multi-contour evolution with fully convolutional network-based
intensity priors
- URL: http://arxiv.org/abs/2105.01998v1
- Date: Wed, 5 May 2021 11:54:05 GMT
- Title: Instance segmentation of fallen trees in aerial color infrared imagery
using active multi-contour evolution with fully convolutional network-based
intensity priors
- Authors: Przemyslaw Polewski, Jacquelyn Shelton, Wei Yao and Marco Heurich
- Abstract summary: We introduce a framework for segmenting instances of a common object class by multiple active contour evolution over semantic segmentation maps of images.
We instantiate the proposed framework in the context of segmenting individual fallen stems from high-resolution aerial multispectral imagery.
- Score: 0.5276232626689566
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this paper, we introduce a framework for segmenting instances of a common
object class by multiple active contour evolution over semantic segmentation
maps of images obtained through fully convolutional networks. The contour
evolution is cast as an energy minimization problem, where the aggregate energy
functional incorporates a data fit term, an explicit shape model, and accounts
for object overlap. Efficient solution neighborhood operators are proposed,
enabling optimization through metaheuristics such as simulated annealing. We
instantiate the proposed framework in the context of segmenting individual
fallen stems from high-resolution aerial multispectral imagery. We validated
our approach on 3 real-world scenes of varying complexity. The test plots were
situated in regions of the Bavarian Forest National Park, Germany, which
sustained a heavy bark beetle infestation. Evaluations were performed on both
the polygon and line segment level, showing that the multi-contour segmentation
can achieve up to 0.93 precision and 0.82 recall. An improvement of up to 7
percentage points (pp) in recall and 6 in precision compared to an iterative
sample consensus line segment detection was achieved. Despite the simplicity of
the applied shape parametrization, an explicit shape model incorporated into
the energy function improved the results by up to 4 pp of recall. Finally, we
show the importance of using a deep learning based semantic segmentation method
as the basis for individual stem detection. Our method is a step towards
increased accessibility of automatic fallen tree mapping, due to higher cost
efficiency of aerial imagery acquisition compared to laser scanning. The
precise fallen tree maps could be further used as a basis for plant and animal
habitat modeling, studies on carbon sequestration as well as soil quality in
forest ecosystems.
Related papers
- Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers [59.0181939916084]
Traditional 3D networks mainly focus on local geometric details and ignore the topological structure between local geometries.
We propose a novel Priors Distillation (RPD) method to extract priors from the well-trained transformers on massive images.
Experiments on the PointDA-10 and the Sim-to-Real datasets verify that the proposed method consistently achieves the state-of-the-art performance of UDA for point cloud classification.
arXiv Detail & Related papers (2024-07-26T06:29:09Z) - Towards general deep-learning-based tree instance segmentation models [0.0]
Deep-learning methods have been proposed which show the potential of learning to segment trees.
We use seven diverse datasets found in literature to gain insights into the generalization capabilities under domain-shift.
Our results suggest that a generalization from coniferous dominated sparse point clouds to deciduous dominated high-resolution point clouds is possible.
arXiv Detail & Related papers (2024-05-03T12:42:43Z) - Benchmarking Individual Tree Mapping with Sub-meter Imagery [6.907098367807166]
We introduce an evaluation framework suited for individual tree mapping in any physical environment.
We review and compare different approaches and deep architectures, and introduce a new method that we experimentally prove to be a good compromise between segmentation and detection.
arXiv Detail & Related papers (2023-11-14T08:21:36Z) - Contour Context: Abstract Structural Distribution for 3D LiDAR Loop
Detection and Metric Pose Estimation [31.968749056155467]
This paper proposes a simple, effective, and efficient topological loop closure detection pipeline with accurate 3-DoF metric pose estimation.
We interpret the Cartesian birds' eye view (BEV) image projected from 3D LiDAR points as layered distribution of structures.
A retrieval key is designed to accelerate the search of a database indexed by layered KD-trees.
arXiv Detail & Related papers (2023-02-13T07:18:24Z) - CloudAttention: Efficient Multi-Scale Attention Scheme For 3D Point
Cloud Learning [81.85951026033787]
We set transformers in this work and incorporate them into a hierarchical framework for shape classification and part and scene segmentation.
We also compute efficient and dynamic global cross attentions by leveraging sampling and grouping at each iteration.
The proposed hierarchical model achieves state-of-the-art shape classification in mean accuracy and yields results on par with the previous segmentation methods.
arXiv Detail & Related papers (2022-07-31T21:39:15Z) - Augmenting Convolutional networks with attention-based aggregation [55.97184767391253]
We show how to augment any convolutional network with an attention-based global map to achieve non-local reasoning.
We plug this learned aggregation layer with a simplistic patch-based convolutional network parametrized by 2 parameters (width and depth)
It yields surprisingly competitive trade-offs between accuracy and complexity, in particular in terms of memory consumption.
arXiv Detail & Related papers (2021-12-27T14:05:41Z) - An Effective Leaf Recognition Using Convolutional Neural Networks Based
Features [1.137457877869062]
In this paper, we propose an effective method for the leaf recognition problem.
A leaf goes through some pre-processing to extract its refined color image, vein image, xy-projection histogram, handcrafted shape, texture features, and Fourier descriptors.
These attributes are then transformed into a better representation by neural network-based encoders before a support vector machine (SVM) model is utilized to classify different leaves.
arXiv Detail & Related papers (2021-08-04T02:02:22Z) - Multi-task fully convolutional network for tree species mapping in dense
forests using small training hyperspectral data [0.9646880519252493]
This work proposes a multi-task fully convolutional architecture for tree species mapping in dense forests.
Our model implements a partial loss function that enables dense tree semantic labeling outcomes from non-dense training samples.
A distance regression complementary task that enforces tree crown boundary constraints substantially improves the model performance.
arXiv Detail & Related papers (2021-06-01T21:10:10Z) - Two-View Fine-grained Classification of Plant Species [66.75915278733197]
We propose a novel method based on a two-view leaf image representation and a hierarchical classification strategy for fine-grained recognition of plant species.
A deep metric based on Siamese convolutional neural networks is used to reduce the dependence on a large number of training samples and make the method scalable to new plant species.
arXiv Detail & Related papers (2020-05-18T21:57:47Z) - Joint Multi-Dimension Pruning via Numerical Gradient Update [120.59697866489668]
We present joint multi-dimension pruning (abbreviated as JointPruning), an effective method of pruning a network on three crucial aspects: spatial, depth and channel simultaneously.
We show that our method is optimized collaboratively across the three dimensions in a single end-to-end training and it is more efficient than the previous exhaustive methods.
arXiv Detail & Related papers (2020-05-18T17:57:09Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.