Dynamic Convolution for 3D Point Cloud Instance Segmentation
- URL: http://arxiv.org/abs/2107.08392v1
- Date: Sun, 18 Jul 2021 09:05:16 GMT
- Title: Dynamic Convolution for 3D Point Cloud Instance Segmentation
- Authors: Tong He, Chunhua Shen, Anton van den Hengel
- Abstract summary: We propose an approach to instance segmentation from 3D point clouds based on dynamic convolution.
We gather homogeneous points that have identical semantic categories and close votes for the geometric centroids.
The proposed approach is proposal-free, and instead exploits a convolution process that adapts to the spatial and semantic characteristics of each instance.
- Score: 146.7971476424351
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We propose an approach to instance segmentation from 3D point clouds based on
dynamic convolution. This enables it to adapt, at inference, to varying feature
and object scales. Doing so avoids some pitfalls of bottom up approaches,
including a dependence on hyper-parameter tuning and heuristic post-processing
pipelines to compensate for the inevitable variability in object sizes, even
within a single scene. The representation capability of the network is greatly
improved by gathering homogeneous points that have identical semantic
categories and close votes for the geometric centroids. Instances are then
decoded via several simple convolution layers, where the parameters are
generated conditioned on the input. The proposed approach is proposal-free, and
instead exploits a convolution process that adapts to the spatial and semantic
characteristics of each instance. A light-weight transformer, built on the
bottleneck layer, allows the model to capture long-range dependencies, with
limited computational overhead. The result is a simple, efficient, and robust
approach that yields strong performance on various datasets: ScanNetV2, S3DIS,
and PartNet. The consistent improvements on both voxel- and point-based
architectures imply the effectiveness of the proposed method. Code is available
at: https://git.io/DyCo3D
Related papers
- ISBNet: a 3D Point Cloud Instance Segmentation Network with
Instance-aware Sampling and Box-aware Dynamic Convolution [14.88505076974645]
ISBNet is a novel method that represents instances as kernels and decodes instance masks via dynamic convolution.
We set new state-of-the-art results on ScanNetV2 (55.9), S3DIS (60.8), S3LS3D (49.2) in terms of AP and retains fast inference time (237ms per scene on ScanNetV2.
arXiv Detail & Related papers (2023-03-01T06:06:28Z) - CloudAttention: Efficient Multi-Scale Attention Scheme For 3D Point
Cloud Learning [81.85951026033787]
We set transformers in this work and incorporate them into a hierarchical framework for shape classification and part and scene segmentation.
We also compute efficient and dynamic global cross attentions by leveraging sampling and grouping at each iteration.
The proposed hierarchical model achieves state-of-the-art shape classification in mean accuracy and yields results on par with the previous segmentation methods.
arXiv Detail & Related papers (2022-07-31T21:39:15Z) - PointInst3D: Segmenting 3D Instances by Points [136.7261709896713]
We propose a fully-convolutional 3D point cloud instance segmentation method that works in a per-point prediction fashion.
We find the key to its success is assigning a suitable target to each sampled point.
Our approach achieves promising results on both ScanNet and S3DIS benchmarks.
arXiv Detail & Related papers (2022-04-25T02:41:46Z) - Stratified Transformer for 3D Point Cloud Segmentation [89.9698499437732]
Stratified Transformer is able to capture long-range contexts and demonstrates strong generalization ability and high performance.
To combat the challenges posed by irregular point arrangements, we propose first-layer point embedding to aggregate local information.
Experiments demonstrate the effectiveness and superiority of our method on S3DIS, ScanNetv2 and ShapeNetPart datasets.
arXiv Detail & Related papers (2022-03-28T05:35:16Z) - Fine-Grained Dynamic Head for Object Detection [68.70628757217939]
We propose a fine-grained dynamic head to conditionally select a pixel-level combination of FPN features from different scales for each instance.
Experiments demonstrate the effectiveness and efficiency of the proposed method on several state-of-the-art detection benchmarks.
arXiv Detail & Related papers (2020-12-07T08:16:32Z) - DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic
Convolution [136.7261709896713]
We propose a data-driven approach that generates the appropriate convolution kernels to apply in response to the nature of the instances.
The proposed method achieves promising results on both ScanetNetV2 and S3DIS.
It also improves inference speed by more than 25% over the current state-of-the-art.
arXiv Detail & Related papers (2020-11-26T14:56:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.