Window Normalization: Enhancing Point Cloud Understanding by Unifying
Inconsistent Point Densities
- URL: http://arxiv.org/abs/2212.02287v1
- Date: Mon, 5 Dec 2022 14:09:07 GMT
- Title: Window Normalization: Enhancing Point Cloud Understanding by Unifying
Inconsistent Point Densities
- Authors: Qi Wang, Sheng Shi, Jiahui Li, Wuming Jiang, Xiangde Zhang
- Abstract summary: Downsampling and feature extraction are essential procedures for 3D point cloud understanding.
Window-normalization method is leveraged to unify the point densities in different parts.
Group-wise strategy is proposed to obtain multi-type features, including texture and spatial information.
- Score: 16.770190781915673
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Downsampling and feature extraction are essential procedures for 3D point
cloud understanding. Existing methods are limited by the inconsistent point
densities of different parts in the point cloud. In this work, we analyze the
limitation of the downsampling stage and propose the pre-abstraction group-wise
window-normalization module. In particular, the window-normalization method is
leveraged to unify the point densities in different parts. Furthermore, the
group-wise strategy is proposed to obtain multi-type features, including
texture and spatial information. We also propose the pre-abstraction module to
balance local and global features. Extensive experiments show that our module
performs better on several tasks. In segmentation tasks on S3DIS (Area 5), the
proposed module performs better on small object recognition, and the results
have more precise boundaries than others. The recognition of the sofa and the
column is improved from 69.2% to 84.4% and from 42.7% to 48.7%, respectively.
The benchmarks are improved from 71.7%/77.6%/91.9% (mIoU/mAcc/OA) to
72.2%/78.2%/91.4%. The accuracies of 6-fold cross-validation on S3DIS are
77.6%/85.8%/91.7%. It outperforms the best model PointNeXt-XL
(74.9%/83.0%/90.3%) by 2.7% on mIoU and achieves state-of-the-art performance.
The code and models are available at
https://github.com/DBDXSS/Window-Normalization.git.
Related papers
- RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation [17.558376773179337]
We propose a novel yet effective rotation invariant architecture for 3D point cloud classification and segmentation.
We build an effective neural network for 3D point cloud analysis that is invariant to arbitrary rotations while maintaining high accuracy.
arXiv Detail & Related papers (2024-08-12T12:47:37Z) - On-the-fly Point Feature Representation for Point Clouds Analysis [7.074010861305738]
We propose On-the-fly Point Feature Representation (OPFR), which captures abundant geometric information explicitly through Curve Feature Generator module.
We also introduce the Local Reference Constructor module, which approximates the local coordinate systems based on triangle sets.
OPFR only requires extra 1.56ms for inference (65x faster than vanilla PFH) and 0.012M more parameters, and it can serve as a versatile plug-and-play module for various backbones.
arXiv Detail & Related papers (2024-07-31T04:57:06Z) - Point Cloud Mamba: Point Cloud Learning via State Space Model [73.7454734756626]
We show that Mamba-based point cloud methods can outperform previous methods based on transformer or multi-layer perceptrons (MLPs)
In particular, we demonstrate that Mamba-based point cloud methods can outperform previous methods based on transformer or multi-layer perceptrons (MLPs)
Point Cloud Mamba surpasses the state-of-the-art (SOTA) point-based method PointNeXt and achieves new SOTA performance on the ScanNN, ModelNet40, ShapeNetPart, and S3DIS datasets.
arXiv Detail & Related papers (2024-03-01T18:59:03Z) - Less is More: Fewer Interpretable Region via Submodular Subset Selection [54.07758302264416]
This paper re-models the above image attribution problem as a submodular subset selection problem.
We construct a novel submodular function to discover more accurate small interpretation regions.
For correctly predicted samples, the proposed method improves the Deletion and Insertion scores with an average of 4.9% and 2.5% gain relative to HSIC-Attribution.
arXiv Detail & Related papers (2024-02-14T13:30:02Z) - Sparse4D v3: Advancing End-to-End 3D Detection and Tracking [12.780544029261353]
We introduce two auxiliary training tasks and propose decoupled attention to make structural improvements.
We extend the detector into a tracker using a straightforward approach that assigns instance ID during inference.
Our best model achieved 71.9% NDS and 67.7% AMOTA on the nuScenes test set.
arXiv Detail & Related papers (2023-11-20T12:37:58Z) - Spherical Transformer for LiDAR-based 3D Recognition [48.44153945515335]
We study the varying-sparsity distribution of LiDAR points and present SphereFormer to directly aggregate information from dense close points to sparse distant ones.
We design radial window self-attention that partitions the space into multiple non-overlapping narrow and long windows.
To fit the narrow and long windows, we propose exponential splitting to yield fine-grained position encoding and dynamic feature selection.
arXiv Detail & Related papers (2023-03-22T17:30:14Z) - CloudAttention: Efficient Multi-Scale Attention Scheme For 3D Point
Cloud Learning [81.85951026033787]
We set transformers in this work and incorporate them into a hierarchical framework for shape classification and part and scene segmentation.
We also compute efficient and dynamic global cross attentions by leveraging sampling and grouping at each iteration.
The proposed hierarchical model achieves state-of-the-art shape classification in mean accuracy and yields results on par with the previous segmentation methods.
arXiv Detail & Related papers (2022-07-31T21:39:15Z) - Dynamic Convolution for 3D Point Cloud Instance Segmentation [146.7971476424351]
We propose an approach to instance segmentation from 3D point clouds based on dynamic convolution.
We gather homogeneous points that have identical semantic categories and close votes for the geometric centroids.
The proposed approach is proposal-free, and instead exploits a convolution process that adapts to the spatial and semantic characteristics of each instance.
arXiv Detail & Related papers (2021-07-18T09:05:16Z) - PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation [111.7241018610573]
We present PointGroup, a new end-to-end bottom-up architecture for instance segmentation.
We design a two-branch network to extract point features and predict semantic labels and offsets, for shifting each point towards its respective instance centroid.
A clustering component is followed to utilize both the original and offset-shifted point coordinate sets, taking advantage of their complementary strength.
We conduct extensive experiments on two challenging datasets, ScanNet v2 and S3DIS, on which our method achieves the highest performance, 63.6% and 64.0%, compared to 54.9% and 54.4% achieved by former best
arXiv Detail & Related papers (2020-04-03T16:26:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.