FPConv: Learning Local Flattening for Point Convolution
- URL: http://arxiv.org/abs/2002.10701v3
- Date: Sat, 14 Mar 2020 04:13:13 GMT
- Title: FPConv: Learning Local Flattening for Point Convolution
- Authors: Yiqun Lin, Zizheng Yan, Haibin Huang, Dong Du, Ligang Liu, Shuguang
Cui and Xiaoguang Han
- Abstract summary: We introduce FPConv, a novel surface-style convolution operator designed for 3D point cloud analysis.
Unlike previous methods, FPConv doesn't require transforming to intermediate representation like 3D grid or graph.
FPConv can be easily integrated into various network architectures for tasks like 3D object classification and 3D scene segmentation.
- Score: 64.01196188303483
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce FPConv, a novel surface-style convolution operator designed for
3D point cloud analysis. Unlike previous methods, FPConv doesn't require
transforming to intermediate representation like 3D grid or graph and directly
works on surface geometry of point cloud. To be more specific, for each point,
FPConv performs a local flattening by automatically learning a weight map to
softly project surrounding points onto a 2D grid. Regular 2D convolution can
thus be applied for efficient feature learning. FPConv can be easily integrated
into various network architectures for tasks like 3D object classification and
3D scene segmentation, and achieve comparable performance with existing
volumetric-type convolutions. More importantly, our experiments also show that
FPConv can be a complementary of volumetric convolutions and jointly training
them can further boost overall performance into state-of-the-art results.
Related papers
- Dynamic 3D Point Cloud Sequences as 2D Videos [81.46246338686478]
3D point cloud sequences serve as one of the most common and practical representation modalities of real-world environments.
We propose a novel generic representation called textitStructured Point Cloud Videos (SPCVs)
SPCVs re-organizes a point cloud sequence as a 2D video with spatial smoothness and temporal consistency, where the pixel values correspond to the 3D coordinates of points.
arXiv Detail & Related papers (2024-03-02T08:18:57Z) - MVTN: Learning Multi-View Transformations for 3D Understanding [60.15214023270087]
We introduce the Multi-View Transformation Network (MVTN), which uses differentiable rendering to determine optimal view-points for 3D shape recognition.
MVTN can be trained end-to-end with any multi-view network for 3D shape recognition.
Our approach demonstrates state-of-the-art performance in 3D classification and shape retrieval on several benchmarks.
arXiv Detail & Related papers (2022-12-27T12:09:16Z) - Flattening-Net: Deep Regular 2D Representation for 3D Point Cloud
Analysis [66.49788145564004]
We present an unsupervised deep neural architecture called Flattening-Net to represent irregular 3D point clouds of arbitrary geometry and topology.
Our methods perform favorably against the current state-of-the-art competitors.
arXiv Detail & Related papers (2022-12-17T15:05:25Z) - Projection-based Point Convolution for Efficient Point Cloud
Segmentation [24.375383511061955]
Projection-based Point Convolution (PPConv) is a point convolutional module that uses 2D convolutions and multi-layer perceptrons (MLPs) as its components.
PPConv achieves superior efficiency compared to state-of-the-art methods, even with a simple architecture based on PointNet++.
arXiv Detail & Related papers (2022-02-04T06:22:33Z) - CpT: Convolutional Point Transformer for 3D Point Cloud Processing [10.389972581905]
We present CpT: Convolutional point Transformer - a novel deep learning architecture for dealing with the unstructured nature of 3D point cloud data.
CpT is an improvement over existing attention-based Convolutions Neural Networks as well as previous 3D point cloud processing transformers.
Our model can serve as an effective backbone for various point cloud processing tasks when compared to the existing state-of-the-art approaches.
arXiv Detail & Related papers (2021-11-21T17:45:55Z) - Concentric Spherical GNN for 3D Representation Learning [53.45704095146161]
We propose a novel multi-resolution convolutional architecture for learning over concentric spherical feature maps.
Our hierarchical architecture is based on alternatively learning to incorporate both intra-sphere and inter-sphere information.
We demonstrate the effectiveness of our approach in improving state-of-the-art performance on 3D classification tasks with rotated data.
arXiv Detail & Related papers (2021-03-18T19:05:04Z) - The Devils in the Point Clouds: Studying the Robustness of Point Cloud
Convolutions [15.997907568429177]
This paper investigates different variants of PointConv, a convolution network on point clouds, to examine their robustness to input scale and rotation changes.
We derive a novel viewpoint-invariant descriptor by utilizing 3D geometric properties as the input to PointConv.
Experiments are conducted on the 2D MNIST & CIFAR-10 datasets as well as the 3D Semantic KITTI & ScanNet dataset.
arXiv Detail & Related papers (2021-01-19T19:32:38Z) - KAPLAN: A 3D Point Descriptor for Shape Completion [80.15764700137383]
KAPLAN is a 3D point descriptor that aggregates local shape information via a series of 2D convolutions.
In each of those planes, point properties like normals or point-to-plane distances are aggregated into a 2D grid and abstracted into a feature representation with an efficient 2D convolutional encoder.
Experiments on public datasets show that KAPLAN achieves state-of-the-art performance for 3D shape completion.
arXiv Detail & Related papers (2020-07-31T21:56:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.