CorrNet3D: Unsupervised End-to-end Learning of Dense Correspondence for
3D Point Clouds
- URL: http://arxiv.org/abs/2012.15638v1
- Date: Thu, 31 Dec 2020 14:55:51 GMT
- Title: CorrNet3D: Unsupervised End-to-end Learning of Dense Correspondence for
3D Point Clouds
- Authors: Yiming Zeng, Yue Qian, Zhiyu Zhu, Junhui Hou, Hui Yuan, Ying He
- Abstract summary: This paper addresses the problem of computing dense correspondence between 3D shapes in the form of point clouds.
We propose CorrNet3D -- the first unsupervised and end-to-end deep learning-based framework.
- Score: 48.22275177437932
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper addresses the problem of computing dense correspondence between 3D
shapes in the form of point clouds, which is a challenging and fundamental
problem in computer vision and digital geometry processing. Conventional
approaches often solve the problem in a supervised manner, requiring massive
annotated data, which is difficult and/or expensive to obtain. Motivated by the
intuition that one can transform two aligned point clouds to each other more
easily and meaningfully than a misaligned pair, we propose CorrNet3D -- the
first unsupervised and end-to-end deep learning-based framework -- to drive the
learning of dense correspondence by means of deformation-like reconstruction to
overcome the need for annotated data. Specifically, CorrNet3D consists of a
deep feature embedding module and two novel modules called correspondence
indicator and symmetric deformation. Feeding a pair of raw point clouds, our
model first learns the pointwise features and passes them into the indicator to
generate a learnable correspondence matrix used to permute the input pair. The
symmetric deformer, with an additional regularized loss, transforms the two
permuted point clouds to each other to drive the unsupervised learning of the
correspondence. The extensive experiments on both synthetic and real-world
datasets of rigid and non-rigid 3D shapes show our CorrNet3D outperforms
state-of-the-art methods to a large extent, including those taking meshes as
input. CorrNet3D is a flexible framework in that it can be easily adapted to
supervised learning if annotated data are available.
Related papers
- IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding
Alignment [58.8330387551499]
We formulate the problem as estimation of point-wise trajectories (i.e., smooth curves)
We propose IDEA-Net, an end-to-end deep learning framework, which disentangles the problem under the assistance of the explicitly learned temporal consistency.
We demonstrate the effectiveness of our method on various point cloud sequences and observe large improvement over state-of-the-art methods both quantitatively and visually.
arXiv Detail & Related papers (2022-03-22T10:14:08Z) - CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D
Point Cloud Understanding [2.8661021832561757]
CrossPoint is a simple cross-modal contrastive learning approach to learn transferable 3D point cloud representations.
Our approach outperforms the previous unsupervised learning methods on a diverse range of downstream tasks including 3D object classification and segmentation.
arXiv Detail & Related papers (2022-03-01T18:59:01Z) - DFC: Deep Feature Consistency for Robust Point Cloud Registration [0.4724825031148411]
We present a novel learning-based alignment network for complex alignment scenes.
We validate our approach on the 3DMatch dataset and the KITTI odometry dataset.
arXiv Detail & Related papers (2021-11-15T08:27:21Z) - Unsupervised Dense Deformation Embedding Network for Template-Free Shape
Correspondence [18.48814403488283]
Current deep learning based methods require the supervision of dense annotations to learn per-point translations.
We develop a new Unsupervised Deformation Embedding Network (i.e., UD2E-Net), which learns to predict deformations between non-rigid shapes from dense local features.
Our UD2E-Net outperforms state-of-the-art unsupervised methods by 24% on Faust Inter challenge and even supervised methods by 13% on Faust Intra challenge.
arXiv Detail & Related papers (2021-08-26T07:07:19Z) - Adjoint Rigid Transform Network: Task-conditioned Alignment of 3D Shapes [86.2129580231191]
Adjoint Rigid Transform (ART) Network is a neural module which can be integrated with a variety of 3D networks.
ART learns to rotate input shapes to a learned canonical orientation, which is crucial for a lot of tasks.
We will release our code and pre-trained models for further research.
arXiv Detail & Related papers (2021-02-01T20:58:45Z) - Learning Geometry-Disentangled Representation for Complementary
Understanding of 3D Object Point Cloud [50.56461318879761]
We propose Geometry-Disentangled Attention Network (GDANet) for 3D image processing.
GDANet disentangles point clouds into contour and flat part of 3D objects, respectively denoted by sharp and gentle variation components.
Experiments on 3D object classification and segmentation benchmarks demonstrate that GDANet achieves the state-of-the-arts with fewer parameters.
arXiv Detail & Related papers (2020-12-20T13:35:00Z) - ParaNet: Deep Regular Representation for 3D Point Clouds [62.81379889095186]
ParaNet is a novel end-to-end deep learning framework for representing 3D point clouds.
It converts an irregular 3D point cloud into a regular 2D color image, named point geometry image (PGI)
In contrast to conventional regular representation modalities based on multi-view projection and voxelization, the proposed representation is differentiable and reversible.
arXiv Detail & Related papers (2020-12-05T13:19:55Z) - Gram Regularization for Multi-view 3D Shape Retrieval [3.655021726150368]
We propose a novel regularization term called Gram regularization.
By forcing the variance between weight kernels to be large, the regularizer can help to extract discriminative features.
The proposed Gram regularization is data independent and can converge stably and quickly without bells and whistles.
arXiv Detail & Related papers (2020-11-16T05:37:24Z) - Learning 2D-3D Correspondences To Solve The Blind Perspective-n-Point
Problem [98.92148855291363]
This paper proposes a deep CNN model which simultaneously solves for both 6-DoF absolute camera pose 2D--3D correspondences.
Tests on both real and simulated data have shown that our method substantially outperforms existing approaches.
arXiv Detail & Related papers (2020-03-15T04:17:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.