Improving RGB-D Point Cloud Registration by Learning Multi-scale Local
Linear Transformation
- URL: http://arxiv.org/abs/2208.14893v2
- Date: Thu, 1 Sep 2022 01:01:29 GMT
- Title: Improving RGB-D Point Cloud Registration by Learning Multi-scale Local
Linear Transformation
- Authors: Ziming Wang, Xiaoliang Huo, Zhenghao Chen, Jing Zhang, Lu Sheng, Dong
Xu
- Abstract summary: Point cloud registration aims at estimating the geometric transformation between two point cloud scans.
Recent point cloud registration methods have tried to apply RGB-D data to achieve more accurate correspondence.
We propose a new Geometry-Aware Visual Feature Extractor (GAVE) that employs multi-scale local linear transformation.
- Score: 38.64501645574878
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Point cloud registration aims at estimating the geometric transformation
between two point cloud scans, in which point-wise correspondence estimation is
the key to its success. In addition to previous methods that seek
correspondences by hand-crafted or learnt geometric features, recent point
cloud registration methods have tried to apply RGB-D data to achieve more
accurate correspondence. However, it is not trivial to effectively fuse the
geometric and visual information from these two distinctive modalities,
especially for the registration problem. In this work, we propose a new
Geometry-Aware Visual Feature Extractor (GAVE) that employs multi-scale local
linear transformation to progressively fuse these two modalities, where the
geometric features from the depth data act as the geometry-dependent
convolution kernels to transform the visual features from the RGB data. The
resultant visual-geometric features are in canonical feature spaces with
alleviated visual dissimilarity caused by geometric changes, by which more
reliable correspondence can be achieved. The proposed GAVE module can be
readily plugged into recent RGB-D point cloud registration framework. Extensive
experiments on 3D Match and ScanNet demonstrate that our method outperforms the
state-of-the-art point cloud registration methods even without correspondence
or pose supervision. The code is available at: https://github.com/514DNA/LLT.
Related papers
- RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration [0.0]
We propose a new feature combination framework, which applies a looser but more effective combination.
An explicit filter based on transformation consistency is designed for the combination framework, which can overcome each feature's weakness.
Experiments on ScanNet and 3DMatch show that our method achieves a state-of-the-art performance.
arXiv Detail & Related papers (2024-05-13T09:56:28Z) - Zero-Shot Point Cloud Registration [94.39796531154303]
ZeroReg is the first zero-shot point cloud registration approach that eliminates the need for training on point cloud datasets.
The cornerstone of ZeroReg is the novel transfer of image features from keypoints to the point cloud, enriched by aggregating information from 3D geometric neighborhoods.
On benchmarks such as 3DMatch, 3DLoMatch, and ScanNet, ZeroReg achieves impressive Recall Ratios (RR) of over 84%, 46%, and 75%, respectively.
arXiv Detail & Related papers (2023-12-05T11:33:16Z) - PointMBF: A Multi-scale Bidirectional Fusion Network for Unsupervised
RGB-D Point Cloud Registration [6.030097207369754]
We propose a network implementing multi-scale bidirectional fusion between RGB images and point clouds generated from depth images.
Our method achieves new state-of-the-art performance.
arXiv Detail & Related papers (2023-08-09T08:13:46Z) - PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry [28.653015760036602]
We introduce a novel 3D point cloud registration module explicitly embedding the color signals into the geometry representation.
Our key contribution is a 2D-3D cross-modality learning algorithm that embeds the deep features learned from color signals to the geometry representation.
Our study reveals a significant advantages of correlating explicit deep color features to the point cloud in the registration task.
arXiv Detail & Related papers (2023-02-28T08:50:17Z) - Robust Point Cloud Registration Framework Based on Deep Graph
Matching(TPAMI Version) [13.286247750893681]
3D point cloud registration is a fundamental problem in computer vision and robotics.
We propose a novel deep graph matching-based framework for point cloud registration.
arXiv Detail & Related papers (2022-11-09T06:05:25Z) - PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal
Distillation for 3D Shape Recognition [55.38462937452363]
We propose a unified multi-view cross-modal distillation architecture, including a pretrained deep image encoder as the teacher and a deep point encoder as the student.
By pair-wise aligning multi-view visual and geometric descriptors, we can obtain more powerful deep point encoders without exhausting and complicated network modification.
arXiv Detail & Related papers (2022-07-07T07:23:20Z) - Robust Point Cloud Registration Framework Based on Deep Graph Matching [5.865029600972316]
3D point cloud registration is a fundamental problem in computer vision and robotics.
We propose a novel deep graph matchingbased framework for point cloud registration.
arXiv Detail & Related papers (2021-03-07T04:20:29Z) - Learning Geometry-Disentangled Representation for Complementary
Understanding of 3D Object Point Cloud [50.56461318879761]
We propose Geometry-Disentangled Attention Network (GDANet) for 3D image processing.
GDANet disentangles point clouds into contour and flat part of 3D objects, respectively denoted by sharp and gentle variation components.
Experiments on 3D object classification and segmentation benchmarks demonstrate that GDANet achieves the state-of-the-arts with fewer parameters.
arXiv Detail & Related papers (2020-12-20T13:35:00Z) - ParaNet: Deep Regular Representation for 3D Point Clouds [62.81379889095186]
ParaNet is a novel end-to-end deep learning framework for representing 3D point clouds.
It converts an irregular 3D point cloud into a regular 2D color image, named point geometry image (PGI)
In contrast to conventional regular representation modalities based on multi-view projection and voxelization, the proposed representation is differentiable and reversible.
arXiv Detail & Related papers (2020-12-05T13:19:55Z) - RPM-Net: Robust Point Matching using Learned Features [79.52112840465558]
RPM-Net is a less sensitive and more robust deep learning-based approach for rigid point cloud registration.
Unlike some existing methods, our RPM-Net handles missing correspondences and point clouds with partial visibility.
arXiv Detail & Related papers (2020-03-30T13:45:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.