Iterative Feedback Network for Unsupervised Point Cloud Registration
- URL: http://arxiv.org/abs/2401.04357v1
- Date: Tue, 9 Jan 2024 04:44:12 GMT
- Title: Iterative Feedback Network for Unsupervised Point Cloud Registration
- Authors: Yifan Xie, Boyu Wang, Shiqi Li and Jihua Zhu
- Abstract summary: We propose a novel Iterative Feedback Network (IFNet) for unsupervised point cloud registration.
Our IFNet is built upon a series of Feedback Registration Block (FRB) modules, with each module responsible for generating the feedforward rigid transformation and feedback high-level features.
Our experiments on various benchmark datasets demonstrate the superior registration performance of our IFNet.
- Score: 17.41663459141476
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As a fundamental problem in computer vision, point cloud registration aims to
seek the optimal transformation for aligning a pair of point clouds. In most
existing methods, the information flows are usually forward transferring, thus
lacking the guidance from high-level information to low-level information.
Besides, excessive high-level information may be overly redundant, and directly
using it may conflict with the original low-level information. In this paper,
we propose a novel Iterative Feedback Network (IFNet) for unsupervised point
cloud registration, in which the representation of low-level features is
efficiently enriched by rerouting subsequent high-level features. Specifically,
our IFNet is built upon a series of Feedback Registration Block (FRB) modules,
with each module responsible for generating the feedforward rigid
transformation and feedback high-level features. These FRB modules are cascaded
and recurrently unfolded over time. Further, the Feedback Transformer is
designed to efficiently select relevant information from feedback high-level
features, which is utilized to refine the low-level features. What's more, we
incorporate a geometry-awareness descriptor to empower the network for making
full use of most geometric information, which leads to more precise
registration results. Extensive experiments on various benchmark datasets
demonstrate the superior registration performance of our IFNet.
Related papers
- HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models [96.76995840807615]
HiRes-LLaVA is a novel framework designed to process any size of high-resolution input without altering the original contextual and geometric information.
HiRes-LLaVA comprises two innovative components: (i) a SliceRestore adapter that reconstructs sliced patches into their original form, efficiently extracting both global and local features via down-up-sampling and convolution layers, and (ii) a Self-Mining Sampler to compress the vision tokens based on themselves.
arXiv Detail & Related papers (2024-07-11T17:42:17Z) - Bidirectional Knowledge Reconfiguration for Lightweight Point Cloud
Analysis [74.00441177577295]
Point cloud analysis faces computational system overhead, limiting its application on mobile or edge devices.
This paper explores feature distillation for lightweight point cloud models.
We propose bidirectional knowledge reconfiguration to distill informative contextual knowledge from the teacher to the student.
arXiv Detail & Related papers (2023-10-08T11:32:50Z) - RegFormer: An Efficient Projection-Aware Transformer Network for
Large-Scale Point Cloud Registration [73.69415797389195]
We propose an end-to-end transformer network (RegFormer) for large-scale point cloud alignment.
Specifically, a projection-aware hierarchical transformer is proposed to capture long-range dependencies and filter outliers.
Our transformer has linear complexity, which guarantees high efficiency even for large-scale scenes.
arXiv Detail & Related papers (2023-03-22T08:47:37Z) - FBNet: Feedback Network for Point Cloud Completion [35.89264923599902]
We propose a novel Feedback Network (FBNet) for point cloud completion, in which present features are efficiently refined by rerouting subsequent fine-grained ones.
The main challenge of building feedback connections is the mismatching between present and subsequent features.
To address this, the elaborately designed point Cross Transformer exploits efficient information from feedback features via cross attention strategy.
arXiv Detail & Related papers (2022-10-08T09:12:37Z) - LC3Net: Ladder context correlation complementary network for salient
object detection [0.32116198597240836]
We propose a novel ladder context correlation complementary network (LC3Net)
FCB is a filterable convolution block to assist the automatic collection of information on the diversity of initial features.
DCM is a dense cross module to facilitate the intimate aggregation of different levels of features.
BCD is a bidirectional compression decoder to help the progressive shrinkage of multi-scale features.
arXiv Detail & Related papers (2021-10-21T03:12:32Z) - Feedback Pyramid Attention Networks for Single Image Super-Resolution [37.58180059860872]
We propose feedback pyramid attention networks (FPAN) to fully exploit the mutual dependencies of features.
In our method, the output of each layer in the first stage is also used as the input of the corresponding layer in the next state to re-update the previous low-level filters.
We introduce a pyramid non-local structure to model global contextual information in different scales and improve the discriminative representation of the network.
arXiv Detail & Related papers (2021-06-13T11:32:53Z) - Densely Nested Top-Down Flows for Salient Object Detection [137.74130900326833]
This paper revisits the role of top-down modeling in salient object detection.
It designs a novel densely nested top-down flows (DNTDF)-based framework.
In every stage of DNTDF, features from higher levels are read in via the progressive compression shortcut paths (PCSP)
arXiv Detail & Related papers (2021-02-18T03:14:02Z) - Hierarchical Bi-Directional Feature Perception Network for Person
Re-Identification [12.259747100939078]
Previous Person Re-Identification (Re-ID) models aim to focus on the most discriminative region of an image.
We propose a novel model named Hierarchical Bi-directional Feature Perception Network (HBFP-Net) to correlate multi-level information and reinforce each other.
Experiments implemented on the mainstream evaluation including Market-1501, CUHK03 and DukeMTMC-ReID datasets show that our method outperforms the recent SOTA Re-ID models.
arXiv Detail & Related papers (2020-08-08T12:33:32Z) - Multi-Scale Boosted Dehazing Network with Dense Feature Fusion [92.92572594942071]
We propose a Multi-Scale Boosted Dehazing Network with Dense Feature Fusion based on the U-Net architecture.
We show that the proposed model performs favorably against the state-of-the-art approaches on the benchmark datasets as well as real-world hazy images.
arXiv Detail & Related papers (2020-04-28T09:34:47Z) - Global Context-Aware Progressive Aggregation Network for Salient Object
Detection [117.943116761278]
We propose a novel network named GCPANet to integrate low-level appearance features, high-level semantic features, and global context features.
We show that the proposed approach outperforms the state-of-the-art methods both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-03-02T04:26:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.