Efficient 3D Point Cloud Feature Learning for Large-Scale Place
Recognition
- URL: http://arxiv.org/abs/2101.02374v1
- Date: Thu, 7 Jan 2021 05:15:31 GMT
- Title: Efficient 3D Point Cloud Feature Learning for Large-Scale Place
Recognition
- Authors: Le Hui, Mingmei Cheng, Jin Xie, Jian Yang
- Abstract summary: We develop an efficient point cloud learning network ( EPC-Net) to form a global descriptor for visual place recognition.
Our proposed method can achieve state-of-the-art performance with lower parameters, FLOPs, and runtime per frame.
- Score: 21.818744369503197
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Point cloud based retrieval for place recognition is still a challenging
problem due to drastic appearance and illumination changes of scenes in
changing environments. Existing deep learning based global descriptors for the
retrieval task usually consume a large amount of computation resources (e.g.,
memory), which may not be suitable for the cases of limited hardware resources.
In this paper, we develop an efficient point cloud learning network (EPC-Net)
to form a global descriptor for visual place recognition, which can obtain good
performance and reduce computation memory and inference time. First, we propose
a lightweight but effective neural network module, called ProxyConv, to
aggregate the local geometric features of point clouds. We leverage the spatial
adjacent matrix and proxy points to simplify the original edge convolution for
lower memory consumption. Then, we design a lightweight grouped VLAD network
(G-VLAD) to form global descriptors for retrieval. Compared with the original
VLAD network, we propose a grouped fully connected (GFC) layer to decompose the
high-dimensional vectors into a group of low-dimensional vectors, which can
reduce the number of parameters of the network and maintain the discrimination
of the feature vector. Finally, to further reduce the inference time, we
develop a simple version of EPC-Net, called EPC-Net-L, which consists of two
ProxyConv modules and one max pooling layer to aggregate global descriptors. By
distilling the knowledge from EPC-Net, EPC-Net-L can obtain discriminative
global descriptors for retrieval. Extensive experiments on the Oxford dataset
and three in-house datasets demonstrate that our proposed method can achieve
state-of-the-art performance with lower parameters, FLOPs, and runtime per
frame.
Related papers
- ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection [65.59969454655996]
We propose an efficient change detection framework, ELGC-Net, which leverages rich contextual information to precisely estimate change regions.
Our proposed ELGC-Net sets a new state-of-the-art performance in remote sensing change detection benchmarks.
We also introduce ELGC-Net-LW, a lighter variant with significantly reduced computational complexity, suitable for resource-constrained settings.
arXiv Detail & Related papers (2024-03-26T17:46:25Z) - PointeNet: A Lightweight Framework for Effective and Efficient Point
Cloud Analysis [28.54939134635978]
PointeNet is a network designed specifically for point cloud analysis.
Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks.
Experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNet KITTI, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis.
arXiv Detail & Related papers (2023-12-20T03:34:48Z) - LACV-Net: Semantic Segmentation of Large-Scale Point Cloud Scene via
Local Adaptive and Comprehensive VLAD [13.907586081922345]
We propose an end-to-end deep neural network called LACV-Net for large-scale point cloud semantic segmentation.
The proposed network contains three main components: 1) a local adaptive feature augmentation module (LAFA) to adaptively learn the similarity of centroids and neighboring points to augment the local context; 2) a comprehensive VLAD module that fuses local features with multi-layer, multi-scale, and multi-resolution to represent a comprehensive global description vector; and 3) an aggregation loss function to effectively optimize the segmentation boundaries by constraining the adaptive weight from the LAFA module.
arXiv Detail & Related papers (2022-10-12T02:11:00Z) - UPDesc: Unsupervised Point Descriptor Learning for Robust Registration [54.95201961399334]
UPDesc is an unsupervised method to learn point descriptors for robust point cloud registration.
We show that our learned descriptors yield superior performance over existing unsupervised methods.
arXiv Detail & Related papers (2021-08-05T17:11:08Z) - Learning Semantic Segmentation of Large-Scale Point Clouds with Random
Sampling [52.464516118826765]
We introduce RandLA-Net, an efficient and lightweight neural architecture to infer per-point semantics for large-scale point clouds.
The key to our approach is to use random point sampling instead of more complex point selection approaches.
Our RandLA-Net can process 1 million points in a single pass up to 200x faster than existing approaches.
arXiv Detail & Related papers (2021-07-06T05:08:34Z) - PIG-Net: Inception based Deep Learning Architecture for 3D Point Cloud
Segmentation [0.9137554315375922]
We propose a inception based deep network architecture called PIG-Net, that effectively characterizes the local and global geometric details of the point clouds.
We perform an exhaustive experimental analysis of the PIG-Net architecture on two state-of-the-art datasets.
arXiv Detail & Related papers (2021-01-28T13:27:55Z) - SoftPoolNet: Shape Descriptor for Point Cloud Completion and
Classification [93.54286830844134]
We propose a method for 3D object completion and classification based on point clouds.
For the decoder stage, we propose regional convolutions, a novel operator aimed at maximizing the global activation entropy.
We evaluate our approach on different 3D tasks such as object completion and classification, achieving state-of-the-art accuracy.
arXiv Detail & Related papers (2020-08-17T14:32:35Z) - Local Grid Rendering Networks for 3D Object Detection in Point Clouds [98.02655863113154]
CNNs are powerful but it would be computationally costly to directly apply convolutions on point data after voxelizing the entire point clouds to a dense regular 3D grid.
We propose a novel and principled Local Grid Rendering (LGR) operation to render the small neighborhood of a subset of input points into a low-resolution 3D grid independently.
We validate LGR-Net for 3D object detection on the challenging ScanNet and SUN RGB-D datasets.
arXiv Detail & Related papers (2020-07-04T13:57:43Z) - LiteDenseNet: A Lightweight Network for Hyperspectral Image
Classification [2.696926374562295]
We propose a lightweight network architecture (LiteDenseNet) based on DenseNet for Hyperspectral Image Classification.
Inspired by GoogLeNet and PeleeNet, we design a 3D two-way dense layer to capture the local and global features of the input.
As convolution is a computationally intensive operation, we introduce group convolution to decrease calculation cost and parameter size further.
arXiv Detail & Related papers (2020-04-17T08:38:52Z) - Dense Residual Network: Enhancing Global Dense Feature Flow for
Character Recognition [75.4027660840568]
This paper explores how to enhance the local and global dense feature flow by exploiting hierarchical features fully from all the convolution layers.
Technically, we propose an efficient and effective CNN framework, i.e., Fast Dense Residual Network (FDRN) for text recognition.
arXiv Detail & Related papers (2020-01-23T06:55:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.