One Point is All You Need: Directional Attention Point for Feature
Learning
- URL: http://arxiv.org/abs/2012.06257v2
- Date: Mon, 14 Dec 2020 06:47:12 GMT
- Title: One Point is All You Need: Directional Attention Point for Feature
Learning
- Authors: Liqiang Lin, Pengdi Huang, Chi-Wing Fu, Kai Xu, Hao Zhang, Hui Huang
- Abstract summary: We present a novel attention-based mechanism for learning enhanced point features for tasks such as point cloud classification and segmentation.
We show that our attention mechanism can be easily incorporated into state-of-the-art point cloud classification and segmentation networks.
- Score: 51.44837108615402
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We present a novel attention-based mechanism for learning enhanced point
features for tasks such as point cloud classification and segmentation. Our key
message is that if the right attention point is selected, then "one point is
all you need" -- not a sequence as in a recurrent model and not a pre-selected
set as in all prior works. Moreover, the location of the attention point should be
learned from data and be specific to the task at hand. Our mechanism is
characterized by a new and simple convolution, which combines the feature at an
input point with the feature at its associated attention point. We call such a
point a directional attention point (DAP), since it is found by adding to the
original point an offset vector that is learned by maximizing the task
performance in training. We show that our attention mechanism can be easily
incorporated into state-of-the-art point cloud classification and segmentation
networks. Extensive experiments on common benchmarks such as ModelNet40,
ShapeNetPart, and S3DIS demonstrate that our DAP-enabled networks consistently
outperform the respective original networks, as well as all other competitive
alternatives, including those employing pre-selected sets of attention points.
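To make the described mechanism concrete, below is a minimal PyTorch-style sketch of what such a directional-attention-point convolution could look like. This is an illustration under assumptions, not the authors' implementation: the offset predictor (offset_mlp), the nearest-neighbor feature gather, and the fusion MLP (fuse) are illustrative choices; the abstract only states that a per-point offset vector is learned end-to-end and that the feature at the resulting attention point is combined with the input point's feature.

```python
import torch
import torch.nn as nn

class DAPConv(nn.Module):
    """Illustrative sketch of a directional-attention-point (DAP) convolution.

    Assumptions (not taken from the paper's code): the offset is predicted by
    a small MLP on per-point features, the attention point's feature is
    approximated by its nearest neighbor in the input cloud, and the two
    features are fused by a shared point-wise MLP.
    """

    def __init__(self, in_dim: int, out_dim: int):
        super().__init__()
        # Predicts a 3D offset vector for each point from its feature.
        self.offset_mlp = nn.Sequential(
            nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, 3)
        )
        # Fuses the input-point feature with the attention-point feature.
        self.fuse = nn.Sequential(nn.Linear(2 * in_dim, out_dim), nn.ReLU())

    def forward(self, xyz: torch.Tensor, feats: torch.Tensor) -> torch.Tensor:
        # xyz:   (B, N, 3) point coordinates; feats: (B, N, C) point features
        offsets = self.offset_mlp(feats)          # (B, N, 3), trained end-to-end
        attn_xyz = xyz + offsets                  # directional attention points

        # Fetch the feature at each attention point via its nearest neighbor
        # in the original cloud (one simple choice; interpolation also works).
        dist = torch.cdist(attn_xyz, xyz)         # (B, N, N)
        nn_idx = dist.argmin(dim=-1)              # (B, N)
        attn_feats = torch.gather(
            feats, 1, nn_idx.unsqueeze(-1).expand(-1, -1, feats.size(-1))
        )

        # Combine the feature at the input point with the feature at its
        # associated attention point, as the abstract describes.
        return self.fuse(torch.cat([feats, attn_feats], dim=-1))
```

A DAP-enabled network would place such a layer after an existing point-feature extractor, e.g. feats = DAPConv(64, 128)(xyz, feats); since no extra supervision is applied to the offsets, they are shaped only by the task loss, matching the abstract's statement that the offset is learned by maximizing task performance in training.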
Related papers
- Point Cloud Understanding via Attention-Driven Contrastive Learning [64.65145700121442]
Transformer-based models have advanced point cloud understanding by leveraging self-attention mechanisms.
PointACL is an attention-driven contrastive learning framework designed to address the limitations of such models.
Our method employs an attention-driven dynamic masking strategy that guides the model to focus on under-attended regions.
arXiv Detail & Related papers (2024-11-22T05:41:00Z)
- DHGCN: Dynamic Hop Graph Convolution Network for Self-Supervised Point Cloud Learning [23.048005152646592]
We propose the Dynamic Hop Graph Convolution Network (DHGCN) for explicitly learning the contextual relationships between point parts.
We devise a novel self-supervised part-level hop distance reconstruction task and design a novel loss function accordingly to facilitate training.
The proposed DHGCN is a plug-and-play module that is compatible with point-based backbone networks.
arXiv Detail & Related papers (2024-01-05T02:54:23Z)
- Point Cloud Pre-training with Diffusion Models [62.12279263217138]
We propose a novel pre-training method called Point cloud Diffusion pre-training (PointDif).
PointDif achieves substantial improvement across various real-world datasets for diverse downstream tasks such as classification, segmentation and detection.
arXiv Detail & Related papers (2023-11-25T08:10:05Z)
- ResMatch: Residual Attention Learning for Local Feature Matching [51.07496081296863]
We rethink cross- and self-attention from the viewpoint of traditional feature matching and filtering.
We inject the similarity of descriptors and relative positions into cross- and self-attention score.
We mine intra- and inter-neighbors according to the similarity of descriptors and relative positions.
arXiv Detail & Related papers (2023-07-11T11:32:12Z)
- D-Net: Learning for Distinctive Point Clouds by Self-Attentive Point Searching and Learnable Feature Fusion [48.57170130169045]
We propose D-Net to learn for distinctive point clouds based on a self-attentive point searching and a learnable feature fusion.
To generate a compact feature representation for each distinctive point set, a stacked self-gated convolution is proposed to extract the distinctive features.
The results show that the learned distinction distribution of a point cloud is highly consistent across objects of the same class and distinct from objects of other classes.
arXiv Detail & Related papers (2023-05-10T02:19:00Z)
- Towards Class-agnostic Tracking Using Feature Decorrelation in Point Clouds [9.321928362927965]
Single object tracking in point clouds has been attracting increasing attention owing to the presence of LiDAR sensors in 3D vision.
Existing methods based on deep neural networks focus mainly on training different models for different categories.
In this work, we turn our thoughts to a more challenging task in the LiDAR point clouds, class-agnostic tracking.
arXiv Detail & Related papers (2022-02-28T03:33:03Z)
- FatNet: A Feature-attentive Network for 3D Point Cloud Processing [1.502579291513768]
We introduce a novel feature-attentive neural network layer, a FAT layer, that combines both global point-based features and local edge-based features in order to generate better embeddings.
Our architecture achieves state-of-the-art results on the task of point cloud classification, as demonstrated on the ModelNet40 dataset.
arXiv Detail & Related papers (2021-04-07T23:13:56Z)
- Coordinate Attention for Efficient Mobile Network Design [96.40415345942186]
We propose a novel attention mechanism for mobile networks by embedding positional information into channel attention.
Unlike channel attention that transforms a feature tensor to a single feature vector via 2D global pooling, the coordinate attention factorizes channel attention into two 1D feature encoding processes.
Our coordinate attention is beneficial to ImageNet classification and performs better on downstream tasks such as object detection and semantic segmentation.
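A minimal sketch of this factorized attention appears after this list.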
arXiv Detail & Related papers (2021-03-04T09:18:02Z)
- SK-Net: Deep Learning on Point Cloud via End-to-end Discovery of Spatial Keypoints [7.223394571022494]
This paper presents an end-to-end framework, SK-Net, to jointly optimize the inference of spatial keypoints with the learning of feature representations of a point cloud.
Our proposed method performs better than or comparably to state-of-the-art approaches on point cloud tasks.
arXiv Detail & Related papers (2020-03-31T08:15:40Z)
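For the Coordinate Attention entry above, the following PyTorch sketch illustrates the factorization it describes: instead of collapsing the feature map to a single vector with 2D global pooling, the block pools along the height and width axes separately, encodes the two directional descriptors jointly, and produces two 1D attention maps that re-weight the input. The reduction ratio and the use of ReLU in place of the paper's activation are simplifying assumptions of this sketch.

```python
import torch
import torch.nn as nn

class CoordAtt(nn.Module):
    """Sketch of coordinate attention: channel attention factorized into two
    1D encodings along the height and width axes (reduction ratio and ReLU
    are simplifications, not the paper's exact configuration)."""

    def __init__(self, channels: int, reduction: int = 32):
        super().__init__()
        mid = max(8, channels // reduction)
        self.pool_h = nn.AdaptiveAvgPool2d((None, 1))  # pool over width  -> (B, C, H, 1)
        self.pool_w = nn.AdaptiveAvgPool2d((1, None))  # pool over height -> (B, C, 1, W)
        self.conv1 = nn.Conv2d(channels, mid, kernel_size=1)
        self.bn1 = nn.BatchNorm2d(mid)
        self.act = nn.ReLU(inplace=True)
        self.conv_h = nn.Conv2d(mid, channels, kernel_size=1)
        self.conv_w = nn.Conv2d(mid, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        _, _, h, w = x.shape
        x_h = self.pool_h(x)                        # (B, C, H, 1)
        x_w = self.pool_w(x).permute(0, 1, 3, 2)    # (B, C, W, 1)

        # Jointly encode the two directional descriptors.
        y = self.act(self.bn1(self.conv1(torch.cat([x_h, x_w], dim=2))))
        y_h, y_w = torch.split(y, [h, w], dim=2)
        y_w = y_w.permute(0, 1, 3, 2)               # back to (B, mid, 1, W)

        # Two 1D attention maps that together re-weight the input feature map.
        a_h = torch.sigmoid(self.conv_h(y_h))       # (B, C, H, 1)
        a_w = torch.sigmoid(self.conv_w(y_w))       # (B, C, 1, W)
        return x * a_h * a_w
```

Applied as y = CoordAtt(64)(x) to a (B, 64, H, W) feature map, the two 1D maps broadcast over the full map, which is how the block retains the positional information that a single 2D-pooled vector would discard.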
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of the information presented and is not responsible for any consequences.