3D Medical Point Transformer: Introducing Convolution to Attention
Networks for Medical Point Cloud Analysis
- URL: http://arxiv.org/abs/2112.04863v1
- Date: Thu, 9 Dec 2021 12:31:28 GMT
- Title: 3D Medical Point Transformer: Introducing Convolution to Attention
Networks for Medical Point Cloud Analysis
- Authors: Jianhui Yu, Chaoyi Zhang, Heng Wang, Dingxin Zhang, Yang Song, Tiange
Xiang, Dongnan Liu, Weidong Cai
- Abstract summary: We propose an attention-based model specifically for medical point clouds, namely 3D medical point Transformer (3DMedPT)
By augmenting contextual information and summarizing local responses at query, our attention module can capture both local context and global content feature interactions.
Experiments conducted on IntrA dataset proves the superiority of 3DMedPT, where we achieve the best classification and segmentation results.
- Score: 21.934221178688116
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: General point clouds have been increasingly investigated for different tasks,
and recently Transformer-based networks are proposed for point cloud analysis.
However, there are barely related works for medical point clouds, which are
important for disease detection and treatment. In this work, we propose an
attention-based model specifically for medical point clouds, namely 3D medical
point Transformer (3DMedPT), to examine the complex biological structures. By
augmenting contextual information and summarizing local responses at query, our
attention module can capture both local context and global content feature
interactions. However, the insufficient training samples of medical data may
lead to poor feature learning, so we apply position embeddings to learn
accurate local geometry and Multi-Graph Reasoning (MGR) to examine global
knowledge propagation over channel graphs to enrich feature representations.
Experiments conducted on IntrA dataset proves the superiority of 3DMedPT, where
we achieve the best classification and segmentation results. Furthermore, the
promising generalization ability of our method is validated on general 3D point
cloud benchmarks: ModelNet40 and ShapeNetPart. Code will be released soon.
Related papers
- Multi-objective point cloud autoencoders for explainable myocardial
infarction prediction [4.65840670565844]
Myocardial infarction is one of the most common causes of death in the world.
Image-based biomarkers fail to capture more complex patterns in the heart's 3D anatomy.
We present the multi-objective point cloud autoencoder as a novel geometric deep learning approach for explainable infarction prediction.
arXiv Detail & Related papers (2023-07-20T16:45:16Z) - Collect-and-Distribute Transformer for 3D Point Cloud Analysis [82.03517861433849]
We propose a new transformer network equipped with a collect-and-distribute mechanism to communicate short- and long-range contexts of point clouds.
Results show the effectiveness of the proposed CDFormer, delivering several new state-of-the-art performances on point cloud classification and segmentation tasks.
arXiv Detail & Related papers (2023-06-02T03:48:45Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - Point-Unet: A Context-aware Point-based Neural Network for Volumetric
Segmentation [18.81644604997336]
We propose Point-Unet, a novel method that incorporates the efficiency of deep learning with 3D point clouds into volumetric segmentation.
Our key idea is to first predict the regions of interest in the volume by learning an attentional probability map.
A comprehensive benchmark on different metrics has shown that our context-aware Point-Unet robustly outperforms the SOTA voxel-based networks.
arXiv Detail & Related papers (2022-03-16T22:02:08Z) - Feature Aggregation and Refinement Network for 2D AnatomicalLandmark
Detection [0.0]
We propose a novel deep network, named feature aggregation and refinement network (FARNet) for the automatic detection of anatomical landmarks.
Our network has been evaluated on three publicly available anatomical landmark detection datasets.
arXiv Detail & Related papers (2021-11-01T02:16:13Z) - PnP-3D: A Plug-and-Play for 3D Point Clouds [38.05362492645094]
We propose a plug-and-play module, -3D, to improve the effectiveness of existing networks in analyzing point cloud data.
To thoroughly evaluate our approach, we conduct experiments on three standard point cloud analysis tasks.
In addition to achieving state-of-the-art results, we present comprehensive studies to demonstrate our approach's advantages.
arXiv Detail & Related papers (2021-08-16T23:59:43Z) - Point Discriminative Learning for Unsupervised Representation Learning
on 3D Point Clouds [54.31515001741987]
We propose a point discriminative learning method for unsupervised representation learning on 3D point clouds.
We achieve this by imposing a novel point discrimination loss on the middle level and global level point features.
Our method learns powerful representations and achieves new state-of-the-art performance.
arXiv Detail & Related papers (2021-08-04T15:11:48Z) - PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object
Detection [57.49788100647103]
LiDAR-based 3D object detection is an important task for autonomous driving.
Current approaches suffer from sparse and partial point clouds of distant and occluded objects.
In this paper, we propose a novel two-stage approach, namely PC-RGNN, dealing with such challenges by two specific solutions.
arXiv Detail & Related papers (2020-12-18T18:06:43Z) - Domain Generalization for Medical Imaging Classification with
Linear-Dependency Regularization [59.5104563755095]
We introduce a simple but effective approach to improve the generalization capability of deep neural networks in the field of medical imaging classification.
Motivated by the observation that the domain variability of the medical images is to some extent compact, we propose to learn a representative feature space through variational encoding.
arXiv Detail & Related papers (2020-09-27T12:30:30Z) - Airborne LiDAR Point Cloud Classification with Graph Attention
Convolution Neural Network [5.69168146446103]
We present a graph attention convolution neural network (GACNN) that can be directly applied to the classification of unstructured 3D point clouds obtained by airborne LiDAR.
Based on the proposed graph attention convolution module, we further design an end-to-end encoder-decoder network, named GACNN, to capture multiscale features of the point clouds.
Experiments on the ISPRS 3D labeling dataset show that the proposed model achieves a new state-of-the-art performance in terms of average F1 score (71.5%) and a satisfying overall accuracy (83.2%)
arXiv Detail & Related papers (2020-04-20T05:12:31Z) - Structured Landmark Detection via Topology-Adapting Deep Graph Learning [75.20602712947016]
We present a new topology-adapting deep graph learning approach for accurate anatomical facial and medical landmark detection.
The proposed method constructs graph signals leveraging both local image features and global shape features.
Experiments are conducted on three public facial image datasets (WFLW, 300W, and COFW-68) as well as three real-world X-ray medical datasets (Cephalometric (public), Hand and Pelvis)
arXiv Detail & Related papers (2020-04-17T11:55:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.