3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image
Segmentation
- URL: http://arxiv.org/abs/2205.09299v1
- Date: Thu, 19 May 2022 03:00:04 GMT
- Title: 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image
Segmentation
- Authors: Minh Tran, Viet-Khoa Vo-Ho, Ngan T.H. Le
- Abstract summary: We propose a 3D encoder-decoder network with Convolutional Capsule (called 3DConvCaps) to learn lower-level features (short-range attention) with convolutional layers.
Our experiments on multiple datasets including iSeg-2017, Hippocampus, and Cardiac demonstrate that our 3D 3DConvCaps network considerably outperforms previous capsule networks and 3D-UNets.
- Score: 1.863532786702135
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Convolutional Neural Networks (CNNs) have achieved promising results in
medical image segmentation. However, CNNs require lots of training data and are
incapable of handling pose and deformation of objects. Furthermore, their
pooling layers tend to discard important information such as positions as well
as CNNs are sensitive to rotation and affine transformation. Capsule network is
a recent new architecture that has achieved better robustness in part-whole
representation learning by replacing pooling layers with dynamic routing and
convolutional strides, which has shown potential results on popular tasks such
as digit classification and object segmentation. In this paper, we propose a 3D
encoder-decoder network with Convolutional Capsule Encoder (called 3DConvCaps)
to learn lower-level features (short-range attention) with convolutional layers
while modeling the higher-level features (long-range dependence) with capsule
layers. Our experiments on multiple datasets including iSeg-2017, Hippocampus,
and Cardiac demonstrate that our 3D 3DConvCaps network considerably outperforms
previous capsule networks and 3D-UNets. We further conduct ablation studies of
network efficiency and segmentation performance under various configurations of
convolution layers and capsule layers at both contracting and expanding paths.
Related papers
- 3D-EffiViTCaps: 3D Efficient Vision Transformer with Capsule for Medical Image Segmentation [3.3561661617196843]
We propose a U-shaped 3D encoder-decoder network named 3D-EffiViTCaps.
Our encoder uses capsule blocks and EfficientViT blocks to jointly capture local and global semantic information.
Our decoder employs CNN blocks and EfficientViT blocks to catch ffner details for segmentation.
arXiv Detail & Related papers (2024-03-25T00:59:35Z) - SeMLaPS: Real-time Semantic Mapping with Latent Prior Networks and
Quasi-Planar Segmentation [53.83313235792596]
We present a new methodology for real-time semantic mapping from RGB-D sequences.
It combines a 2D neural network and a 3D network based on a SLAM system with 3D occupancy mapping.
Our system achieves state-of-the-art semantic mapping quality within 2D-3D networks-based systems.
arXiv Detail & Related papers (2023-06-28T22:36:44Z) - 3D-UCaps: 3D Capsules Unet for Volumetric Image Segmentation [11.312343928772993]
We propose 3D-UCaps, a 3D voxel-based Capsule network for medical volumetric image segmentation.
Our method outperforms previous Capsule networks and 3D-Unets.
arXiv Detail & Related papers (2022-03-16T22:02:37Z) - CapsNet for Medical Image Segmentation [8.612958742534673]
Convolutional Neural Networks (CNNs) have been successful in solving tasks in computer vision.
CNNs are sensitive to rotation and affine transformation and their success relies on huge-scale labeled datasets.
CapsNet is a new architecture that has achieved better robustness in representation learning.
arXiv Detail & Related papers (2022-03-16T21:15:07Z) - SS-3DCapsNet: Self-supervised 3D Capsule Networks for Medical
Segmentation on Less Labeled Data [10.371128893952537]
This work extends capsule networks for volumetric medical image segmentation with self-supervised learning.
Our 3D capsule network with self-supervised pre-training considerably outperforms previous capsule networks and 3D-UNets.
arXiv Detail & Related papers (2022-01-15T18:42:38Z) - Training Deep Capsule Networks with Residual Connections [0.0]
Capsule networks are a type of neural network that have recently gained increased popularity.
They consist of groups of neurons, called capsules, which encode properties of objects or object parts.
Most capsule network implementations use two to three capsule layers, which limits their applicability as expressivity grows exponentially with depth.
We propose a methodology to train deeper capsule networks using residual connections, which is evaluated on four datasets and three different routing algorithms.
Our experimental results show that in fact, performance increases when training deeper capsule networks.
arXiv Detail & Related papers (2021-04-15T11:42:44Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective
Crop Layers [111.55817466296402]
We introduce Perspective Crop Layers (PCLs) - a form of perspective crop of the region of interest based on the camera geometry.
PCLs deterministically remove the location-dependent perspective effects while leaving end-to-end training and the number of parameters of the underlying neural network.
PCL offers an easy way to improve the accuracy of existing 3D reconstruction networks by making them geometry aware.
arXiv Detail & Related papers (2020-11-27T08:48:43Z) - Improving Point Cloud Semantic Segmentation by Learning 3D Object
Detection [102.62963605429508]
Point cloud semantic segmentation plays an essential role in autonomous driving.
Current 3D semantic segmentation networks focus on convolutional architectures that perform great for well represented classes.
We propose a novel Aware 3D Semantic Detection (DASS) framework that explicitly leverages localization features from an auxiliary 3D object detection task.
arXiv Detail & Related papers (2020-09-22T14:17:40Z) - CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks [87.02416370081123]
3D Convolution Neural Networks (CNNs) have been widely applied to 3D scene understanding, such as video analysis and volumetric image recognition.
We propose Channel-wise Automatic KErnel Shrinking (CAKES), to enable efficient 3D learning by shrinking standard 3D convolutions into a set of economic operations.
arXiv Detail & Related papers (2020-03-28T14:21:12Z) - Subspace Capsule Network [85.69796543499021]
SubSpace Capsule Network (SCN) exploits the idea of capsule networks to model possible variations in the appearance or implicitly defined properties of an entity.
SCN can be applied to both discriminative and generative models without incurring computational overhead compared to CNN during test time.
arXiv Detail & Related papers (2020-02-07T17:51:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.