3D-UCaps: 3D Capsules Unet for Volumetric Image Segmentation
- URL: http://arxiv.org/abs/2203.08965v1
- Date: Wed, 16 Mar 2022 22:02:37 GMT
- Title: 3D-UCaps: 3D Capsules Unet for Volumetric Image Segmentation
- Authors: Tan Nguyen, Binh-Son Hua, Ngan Le
- Abstract summary: We propose 3D-UCaps, a 3D voxel-based Capsule network for medical volumetric image segmentation.
Our method outperforms previous Capsule networks and 3D-Unets.
- Score: 11.312343928772993
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Medical image segmentation has been so far achieving promising results with
Convolutional Neural Networks (CNNs). However, it is arguable that in
traditional CNNs, its pooling layer tends to discard important information such
as positions. Moreover, CNNs are sensitive to rotation and affine
transformation. Capsule network is a data-efficient network design proposed to
overcome such limitations by replacing pooling layers with dynamic routing and
convolutional strides, which aims to preserve the part-whole relationships.
Capsule network has shown a great performance in image recognition and natural
language processing, but applications for medical image segmentation,
particularly volumetric image segmentation, has been limited. In this work, we
propose 3D-UCaps, a 3D voxel-based Capsule network for medical volumetric image
segmentation. We build the concept of capsules into a CNN by designing a
network with two pathways: the first pathway is encoded by 3D Capsule blocks,
whereas the second pathway is decoded by 3D CNNs blocks. 3D-UCaps, therefore
inherits the merits from both Capsule network to preserve the spatial
relationship and CNNs to learn visual representation. We conducted experiments
on various datasets to demonstrate the robustness of 3D-UCaps including
iSeg-2017, LUNA16, Hippocampus, and Cardiac, where our method outperforms
previous Capsule networks and 3D-Unets.
Related papers
- 3D-EffiViTCaps: 3D Efficient Vision Transformer with Capsule for Medical Image Segmentation [3.3561661617196843]
We propose a U-shaped 3D encoder-decoder network named 3D-EffiViTCaps.
Our encoder uses capsule blocks and EfficientViT blocks to jointly capture local and global semantic information.
Our decoder employs CNN blocks and EfficientViT blocks to catch ffner details for segmentation.
arXiv Detail & Related papers (2024-03-25T00:59:35Z) - Graphics Capsule: Learning Hierarchical 3D Face Representations from 2D
Images [82.5266467869448]
We propose an Inverse Graphics Capsule Network (IGC-Net) to learn the hierarchical 3D face representations from large-scale unlabeled images.
IGC-Net first decomposes the objects into a set of semantic-consistent part-level descriptions and then assembles them into object-level descriptions to build the hierarchy.
arXiv Detail & Related papers (2023-03-20T06:32:55Z) - 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image
Segmentation [1.863532786702135]
We propose a 3D encoder-decoder network with Convolutional Capsule (called 3DConvCaps) to learn lower-level features (short-range attention) with convolutional layers.
Our experiments on multiple datasets including iSeg-2017, Hippocampus, and Cardiac demonstrate that our 3D 3DConvCaps network considerably outperforms previous capsule networks and 3D-UNets.
arXiv Detail & Related papers (2022-05-19T03:00:04Z) - CapsNet for Medical Image Segmentation [8.612958742534673]
Convolutional Neural Networks (CNNs) have been successful in solving tasks in computer vision.
CNNs are sensitive to rotation and affine transformation and their success relies on huge-scale labeled datasets.
CapsNet is a new architecture that has achieved better robustness in representation learning.
arXiv Detail & Related papers (2022-03-16T21:15:07Z) - SS-3DCapsNet: Self-supervised 3D Capsule Networks for Medical
Segmentation on Less Labeled Data [10.371128893952537]
This work extends capsule networks for volumetric medical image segmentation with self-supervised learning.
Our 3D capsule network with self-supervised pre-training considerably outperforms previous capsule networks and 3D-UNets.
arXiv Detail & Related papers (2022-01-15T18:42:38Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - Learning Hybrid Representations for Automatic 3D Vessel Centerline
Extraction [57.74609918453932]
Automatic blood vessel extraction from 3D medical images is crucial for vascular disease diagnoses.
Existing methods may suffer from discontinuities of extracted vessels when segmenting such thin tubular structures from 3D images.
We argue that preserving the continuity of extracted vessels requires to take into account the global geometry.
We propose a hybrid representation learning approach to address this challenge.
arXiv Detail & Related papers (2020-12-14T05:22:49Z) - PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective
Crop Layers [111.55817466296402]
We introduce Perspective Crop Layers (PCLs) - a form of perspective crop of the region of interest based on the camera geometry.
PCLs deterministically remove the location-dependent perspective effects while leaving end-to-end training and the number of parameters of the underlying neural network.
PCL offers an easy way to improve the accuracy of existing 3D reconstruction networks by making them geometry aware.
arXiv Detail & Related papers (2020-11-27T08:48:43Z) - CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks [87.02416370081123]
3D Convolution Neural Networks (CNNs) have been widely applied to 3D scene understanding, such as video analysis and volumetric image recognition.
We propose Channel-wise Automatic KErnel Shrinking (CAKES), to enable efficient 3D learning by shrinking standard 3D convolutions into a set of economic operations.
arXiv Detail & Related papers (2020-03-28T14:21:12Z) - Subspace Capsule Network [85.69796543499021]
SubSpace Capsule Network (SCN) exploits the idea of capsule networks to model possible variations in the appearance or implicitly defined properties of an entity.
SCN can be applied to both discriminative and generative models without incurring computational overhead compared to CNN during test time.
arXiv Detail & Related papers (2020-02-07T17:51:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.