CapsNet for Medical Image Segmentation
- URL: http://arxiv.org/abs/2203.08948v1
- Date: Wed, 16 Mar 2022 21:15:07 GMT
- Title: CapsNet for Medical Image Segmentation
- Authors: Minh Tran, Viet-Khoa Vo-Ho, Kyle Quinn, Hien Nguyen, Khoa Luu, and
Ngan Le
- Abstract summary: Convolutional Neural Networks (CNNs) have been successful in solving tasks in computer vision.
CNNs are sensitive to rotation and affine transformation and their success relies on huge-scale labeled datasets.
CapsNet is a new architecture that has achieved better robustness in representation learning.
- Score: 8.612958742534673
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Convolutional Neural Networks (CNNs) have been successful in solving tasks in
computer vision including medical image segmentation due to their ability to
automatically extract features from unstructured data. However, CNNs are
sensitive to rotation and affine transformation and their success relies on
huge-scale labeled datasets capturing various input variations. This network
paradigm has posed challenges at scale because acquiring annotated data for
medical segmentation is expensive, and strict privacy regulations. Furthermore,
visual representation learning with CNNs has its own flaws, e.g., it is
arguable that the pooling layer in traditional CNNs tends to discard positional
information and CNNs tend to fail on input images that differ in orientations
and sizes. Capsule network (CapsNet) is a recent new architecture that has
achieved better robustness in representation learning by replacing pooling
layers with dynamic routing and convolutional strides, which has shown
potential results on popular tasks such as classification, recognition,
segmentation, and natural language processing. Different from CNNs, which
result in scalar outputs, CapsNet returns vector outputs, which aim to preserve
the part-whole relationships. In this work, we first introduce the limitations
of CNNs and fundamentals of CapsNet. We then provide recent developments of
CapsNet for the task of medical image segmentation. We finally discuss various
effective network architectures to implement a CapsNet for both 2D images and
3D volumetric medical image segmentation.
Related papers
- CNN2GNN: How to Bridge CNN with GNN [59.42117676779735]
We propose a novel CNN2GNN framework to unify CNN and GNN together via distillation.
The performance of distilled boosted'' two-layer GNN on Mini-ImageNet is much higher than CNN containing dozens of layers such as ResNet152.
arXiv Detail & Related papers (2024-04-23T08:19:08Z) - Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation [5.995633685952995]
2D convolutional neural networks (CNNs) can hardly exploit the spatial correlation of volumetric data.
We propose a contextual embedding learning approach to facilitate 2D CNNs capturing spatial information properly.
Our approach leverages the learned embedding and the slice-wisely neighboring matching as a soft cue to guide the network.
arXiv Detail & Related papers (2024-04-02T08:17:39Z) - 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image
Segmentation [1.863532786702135]
We propose a 3D encoder-decoder network with Convolutional Capsule (called 3DConvCaps) to learn lower-level features (short-range attention) with convolutional layers.
Our experiments on multiple datasets including iSeg-2017, Hippocampus, and Cardiac demonstrate that our 3D 3DConvCaps network considerably outperforms previous capsule networks and 3D-UNets.
arXiv Detail & Related papers (2022-05-19T03:00:04Z) - 3D-UCaps: 3D Capsules Unet for Volumetric Image Segmentation [11.312343928772993]
We propose 3D-UCaps, a 3D voxel-based Capsule network for medical volumetric image segmentation.
Our method outperforms previous Capsule networks and 3D-Unets.
arXiv Detail & Related papers (2022-03-16T22:02:37Z) - BreakingBED -- Breaking Binary and Efficient Deep Neural Networks by
Adversarial Attacks [65.2021953284622]
We study robustness of CNNs against white-box and black-box adversarial attacks.
Results are shown for distilled CNNs, agent-based state-of-the-art pruned models, and binarized neural networks.
arXiv Detail & Related papers (2021-03-14T20:43:19Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - The Mind's Eye: Visualizing Class-Agnostic Features of CNNs [92.39082696657874]
We propose an approach to visually interpret CNN features given a set of images by creating corresponding images that depict the most informative features of a specific layer.
Our method uses a dual-objective activation and distance loss, without requiring a generator network nor modifications to the original model.
arXiv Detail & Related papers (2021-01-29T07:46:39Z) - Test-Time Adaptable Neural Networks for Robust Medical Image
Segmentation [9.372152932156293]
Convolutional Neural Networks (CNNs) work very well for supervised learning problems.
In medical image segmentation, this premise is violated when there is a mismatch between training and test images in terms of their acquisition details.
We design the segmentation CNN as a concatenation of two sub-networks: a relatively shallow image normalization CNN, followed by a deep CNN that segments the normalized image.
arXiv Detail & Related papers (2020-04-09T16:57:27Z) - CRNet: Cross-Reference Networks for Few-Shot Segmentation [59.85183776573642]
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
With a cross-reference mechanism, our network can better find the co-occurrent objects in the two images.
Experiments on the PASCAL VOC 2012 dataset show that our network achieves state-of-the-art performance.
arXiv Detail & Related papers (2020-03-24T04:55:43Z) - Curriculum By Smoothing [52.08553521577014]
Convolutional Neural Networks (CNNs) have shown impressive performance in computer vision tasks such as image classification, detection, and segmentation.
We propose an elegant curriculum based scheme that smoothes the feature embedding of a CNN using anti-aliasing or low-pass filters.
As the amount of information in the feature maps increases during training, the network is able to progressively learn better representations of the data.
arXiv Detail & Related papers (2020-03-03T07:27:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.