Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation
- URL: http://arxiv.org/abs/2104.00850v1
- Date: Fri, 2 Apr 2021 02:07:37 GMT
- Title: Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation
- Authors: Alessandra Lumini, Loris Nanni and Gianluca Maguolo
- Abstract summary: This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
- Score: 82.61182037130406
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Semantic segmentation has a wide array of applications ranging from
medical-image analysis, scene understanding, autonomous driving and robotic
navigation. This work deals with medical image segmentation and in particular
with accurate polyp detection and segmentation during colonoscopy examinations.
Several convolutional neural network architectures have been proposed to
effectively deal with this task and with the problem of segmenting objects at
different scale input. The basic architecture in image segmentation consists of
an encoder and a decoder: the first uses convolutional filters to extract
features from the image, the second is responsible for generating the final
output. In this work, we compare some variant of the DeepLab architecture
obtained by varying the decoder backbone. We compare several decoder
architectures, including ResNet, Xception, EfficentNet, MobileNet and we
perturb their layers by substituting ReLU activation layers with other
functions. The resulting methods are used to create deep ensembles which are
shown to be very effective. Our experimental evaluations show that our best
ensemble produces good segmentation results by achieving high evaluation scores
with a dice coefficient of 0.884, and a mean Intersection over Union (mIoU) of
0.818 for the Kvasir-SEG dataset. To improve reproducibility and research
efficiency the MATLAB source code used for this research is available at
GitHub: https://github.com/LorisNanni.
Related papers
- DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut [62.63481844384229]
Foundation models have emerged as powerful tools across various domains including language, vision, and multimodal tasks.
In this paper, we use a diffusion UNet encoder as a foundation vision encoder and introduce DiffCut, an unsupervised zero-shot segmentation method.
Our work highlights the remarkably accurate semantic knowledge embedded within diffusion UNet encoders that could then serve as foundation vision encoders for downstream tasks.
arXiv Detail & Related papers (2024-06-05T01:32:31Z) - U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation [14.450329809640422]
We introduce U-Net v2, a new robust and efficient U-Net variant for medical image segmentation.
It aims to augment the infusion of semantic information into low-level features while simultaneously refining high-level features with finer details.
arXiv Detail & Related papers (2023-11-29T16:35:24Z) - Using DUCK-Net for Polyp Image Segmentation [0.0]
"DUCK-Net" is capable of effectively learning and generalizing from small amounts of medical images to perform accurate segmentation tasks.
We demonstrate its capabilities specifically for polyp segmentation in colonoscopy images.
arXiv Detail & Related papers (2023-11-03T20:58:44Z) - Multi Kernel Positional Embedding ConvNeXt for Polyp Segmentation [7.31341312596412]
We propose a novel framework composed of ConvNeXt backbone and Multi Kernel Positional Embedding block.
Our model achieves the Dice coefficient of 0.8818 and the IOU score of 0.8163 on the Kvasir-SEG dataset.
arXiv Detail & Related papers (2023-01-17T03:12:57Z) - R2U++: A Multiscale Recurrent Residual U-Net with Dense Skip Connections
for Medical Image Segmentation [0.5735035463793008]
We propose a new U-Net based medical image segmentation architecture R2U++.
In the proposed architecture, the plain convolutional backbone is replaced by a deeper recurrent residual convolution block.
The semantic gap between encoder and decoder is reduced by dense skip pathways.
arXiv Detail & Related papers (2022-06-03T19:42:44Z) - Deep ensembles in bioimage segmentation [74.01883650587321]
In this work, we propose an ensemble of convolutional neural networks (CNNs)
In ensemble methods, many different models are trained and then used for classification, the ensemble aggregates the outputs of the single classifiers.
The proposed ensemble is implemented by combining different backbone networks using the DeepLabV3+ and HarDNet environment.
arXiv Detail & Related papers (2021-12-24T05:54:21Z) - Small Lesion Segmentation in Brain MRIs with Subpixel Embedding [105.1223735549524]
We present a method to segment MRI scans of the human brain into ischemic stroke lesion and normal tissues.
We propose a neural network architecture in the form of a standard encoder-decoder where predictions are guided by a spatial expansion embedding network.
arXiv Detail & Related papers (2021-09-18T00:21:17Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - KiU-Net: Overcomplete Convolutional Architectures for Biomedical Image
and Volumetric Segmentation [71.79090083883403]
"Traditional" encoder-decoder based approaches perform poorly in detecting smaller structures and are unable to segment boundary regions precisely.
We propose KiU-Net which has two branches: (1) an overcomplete convolutional network Kite-Net which learns to capture fine details and accurate edges of the input, and (2) U-Net which learns high level features.
The proposed method achieves a better performance as compared to all the recent methods with an additional benefit of fewer parameters and faster convergence.
arXiv Detail & Related papers (2020-10-04T19:23:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.