Small Lesion Segmentation in Brain MRIs with Subpixel Embedding
- URL: http://arxiv.org/abs/2109.08791v1
- Date: Sat, 18 Sep 2021 00:21:17 GMT
- Title: Small Lesion Segmentation in Brain MRIs with Subpixel Embedding
- Authors: Alex Wong, Allison Chen, Yangchao Wu, Safa Cicek, Alexandre Tiard,
Byung-Woo Hong, Stefano Soatto
- Abstract summary: We present a method to segment MRI scans of the human brain into ischemic stroke lesion and normal tissues.
We propose a neural network architecture in the form of a standard encoder-decoder where predictions are guided by a spatial expansion embedding network.
- Score: 105.1223735549524
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: We present a method to segment MRI scans of the human brain into ischemic
stroke lesion and normal tissues. We propose a neural network architecture in
the form of a standard encoder-decoder where predictions are guided by a
spatial expansion embedding network. Our embedding network learns features that
can resolve detailed structures in the brain without the need for
high-resolution training images, which are often unavailable and expensive to
acquire. Alternatively, the encoder-decoder learns global structures by means
of striding and max pooling. Our embedding network complements the
encoder-decoder architecture by guiding the decoder with fine-grained details
lost to spatial downsampling during the encoder stage. Unlike previous works,
our decoder outputs at 2 times the input resolution, where a single pixel in
the input resolution is predicted by four neighboring subpixels in our output.
To obtain the output at the original scale, we propose a learnable downsampler
(as opposed to hand-crafted ones e.g. bilinear) that combines subpixel
predictions. Our approach improves the baseline architecture by approximately
11.7% and achieves the state of the art on the ATLAS public benchmark dataset
with a smaller memory footprint and faster runtime than the best competing
method. Our source code has been made available at:
https://github.com/alexklwong/subpixel-embedding-segmentation.
Related papers
- SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation [91.91385816767057]
Open-vocabulary semantic segmentation strives to distinguish pixels into different semantic groups from an open set of categories.
We propose a simple encoder-decoder, named SED, for open-vocabulary semantic segmentation.
Our SED method achieves mIoU score of 31.6% on ADE20K with 150 categories at 82 millisecond ($ms$) per image on a single A6000.
arXiv Detail & Related papers (2023-11-27T05:00:38Z) - More complex encoder is not all you need [0.882348769487259]
We introduce neU-Net (i.e., not complex encoder U-Net), which incorporates a novel Sub-pixel Convolution for upsampling to construct a powerful decoder.
Our model design achieves excellent results, surpassing other state-of-the-art methods on both the Synapse and ACDC datasets.
arXiv Detail & Related papers (2023-09-20T08:34:38Z) - Attention Attention Everywhere: Monocular Depth Prediction with Skip
Attention [6.491470878214977]
Monocular Depth Estimation (MDE) aims to predict pixel-wise depth given a single RGB image.
Inspired by the demonstrated benefits of attention in a multitude of computer vision problems, we propose an attention-based fusion of encoder and decoder features.
arXiv Detail & Related papers (2022-10-17T13:14:47Z) - Reducing Redundancy in the Bottleneck Representation of the Autoencoders [98.78384185493624]
Autoencoders are a type of unsupervised neural networks, which can be used to solve various tasks.
We propose a scheme to explicitly penalize feature redundancies in the bottleneck representation.
We tested our approach across different tasks: dimensionality reduction using three different dataset, image compression using the MNIST dataset, and image denoising using fashion MNIST.
arXiv Detail & Related papers (2022-02-09T18:48:02Z) - Dynamic Neural Representational Decoders for High-Resolution Semantic
Segmentation [98.05643473345474]
We propose a novel decoder, termed dynamic neural representational decoder (NRD)
As each location on the encoder's output corresponds to a local patch of the semantic labels, in this work, we represent these local patches of labels with compact neural networks.
This neural representation enables our decoder to leverage the smoothness prior in the semantic label space, and thus makes our decoder more efficient.
arXiv Detail & Related papers (2021-07-30T04:50:56Z) - Atrous Residual Interconnected Encoder to Attention Decoder Framework
for Vertebrae Segmentation via 3D Volumetric CT Images [1.8146155083014204]
This paper proposes a novel algorithm for automated vertebrae segmentation via 3D volumetric spine CT images.
The proposed model is based on the structure of encoder to decoder, using layer normalization to optimize mini-batch training performance.
The experimental results show that our model achieves competitive performance compared with other state-of-the-art medical semantic segmentation methods.
arXiv Detail & Related papers (2021-04-08T12:09:16Z) - A Novel Upsampling and Context Convolution for Image Semantic
Segmentation [0.966840768820136]
Recent methods for semantic segmentation often employ an encoder-decoder structure using deep convolutional neural networks.
We propose a dense upsampling convolution method based on guided filtering to effectively preserve the spatial information of the image in the network.
We report a new record of 82.86% and 81.62% of pixel accuracy on ADE20K and Pascal-Context benchmark datasets, respectively.
arXiv Detail & Related papers (2021-03-20T06:16:42Z) - Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective
with Transformers [149.78470371525754]
We treat semantic segmentation as a sequence-to-sequence prediction task. Specifically, we deploy a pure transformer to encode an image as a sequence of patches.
With the global context modeled in every layer of the transformer, this encoder can be combined with a simple decoder to provide a powerful segmentation model, termed SEgmentation TRansformer (SETR)
SETR achieves new state of the art on ADE20K (50.28% mIoU), Pascal Context (55.83% mIoU) and competitive results on Cityscapes.
arXiv Detail & Related papers (2020-12-31T18:55:57Z) - Beyond Single Stage Encoder-Decoder Networks: Deep Decoders for Semantic
Image Segmentation [56.44853893149365]
Single encoder-decoder methodologies for semantic segmentation are reaching their peak in terms of segmentation quality and efficiency per number of layers.
We propose a new architecture based on a decoder which uses a set of shallow networks for capturing more information content.
In order to further improve the architecture we introduce a weight function which aims to re-balance classes to increase the attention of the networks to under-represented objects.
arXiv Detail & Related papers (2020-07-19T18:44:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.