Automatic size and pose homogenization with spatial transformer network
to improve and accelerate pediatric segmentation
- URL: http://arxiv.org/abs/2107.02655v1
- Date: Tue, 6 Jul 2021 14:50:03 GMT
- Title: Automatic size and pose homogenization with spatial transformer network
to improve and accelerate pediatric segmentation
- Authors: Giammarco La Barbera and Pietro Gori and Haithem Boussaid and Bruno
Belucci and Alessandro Delmonte and Jeanne Goulin and Sabine Sarnacki and
Laurence Rouet and Isabelle Bloch
- Abstract summary: We propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN)
Our architecture is composed of three sequential modules that are estimated together during training.
We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners.
- Score: 51.916106055115755
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Due to a high heterogeneity in pose and size and to a limited number of
available data, segmentation of pediatric images is challenging for deep
learning methods. In this work, we propose a new CNN architecture that is pose
and scale invariant thanks to the use of Spatial Transformer Network (STN). Our
architecture is composed of three sequential modules that are estimated
together during training: (i) a regression module to estimate a similarity
matrix to normalize the input image to a reference one; (ii) a differentiable
module to find the region of interest to segment; (iii) a segmentation module,
based on the popular UNet architecture, to delineate the object. Unlike the
original UNet, which strives to learn a complex mapping, including pose and
scale variations, from a finite training dataset, our segmentation module
learns a simpler mapping focusing on images with normalized pose and size.
Furthermore, the use of an automatic bounding box detection through STN allows
saving time and especially memory, while keeping similar performance. We test
the proposed method in kidney and renal tumor segmentation on abdominal
pediatric CT scanners. Results indicate that the estimated STN homogenization
of size and pose accelerates the segmentation (25h), compared to standard
data-augmentation (33h), while obtaining a similar quality for the kidney
(88.01\% of Dice score) and improving the renal tumor delineation (from 85.52\%
to 87.12\%).
Related papers
- Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - 3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentation [52.699139151447945]
We propose a novel adaptation method for transferring the segment anything model (SAM) from 2D to 3D for promptable medical image segmentation.
Our model can outperform domain state-of-the-art medical image segmentation models on 3 out of 4 tasks, specifically by 8.25%, 29.87%, and 10.11% for kidney tumor, pancreas tumor, colon cancer segmentation, and achieve similar performance for liver tumor segmentation.
arXiv Detail & Related papers (2023-06-23T12:09:52Z) - NUMSnet: Nested-U Multi-class Segmentation network for 3D Medical Image
Stacks [1.2335698325757494]
NUMSnet is a novel variant of the Unet model that transmits pixel neighborhood features across scans through nested layers.
We analyze the semantic segmentation performance of the NUMSnet model in comparison with several Unet model variants.
The proposed model can standardize multi-class semantic segmentation on a variety of volumetric image stacks with minimal training dataset.
arXiv Detail & Related papers (2023-04-05T19:16:29Z) - Attentive Symmetric Autoencoder for Brain MRI Segmentation [56.02577247523737]
We propose a novel Attentive Symmetric Auto-encoder based on Vision Transformer (ViT) for 3D brain MRI segmentation tasks.
In the pre-training stage, the proposed auto-encoder pays more attention to reconstruct the informative patches according to the gradient metrics.
Experimental results show that our proposed attentive symmetric auto-encoder outperforms the state-of-the-art self-supervised learning methods and medical image segmentation models.
arXiv Detail & Related papers (2022-09-19T09:43:19Z) - ACSGRegNet: A Deep Learning-based Framework for Unsupervised Joint
Affine and Diffeomorphic Registration of Lumbar Spine CT via Cross- and
Self-Attention Fusion [4.068962439293273]
This study proposes a novel end-to-end deep learning-based framework for medical image registration.
ACSGRegNet integrates a cross-attention module for establishing inter-image feature correspondences and a self-attention module for intra-image anatomical structures aware.
Our method achieved an average Dice of 0.963 and an average distance error of 0.321mm, which are better than the state-of-the-art (SOTA)
arXiv Detail & Related papers (2022-08-04T13:13:48Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - Residual Aligner Network [8.542808644281433]
Motion-Aware (MA) structure captures different motions in a region.
New network achieves results which were indistinguishable from the best-ranked networks.
arXiv Detail & Related papers (2022-03-07T22:48:43Z) - An End-to-End Breast Tumour Classification Model Using Context-Based
Patch Modelling- A BiLSTM Approach for Image Classification [19.594639581421422]
We have tried to integrate this relationship along with feature-based correlation among the extracted patches from the particular tumorous region.
We trained and tested our model on two datasets, microscopy images and WSI tumour regions.
We found out that BiLSTMs with CNN features have performed much better in modelling patches into an end-to-end Image classification network.
arXiv Detail & Related papers (2021-06-05T10:43:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.