A Unified Framework for Generalized Low-Shot Medical Image Segmentation
with Scarce Data
- URL: http://arxiv.org/abs/2110.09260v1
- Date: Mon, 18 Oct 2021 13:01:06 GMT
- Title: A Unified Framework for Generalized Low-Shot Medical Image Segmentation
with Scarce Data
- Authors: Hengji Cui, Dong Wei, Kai Ma, Shi Gu, and Yefeng Zheng
- Abstract summary: We propose a unified framework for generalized low-shot (one- and few-shot) medical image segmentation based on distance metric learning (DML)
Via DML, the framework learns a multimodal mixture representation for each category, and performs dense predictions based on cosine distances between the pixels' deep embeddings and the category representations.
In our experiments on brain MRI and abdominal CT datasets, the proposed framework achieves superior performances for low-shot segmentation towards standard DNN-based (3D U-Net) and classical registration-based (ANTs) methods.
- Score: 24.12765716392381
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Medical image segmentation has achieved remarkable advancements using deep
neural networks (DNNs). However, DNNs often need big amounts of data and
annotations for training, both of which can be difficult and costly to obtain.
In this work, we propose a unified framework for generalized low-shot (one- and
few-shot) medical image segmentation based on distance metric learning (DML).
Unlike most existing methods which only deal with the lack of annotations while
assuming abundance of data, our framework works with extreme scarcity of both,
which is ideal for rare diseases. Via DML, the framework learns a multimodal
mixture representation for each category, and performs dense predictions based
on cosine distances between the pixels' deep embeddings and the category
representations. The multimodal representations effectively utilize the
inter-subject similarities and intraclass variations to overcome overfitting
due to extremely limited data. In addition, we propose adaptive mixing
coefficients for the multimodal mixture distributions to adaptively emphasize
the modes better suited to the current input. The representations are
implicitly embedded as weights of the fc layer, such that the cosine distances
can be computed efficiently via forward propagation. In our experiments on
brain MRI and abdominal CT datasets, the proposed framework achieves superior
performances for low-shot segmentation towards standard DNN-based (3D U-Net)
and classical registration-based (ANTs) methods, e.g., achieving mean Dice
coefficients of 81%/69% for brain tissue/abdominal multiorgan segmentation
using a single training sample, as compared to 52%/31% and 72%/35% by the U-Net
and ANTs, respectively.
Related papers
- Label-efficient Multi-organ Segmentation Method with Diffusion Model [6.413416851085592]
We present a label-efficient learning approach using a pre-trained diffusion model for multi-organ segmentation tasks in CT images.
Our method achieves competitive multi-organ segmentation performance compared to state-of-the-art methods on the FLARE 2022 dataset.
arXiv Detail & Related papers (2024-02-23T09:25:57Z) - Multi-Scale Cross Contrastive Learning for Semi-Supervised Medical Image
Segmentation [14.536384387956527]
We develop a novel Multi-Scale Cross Supervised Contrastive Learning framework to segment structures in medical images.
Our approach contrasts multi-scale features based on ground-truth and cross-predicted labels, in order to extract robust feature representations.
It outperforms state-of-the-art semi-supervised methods by more than 3.0% in Dice.
arXiv Detail & Related papers (2023-06-25T16:55:32Z) - Interpretable Small Training Set Image Segmentation Network Originated
from Multi-Grid Variational Model [5.283735137946097]
Deep learning (DL) methods have been proposed and widely used for image segmentation.
DL methods usually require a large amount of manually segmented data as training data and suffer from poor interpretability.
In this paper, we replace the hand-crafted regularity term in the MS model with a data adaptive generalized learnable regularity term.
arXiv Detail & Related papers (2023-06-25T02:34:34Z) - Rethinking Semi-Supervised Medical Image Segmentation: A
Variance-Reduction Perspective [51.70661197256033]
We propose ARCO, a semi-supervised contrastive learning framework with stratified group theory for medical image segmentation.
We first propose building ARCO through the concept of variance-reduced estimation and show that certain variance-reduction techniques are particularly beneficial in pixel/voxel-level segmentation tasks.
We experimentally validate our approaches on eight benchmarks, i.e., five 2D/3D medical and three semantic segmentation datasets, with different label settings.
arXiv Detail & Related papers (2023-02-03T13:50:25Z) - Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric
Segmentation [13.158995287578316]
We propose a dynamic architecture network named Med-DANet to achieve effective accuracy and efficiency trade-off.
For each slice of the input 3D MRI volume, our proposed method learns a slice-specific decision by the Decision Network.
Our proposed method achieves comparable or better results than previous state-of-the-art methods for 3D MRI brain tumor segmentation.
arXiv Detail & Related papers (2022-06-14T03:25:58Z) - CORPS: Cost-free Rigorous Pseudo-labeling based on Similarity-ranking
for Brain MRI Segmentation [3.1657395760137406]
We propose a semi-supervised segmentation framework built upon a novel atlas-based pseudo-labeling method and a 3D deep convolutional neural network (DCNN) for 3D brain MRI segmentation.
The experimental results demonstrate the superiority of the proposed framework over the baseline method both qualitatively and quantitatively.
arXiv Detail & Related papers (2022-05-19T14:42:49Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - Automatic size and pose homogenization with spatial transformer network
to improve and accelerate pediatric segmentation [51.916106055115755]
We propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN)
Our architecture is composed of three sequential modules that are estimated together during training.
We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners.
arXiv Detail & Related papers (2021-07-06T14:50:03Z) - Few-shot Medical Image Segmentation using a Global Correlation Network
with Discriminative Embedding [60.89561661441736]
We propose a novel method for few-shot medical image segmentation.
We construct our few-shot image segmentor using a deep convolutional network trained episodically.
We enhance discriminability of deep embedding to encourage clustering of the feature domains of the same class.
arXiv Detail & Related papers (2020-12-10T04:01:07Z) - Weakly-supervised Learning For Catheter Segmentation in 3D Frustum
Ultrasound [74.22397862400177]
We propose a novel Frustum ultrasound based catheter segmentation method.
The proposed method achieved the state-of-the-art performance with an efficiency of 0.25 second per volume.
arXiv Detail & Related papers (2020-10-19T13:56:22Z) - Unpaired Multi-modal Segmentation via Knowledge Distillation [77.39798870702174]
We propose a novel learning scheme for unpaired cross-modality image segmentation.
In our method, we heavily reuse network parameters, by sharing all convolutional kernels across CT and MRI.
We have extensively validated our approach on two multi-class segmentation problems.
arXiv Detail & Related papers (2020-01-06T20:03:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.