Efficient Medical Image Segmentation with Intermediate Supervision
Mechanism
- URL: http://arxiv.org/abs/2012.03673v1
- Date: Sun, 15 Nov 2020 13:46:00 GMT
- Title: Efficient Medical Image Segmentation with Intermediate Supervision
Mechanism
- Authors: Di Yuan, Junyang Chen, Zhenghua Xu, Thomas Lukasiewicz, Zhigang Fu,
Guizhi Xu
- Abstract summary: The expansion path of U-Net may ignore the characteristics of small targets, so an intermediate supervision mechanism is proposed.
Although the intermediate supervision mechanism improves the segmentation accuracy, the training time is too long due to the extra input and multiple loss functions.
To reduce the redundancy of the model, we combine shared-weight decoder module with tied-weight decoder module.
- Score: 48.244918515770514
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Because the expansion path of U-Net may ignore the characteristics of small
targets, intermediate supervision mechanism is proposed. The original mask is
also entered into the network as a label for intermediate output. However,
U-Net is mainly engaged in segmentation, and the extracted features are also
targeted at segmentation location information, and the input and output are
different. The label we need is that the input and output are both original
masks, which is more similar to the refactoring process, so we propose another
intermediate supervision mechanism. However, the features extracted by the
contraction path of this intermediate monitoring mechanism are not necessarily
consistent. For example, U-Net's contraction path extracts transverse features,
while auto-encoder extracts longitudinal features, which may cause the output
of the expansion path to be inconsistent with the label. Therefore, we put
forward the intermediate supervision mechanism of shared-weight decoder module.
Although the intermediate supervision mechanism improves the segmentation
accuracy, the training time is too long due to the extra input and multiple
loss functions. For one of these problems, we have introduced tied-weight
decoder. To reduce the redundancy of the model, we combine shared-weight
decoder module with tied-weight decoder module.
Related papers
- ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation [32.74195208408193]
Medical image segmentation is a crucial task in computer vision, supporting clinicians in diagnosis, treatment planning, and disease monitoring.
We propose the Adaptive Semantic Network (ASSNet), a transformer architecture that effectively integrates local and global features for precise medical image segmentation.
Tests on diverse medical image segmentation tasks, including multi-organ, liver tumor, and bladder tumor segmentation, demonstrate that ASSNet achieves state-of-the-art results.
arXiv Detail & Related papers (2024-09-12T06:25:44Z) - DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut [62.63481844384229]
Foundation models have emerged as powerful tools across various domains including language, vision, and multimodal tasks.
In this paper, we use a diffusion UNet encoder as a foundation vision encoder and introduce DiffCut, an unsupervised zero-shot segmentation method.
Our work highlights the remarkably accurate semantic knowledge embedded within diffusion UNet encoders that could then serve as foundation vision encoders for downstream tasks.
arXiv Detail & Related papers (2024-06-05T01:32:31Z) - More complex encoder is not all you need [0.882348769487259]
We introduce neU-Net (i.e., not complex encoder U-Net), which incorporates a novel Sub-pixel Convolution for upsampling to construct a powerful decoder.
Our model design achieves excellent results, surpassing other state-of-the-art methods on both the Synapse and ACDC datasets.
arXiv Detail & Related papers (2023-09-20T08:34:38Z) - SegNetr: Rethinking the local-global interactions and skip connections
in U-shaped networks [1.121518046252855]
U-shaped networks have dominated the field of medical image segmentation due to their simple and easily tuned structure.
We introduce a novel SegNetr block that can perform local-global interactions dynamically at any stage and with only linear complexity.
We validate the effectiveness of SegNetr on four mainstream medical image segmentation datasets, with 59% and 76% fewer parameters and GFLOPs than vanilla U-Net.
arXiv Detail & Related papers (2023-07-06T12:39:06Z) - Joint Channel Estimation and Feedback with Masked Token Transformers in
Massive MIMO Systems [74.52117784544758]
This paper proposes an encoder-decoder based network that unveils the intrinsic frequency-domain correlation within the CSI matrix.
The entire encoder-decoder network is utilized for channel compression.
Our method outperforms state-of-the-art channel estimation and feedback techniques in joint tasks.
arXiv Detail & Related papers (2023-06-08T06:15:17Z) - Object Segmentation by Mining Cross-Modal Semantics [68.88086621181628]
We propose a novel approach by mining the Cross-Modal Semantics to guide the fusion and decoding of multimodal features.
Specifically, we propose a novel network, termed XMSNet, consisting of (1) all-round attentive fusion (AF), (2) coarse-to-fine decoder (CFD), and (3) cross-layer self-supervision.
arXiv Detail & Related papers (2023-05-17T14:30:11Z) - Towards Diverse Binary Segmentation via A Simple yet General Gated Network [71.19503376629083]
We propose a simple yet general gated network (GateNet) to tackle binary segmentation tasks.
With the help of multi-level gate units, the valuable context information from the encoder can be selectively transmitted to the decoder.
We introduce a "Fold" operation to improve the atrous convolution and form a novel folded atrous convolution.
arXiv Detail & Related papers (2023-03-18T11:26:36Z) - MA-Unet: An improved version of Unet based on multi-scale and attention
mechanism for medical image segmentation [4.082245106486775]
convolutional neural networks (CNNs) are promoting the development of medical image semantic segmentation.
In this paper, we try to eliminate semantic ambiguity in skip connection operations by adding attention gates (AGs)
Our model obtains better segmentation performance while introducing fewer parameters.
arXiv Detail & Related papers (2020-12-20T15:29:18Z) - Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery [37.36402108843216]
Unsupervised spoken term discovery consists of two tasks: finding the acoustic segment boundaries and labeling acoustically similar segments with the same labels.
We perform segmentation based on the assumption that the frame feature vectors are more similar within a segment than across the segments.
We achieve this via a self-expressing autoencoder framework.
arXiv Detail & Related papers (2020-07-26T00:04:17Z) - Motion-Attentive Transition for Zero-Shot Video Object Segmentation [99.44383412488703]
We present a Motion-Attentive Transition Network (MATNet) for zero-shot object segmentation.
An asymmetric attention block, called Motion-Attentive Transition (MAT), is designed within a two-stream encoder.
In this way, the encoder becomes deeply internative, allowing for closely hierarchical interactions between object motion and appearance.
arXiv Detail & Related papers (2020-03-09T16:58:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.