Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing
  Vertical and Horizontal Convolutions
        - URL: http://arxiv.org/abs/2107.11517v1
- Date: Sat, 24 Jul 2021 02:58:32 GMT
- Title: Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing
  Vertical and Horizontal Convolutions
- Authors: Qian Yu, Lei Qi, Luping Zhou, Lei Wang, Yilong Yin, Yinghuan Shi,
  Wuzhang Wang, Yang Gao
- Abstract summary: We present a novel double-branch encoder architecture for medical image segmentation.
Our architecture is inspired by two observations: 1) Since the discrimination of features learned via square convolutional kernels needs to be further improved, we propose to utilize non-square vertical and horizontal convolutional kernels.
The experiments validate the effectiveness of our model on four datasets.
- Score: 58.71117402626524
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Accurate image segmentation plays a crucial role in medical image analysis,
yet it faces great challenges of various shapes, diverse sizes, and blurry
boundaries. To address these difficulties, square kernel-based encoder-decoder
architecture has been proposed and widely used, but its performance remains
still unsatisfactory. To further cope with these challenges, we present a novel
double-branch encoder architecture. Our architecture is inspired by two
observations: 1) Since the discrimination of features learned via square
convolutional kernels needs to be further improved, we propose to utilize
non-square vertical and horizontal convolutional kernels in the double-branch
encoder, so features learned by the two branches can be expected to complement
each other. 2) Considering that spatial attention can help models to better
focus on the target region in a large-sized image, we develop an attention loss
to further emphasize the segmentation on small-sized targets. Together, the
above two schemes give rise to a novel double-branch encoder segmentation
framework for medical image segmentation, namely Crosslink-Net. The experiments
validate the effectiveness of our model on four datasets. The code is released
at https://github.com/Qianyu1226/Crosslink-Net.
 
      
        Related papers
        - Rethinking Decoder Design: Improving Biomarker Segmentation Using   Depth-to-Space Restoration and Residual Linear Attention [2.0799865428691393]
 We propose an architecture that captures multi-scale local and global contextual information and a novel decoder design.<n>Our method achieves absolute performance gains of 2.76% on MoNuSeg, 3.12% on DSB, 2.87% on Electron Microscopy, and 4.03% on TNBC datasets.
 arXiv  Detail & Related papers  (2025-06-23T06:32:36Z)
- Dual-scale Enhanced and Cross-generative Consistency Learning for   Semi-supervised Medical Image Segmentation [49.57907601086494]
 Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
 arXiv  Detail & Related papers  (2023-12-26T12:56:31Z)
- Triple-View Knowledge Distillation for Semi-Supervised Semantic
  Segmentation [54.23510028456082]
 We propose a Triple-view Knowledge Distillation framework, termed TriKD, for semi-supervised semantic segmentation.
The framework includes the triple-view encoder and the dual-frequency decoder.
 arXiv  Detail & Related papers  (2023-09-22T01:02:21Z)
- Towards Diverse Binary Segmentation via A Simple yet General Gated   Network [71.19503376629083]
 We propose a simple yet general gated network (GateNet) to tackle binary segmentation tasks.
With the help of multi-level gate units, the valuable context information from the encoder can be selectively transmitted to the decoder.
We introduce a "Fold" operation to improve the atrous convolution and form a novel folded atrous convolution.
 arXiv  Detail & Related papers  (2023-03-18T11:26:36Z)
- LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text
  Retrieval [117.15862403330121]
 We propose LoopITR, which combines dual encoders and cross encoders in the same network for joint learning.
Specifically, we let the dual encoder provide hard negatives to the cross encoder, and use the more discriminative cross encoder to distill its predictions back to the dual encoder.
 arXiv  Detail & Related papers  (2022-03-10T16:41:12Z)
- Attention W-Net: Improved Skip Connections for better Representations [5.027571997864707]
 We propose Attention W-Net, a new U-Net based architecture for retinal vessel segmentation.
We observe an AUC and F1-Score of 0.8407 and 0.9833 - a sizeable improvement over its LadderNet backbone.
 arXiv  Detail & Related papers  (2021-10-17T12:44:36Z)
- Suppress and Balance: A Simple Gated Network for Salient Object
  Detection [89.88222217065858]
 We propose a simple gated network (GateNet) to solve both issues at once.
With the help of multilevel gate units, the valuable context information from the encoder can be optimally transmitted to the decoder.
In addition, we adopt the atrous spatial pyramid pooling based on the proposed "Fold" operation (Fold-ASPP) to accurately localize salient objects of various scales.
 arXiv  Detail & Related papers  (2020-07-16T02:00:53Z)
- CF2-Net: Coarse-to-Fine Fusion Convolutional Network for Breast
  Ultrasound Image Segmentation [14.807364495808779]
 We propose and evaluate a coarse-to-fine fusion convolutional network (CF2-Net) based on a novel feature integration strategy (forming an 'E'-like type) for BUS image segmentation.
The proposed CF2-Net was evaluated on an open dataset by using four-fold cross validation.
The results of the experiment demonstrate that the CF2-Net obtains state-of-the-art performance when compared with other deep learning-based methods.
 arXiv  Detail & Related papers  (2020-03-23T09:27:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.