Sequential vessel segmentation via deep channel attention network
- URL: http://arxiv.org/abs/2102.05229v1
- Date: Wed, 10 Feb 2021 02:45:08 GMT
- Title: Sequential vessel segmentation via deep channel attention network
- Authors: Dongdong Hao, Song Ding, Linwei Qiu, Yisong Lv, Baowei Fei, Yueqi Zhu,
Binjie Qin
- Abstract summary: This paper develops a novel encoder-decoder deep network architecture.
It exploits the several contextual frames of 2D+t sequential images in a sliding window centered at current frame to segment 2D vessel masks from the current frame.
The architecture is equipped with temporal-spatial feature extraction in encoder stage, feature fusion in skip connection layers and channel attention mechanism in decoder stage.
- Score: 5.941874421818899
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This paper develops a novel encoder-decoder deep network architecture which
exploits the several contextual frames of 2D+t sequential images in a sliding
window centered at current frame to segment 2D vessel masks from the current
frame. The architecture is equipped with temporal-spatial feature extraction in
encoder stage, feature fusion in skip connection layers and channel attention
mechanism in decoder stage. In the encoder stage, a series of 3D convolutional
layers are employed to hierarchically extract temporal-spatial features. Skip
connection layers subsequently fuse the temporal-spatial feature maps and
deliver them to the corresponding decoder stages. To efficiently discriminate
vessel features from the complex and noisy backgrounds in the XCA images, the
decoder stage effectively utilizes channel attention blocks to refine the
intermediate feature maps from skip connection layers for subsequently decoding
the refined features in 2D ways to produce the segmented vessel masks.
Furthermore, Dice loss function is implemented to train the proposed deep
network in order to tackle the class imbalance problem in the XCA data due to
the wide distribution of complex background artifacts. Extensive experiments by
comparing our method with other state-of-the-art algorithms demonstrate the
proposed method's superior performance over other methods in terms of the
quantitative metrics and visual validation. The source codes are at
https://github.com/Binjie-Qin/SVS-net
Related papers
- FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side
Information [44.88123177525665]
Multi-view compression technology, especially Stereo Image Compression (SIC), plays a crucial role in car-mounted cameras and 3D-related applications.
We propose a Feature-based Fast Cascade Alignment network (FFCA-Net) to fully leverage the side information on the decoder.
Our approach achieves significant gains in terms of 3 to 10-fold faster decoding speed than other methods.
arXiv Detail & Related papers (2023-12-28T11:12:03Z) - Triple-View Knowledge Distillation for Semi-Supervised Semantic
Segmentation [54.23510028456082]
We propose a Triple-view Knowledge Distillation framework, termed TriKD, for semi-supervised semantic segmentation.
The framework includes the triple-view encoder and the dual-frequency decoder.
arXiv Detail & Related papers (2023-09-22T01:02:21Z) - S^2-Transformer for Mask-Aware Hyperspectral Image Reconstruction [48.83280067393851]
A representative hyperspectral image acquisition procedure conducts a 3D-to-2D encoding by the coded aperture snapshot spectral imager (CASSI)
Two major challenges stand in the way of a high-fidelity reconstruction: (i) To obtain 2D measurements, CASSI dislocates multiple channels by disperser-titling and squeezes them onto the same spatial region, yielding an entangled data loss.
We propose a spatial-spectral (S2-) transformer architecture with a mask-aware learning strategy to tackle these challenges.
arXiv Detail & Related papers (2022-09-24T19:26:46Z) - Adjacent Context Coordination Network for Salient Object Detection in
Optical Remote Sensing Images [102.75699068451166]
We propose a novel Adjacent Context Coordination Network (ACCoNet) to explore the coordination of adjacent features in an encoder-decoder architecture for optical RSI-SOD.
The proposed ACCoNet outperforms 22 state-of-the-art methods under nine evaluation metrics, and runs up to 81 fps on a single NVIDIA Titan X GPU.
arXiv Detail & Related papers (2022-03-25T14:14:55Z) - Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing
Vertical and Horizontal Convolutions [58.71117402626524]
We present a novel double-branch encoder architecture for medical image segmentation.
Our architecture is inspired by two observations: 1) Since the discrimination of features learned via square convolutional kernels needs to be further improved, we propose to utilize non-square vertical and horizontal convolutional kernels.
The experiments validate the effectiveness of our model on four datasets.
arXiv Detail & Related papers (2021-07-24T02:58:32Z) - FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point
Cloud Segmentation [30.736361776703568]
Scene understanding based on LiDAR point cloud is an essential task for autonomous cars to drive safely.
Most existing methods simply stack different point attributes/modalities as image channels to increase information capacity.
We design FPS-Net, a convolutional fusion network that exploits the uniqueness and discrepancy among the projected image channels for optimal point cloud segmentation.
arXiv Detail & Related papers (2021-03-01T04:08:28Z) - CovSegNet: A Multi Encoder-Decoder Architecture for Improved Lesion
Segmentation of COVID-19 Chest CT Scans [11.946078871080836]
An automated COVID-19 lesion segmentation scheme is proposed utilizing a highly efficient neural network architecture, namely CovSegNet.
Outstanding performances have been achieved in three publicly available datasets that largely outperform other state-of-the-art approaches.
arXiv Detail & Related papers (2020-12-02T19:26:35Z) - Two-stream Encoder-Decoder Network for Localizing Image Forgeries [4.982505311411925]
We propose a novel two-stream encoder-decoder network, which utilizes both the high-level and the low-level image features.
We have carried out experimental analysis on multiple standard forensics datasets to evaluate the performance of the proposed method.
arXiv Detail & Related papers (2020-09-27T15:49:17Z) - Beyond Single Stage Encoder-Decoder Networks: Deep Decoders for Semantic
Image Segmentation [56.44853893149365]
Single encoder-decoder methodologies for semantic segmentation are reaching their peak in terms of segmentation quality and efficiency per number of layers.
We propose a new architecture based on a decoder which uses a set of shallow networks for capturing more information content.
In order to further improve the architecture we introduce a weight function which aims to re-balance classes to increase the attention of the networks to under-represented objects.
arXiv Detail & Related papers (2020-07-19T18:44:34Z) - Rethinking and Improving Natural Language Generation with Layer-Wise
Multi-View Decoding [59.48857453699463]
In sequence-to-sequence learning, the decoder relies on the attention mechanism to efficiently extract information from the encoder.
Recent work has proposed to use representations from different encoder layers for diversified levels of information.
We propose layer-wise multi-view decoding, where for each decoder layer, together with the representations from the last encoder layer, which serve as a global view, those from other encoder layers are supplemented for a stereoscopic view of the source sequences.
arXiv Detail & Related papers (2020-05-16T20:00:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.