Hepatic vessel segmentation based on 3Dswin-transformer with inductive
biased multi-head self-attention
- URL: http://arxiv.org/abs/2111.03368v1
- Date: Fri, 5 Nov 2021 10:17:08 GMT
- Title: Hepatic vessel segmentation based on 3Dswin-transformer with inductive
biased multi-head self-attention
- Authors: Mian Wu, Yinling Qian, Xiangyun Liao, Qiong Wang and Pheng-Ann Heng
- Abstract summary: We propose a robust end-to-end vessel segmentation network called Indu BIased Multi-Head Attention Vessel Net.
We introduce the voxel-wise embedding rather than patch-wise embedding to locate precise liver vessel voxels.
On the other hand, we propose inductive biased multi-head self-attention which learns inductive biased relative positional embedding from absolute position embedding.
- Score: 46.46365941681487
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Purpose: Segmentation of liver vessels from CT images is indispensable prior
to surgical planning and aroused broad range of interests in the medical image
analysis community. Due to the complex structure and low contrast background,
automatic liver vessel segmentation remains particularly challenging. Most of
the related researches adopt FCN, U-net, and V-net variants as a backbone.
However, these methods mainly focus on capturing multi-scale local features
which may produce misclassified voxels due to the convolutional operator's
limited locality reception field.
Methods: We propose a robust end-to-end vessel segmentation network called
Inductive BIased Multi-Head Attention Vessel Net(IBIMHAV-Net) by expanding swin
transformer to 3D and employing an effective combination of convolution and
self-attention. In practice, we introduce the voxel-wise embedding rather than
patch-wise embedding to locate precise liver vessel voxels, and adopt
multi-scale convolutional operators to gain local spatial information. On the
other hand, we propose the inductive biased multi-head self-attention which
learns inductive biased relative positional embedding from initialized absolute
position embedding. Based on this, we can gain a more reliable query and key
matrix. To validate the generalization of our model, we test on samples which
have different structural complexity.
Results: We conducted experiments on the 3DIRCADb datasets. The average dice
and sensitivity of the four tested cases were 74.8% and 77.5%, which exceed
results of existing deep learning methods and improved graph cuts method.
Conclusion: The proposed model IBIMHAV-Net provides an automatic, accurate 3D
liver vessel segmentation with an interleaved architecture that better utilizes
both global and local spatial features in CT volumes. It can be further
extended for other clinical data.
Related papers
- Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing [14.0825980706386]
Volumetric medical image segmentation is a fundamental problem in medical image analysis where the objective is to accurately classify a given 3D volumetric medical image with voxel-level precision.
In this work, we propose a novel hierarchical encoder-decoder-based framework that strives to explicitly capture the local and global dependencies for 3D medical image segmentation.
The proposed framework exploits local volume-based self-attention to encode the local dependencies at high resolution and introduces a novel volumetric-mixer to capture the global dependencies at low-resolution feature representations.
arXiv Detail & Related papers (2024-10-20T11:08:38Z) - SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion
Classification Using 3D Multi-Phase Imaging [59.78761085714715]
This study proposes a novel Siamese Dual-Resolution Transformer (SDR-Former) framework for liver lesion classification.
The proposed framework has been validated through comprehensive experiments on two clinical datasets.
To support the scientific community, we are releasing our extensive multi-phase MR dataset for liver lesion analysis to the public.
arXiv Detail & Related papers (2024-02-27T06:32:56Z) - CIS-UNet: Multi-Class Segmentation of the Aorta in Computed Tomography
Angiography via Context-Aware Shifted Window Self-Attention [10.335899694123711]
We introduce Context Infused Swin-UNet (CIS-UNet), a deep learning model for aortic segmentation.
CIS-UNet adopts a hierarchical encoder-decoder structure comprising a CNN encoder, symmetric decoder, skip connections, and a novel Context-aware Shifted Window Self-Attention (CSW-SA) as the bottleneck block.
We trained our model on computed tomography (CT) scans from 44 patients and tested it on 15 patients. CIS-UNet outperformed the state-of-the-art SwinUNetR segmentation model, by achieving a superior mean Dice coefficient of 0.713 compared
arXiv Detail & Related papers (2024-01-23T19:17:20Z) - Leveraging Frequency Domain Learning in 3D Vessel Segmentation [50.54833091336862]
In this study, we leverage Fourier domain learning as a substitute for multi-scale convolutional kernels in 3D hierarchical segmentation models.
We show that our novel network achieves remarkable dice performance (84.37% on ASACA500 and 80.32% on ImageCAS) in tubular vessel segmentation tasks.
arXiv Detail & Related papers (2024-01-11T19:07:58Z) - Graph Convolution Based Cross-Network Multi-Scale Feature Fusion for
Deep Vessel Segmentation [36.12106059822428]
Vessels reconstructed using existing methods are often not sufficiently accurate to meet clinical use standards.
Our network consists of two cascadedworks performing initial and refined segmentation respectively.
Cross-network multi-scale feature fusion is performed between these two U-shaped networks to effectively support high-quality vessel segmentation.
arXiv Detail & Related papers (2023-01-06T05:56:50Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - A unified 3D framework for Organs at Risk Localization and Segmentation
for Radiation Therapy Planning [56.52933974838905]
Current medical workflow requires manual delineation of organs-at-risk (OAR)
In this work, we aim to introduce a unified 3D pipeline for OAR localization-segmentation.
Our proposed framework fully enables the exploitation of 3D context information inherent in medical imaging.
arXiv Detail & Related papers (2022-03-01T17:08:41Z) - Contextual Information Enhanced Convolutional Neural Networks for
Retinal Vessel Segmentation in Color Fundus Images [0.0]
An automatic retinal vessel segmentation system can effectively facilitate clinical diagnosis and ophthalmological research.
A deep learning based method has been proposed and several customized modules have been integrated into the well-known encoder-decoder architecture U-net.
As a result, the proposed method outperforms the work of predecessors and achieves state-of-the-art performance in Sensitivity/Recall, F1-score and MCC.
arXiv Detail & Related papers (2021-03-25T06:10:47Z) - Multi-Task Neural Networks with Spatial Activation for Retinal Vessel
Segmentation and Artery/Vein Classification [49.64863177155927]
We propose a multi-task deep neural network with spatial activation mechanism to segment full retinal vessel, artery and vein simultaneously.
The proposed network achieves pixel-wise accuracy of 95.70% for vessel segmentation, and A/V classification accuracy of 94.50%, which is the state-of-the-art performance for both tasks.
arXiv Detail & Related papers (2020-07-18T05:46:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.