Retinal Structure Detection in OCTA Image via Voting-based Multi-task
Learning
- URL: http://arxiv.org/abs/2208.10745v1
- Date: Tue, 23 Aug 2022 05:53:04 GMT
- Title: Retinal Structure Detection in OCTA Image via Voting-based Multi-task
Learning
- Authors: Jinkui Hao, Ting Shen, Xueli Zhu, Yonghuai Liu, Ardhendu Behera, Dan
Zhang, Bang Chen, Jiang Liu, Jiong Zhang, Yitian Zhao
- Abstract summary: We propose a novel Voting-based Adaptive Feature Fusion multi-task network (VAFF-Net) for joint segmentation, detection, and classification of RV, FAZ, and RVJ.
A task-specific voting gate module is proposed to adaptively extract and fuse different features for specific tasks at two levels.
To facilitate further research, part of these datasets with the source code and evaluation benchmark have been released for public access.
- Score: 27.637273690432608
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Automated detection of retinal structures, such as retinal vessels (RV), the
foveal avascular zone (FAZ), and retinal vascular junctions (RVJ), are of great
importance for understanding diseases of the eye and clinical decision-making.
In this paper, we propose a novel Voting-based Adaptive Feature Fusion
multi-task network (VAFF-Net) for joint segmentation, detection, and
classification of RV, FAZ, and RVJ in optical coherence tomography angiography
(OCTA). A task-specific voting gate module is proposed to adaptively extract
and fuse different features for specific tasks at two levels: features at
different spatial positions from a single encoder, and features from multiple
encoders. In particular, since the complexity of the microvasculature in OCTA
images makes simultaneous precise localization and classification of retinal
vascular junctions into bifurcation/crossing a challenging task, we
specifically design a task head by combining the heatmap regression and grid
classification. We take advantage of three different \textit{en face}
angiograms from various retinal layers, rather than following existing methods
that use only a single \textit{en face}. To facilitate further research, part
of these datasets with the source code and evaluation benchmark have been
released for public access:https://github.com/iMED-Lab/VAFF-Net.
Related papers
- DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and
Authentication [50.017055360261665]
We introduce DiffVein, a unified diffusion model-based framework which simultaneously addresses vein segmentation and authentication tasks.
For better feature interaction between these two branches, we introduce two specialized modules.
In this way, our framework allows for a dynamic interplay between diffusion and segmentation embeddings.
arXiv Detail & Related papers (2024-02-03T06:49:42Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - M$^{2}$SNet: Multi-scale in Multi-scale Subtraction Network for Medical
Image Segmentation [73.10707675345253]
We propose a general multi-scale in multi-scale subtraction network (M$2$SNet) to finish diverse segmentation from medical image.
Our method performs favorably against most state-of-the-art methods under different evaluation metrics on eleven datasets of four different medical image segmentation tasks.
arXiv Detail & Related papers (2023-03-20T06:26:49Z) - Affinity Feature Strengthening for Accurate, Complete and Robust Vessel
Segmentation [48.638327652506284]
Vessel segmentation is crucial in many medical image applications, such as detecting coronary stenoses, retinal vessel diseases and brain aneurysms.
We present a novel approach, the affinity feature strengthening network (AFN), which jointly models geometry and refines pixel-wise segmentation features using a contrast-insensitive, multiscale affinity approach.
arXiv Detail & Related papers (2022-11-12T05:39:17Z) - RetiFluidNet: A Self-Adaptive and Multi-Attention Deep Convolutional
Network for Retinal OCT Fluid Segmentation [3.57686754209902]
Quantification of retinal fluids is necessary for OCT-guided treatment management.
New convolutional neural architecture named RetiFluidNet is proposed for multi-class retinal fluid segmentation.
Model benefits from hierarchical representation learning of textural, contextual, and edge features.
arXiv Detail & Related papers (2022-09-26T07:18:00Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - (M)SLAe-Net: Multi-Scale Multi-Level Attention embedded Network for
Retinal Vessel Segmentation [0.0]
We propose a multi-scale, multi-level attention embedded CNN architecture ((M)SLAe-Net) to address the issue of multi-stage processing.
We do this by extracting features at multiple scales and multiple levels of the network, enabling our model to holistically extracts the local and global features.
Our unique network design and novel D-DPP module with efficient task-specific loss function for thin vessels enabled our model for better cross data performance.
arXiv Detail & Related papers (2021-09-05T14:29:00Z) - Projective Skip-Connections for Segmentation Along a Subset of
Dimensions in Retinal OCT [5.252775177165399]
In medical imaging, there are clinically relevant segmentation tasks where the output mask is a projection to a subset of input image dimensions.
We propose a novel convolutional neural network architecture that can effectively learn to produce a lower-dimensional segmentation mask.
The network restores encoded representation only in a subset of input spatial dimensions and keeps the representation unchanged in the others.
arXiv Detail & Related papers (2021-08-02T12:41:58Z) - Unsupervised Bidirectional Cross-Modality Adaptation via Deeply
Synergistic Image and Feature Alignment for Medical Image Segmentation [73.84166499988443]
We present a novel unsupervised domain adaptation framework, named as Synergistic Image and Feature Alignment (SIFA)
Our proposed SIFA conducts synergistic alignment of domains from both image and feature perspectives.
Experimental results on two different tasks demonstrate that our SIFA method is effective in improving segmentation performance on unlabeled target images.
arXiv Detail & Related papers (2020-02-06T13:49:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.