The Unreasonable Effectiveness of Encoder-Decoder Networks for Retinal
Vessel Segmentation
- URL: http://arxiv.org/abs/2011.12643v1
- Date: Wed, 25 Nov 2020 11:10:37 GMT
- Title: The Unreasonable Effectiveness of Encoder-Decoder Networks for Retinal
Vessel Segmentation
- Authors: Bj\"orn Browatzki, J\"orn-Philipp Lies, Christian Wallraven
- Abstract summary: We propose an encoder-decoder framework for the segmentation of blood vessels in retinal images.
Experiments on three fundus image datasets demonstrate that this approach achieves state-of-the-art results.
We show that this framework - called VLight - avoids overfitting to specific training images and generalizes well across different datasets.
- Score: 3.351714665243138
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We propose an encoder-decoder framework for the segmentation of blood vessels
in retinal images that relies on the extraction of large-scale patches at
multiple image-scales during training. Experiments on three fundus image
datasets demonstrate that this approach achieves state-of-the-art results and
can be implemented using a simple and efficient fully-convolutional network
with a parameter count of less than 0.8M. Furthermore, we show that this
framework - called VLight - avoids overfitting to specific training images and
generalizes well across different datasets, which makes it highly suitable for
real-world applications where robustness, accuracy as well as low inference
time on high-resolution fundus images is required.
Related papers
- Light-weight Retinal Layer Segmentation with Global Reasoning [14.558920359236572]
We propose LightReSeg for retinal layer segmentation which can be applied to OCT images.
Our approach achieves a better segmentation performance compared to the current state-of-the-art method TransUnet.
arXiv Detail & Related papers (2024-04-25T05:42:41Z) - Leveraging Neural Radiance Fields for Uncertainty-Aware Visual
Localization [56.95046107046027]
We propose to leverage Neural Radiance Fields (NeRF) to generate training samples for scene coordinate regression.
Despite NeRF's efficiency in rendering, many of the rendered data are polluted by artifacts or only contain minimal information gain.
arXiv Detail & Related papers (2023-10-10T20:11:13Z) - An Accurate and Efficient Neural Network for OCTA Vessel Segmentation
and a New Dataset [2.8743451550676866]
We propose an accurate and efficient neural network for retinal vessel segmentation in OCTA images.
The proposed network achieves accuracy comparable to other SOTA methods, while having fewer parameters and faster inference speed.
We create a new dataset containing 918 OCTA images and their corresponding vessel annotations.
arXiv Detail & Related papers (2023-09-18T04:47:12Z) - Cascaded Cross-Attention Networks for Data-Efficient Whole-Slide Image
Classification Using Transformers [0.11219061154635457]
Whole-Slide Imaging allows for the capturing and digitization of high-resolution images of histological specimen.
transformer architecture has been proposed as a possible candidate for effectively leveraging the high-resolution information.
We propose a novel cascaded cross-attention network (CCAN) based on the cross-attention mechanism that scales linearly with the number of extracted patches.
arXiv Detail & Related papers (2023-05-11T16:42:24Z) - Reducing Redundancy in the Bottleneck Representation of the Autoencoders [98.78384185493624]
Autoencoders are a type of unsupervised neural networks, which can be used to solve various tasks.
We propose a scheme to explicitly penalize feature redundancies in the bottleneck representation.
We tested our approach across different tasks: dimensionality reduction using three different dataset, image compression using the MNIST dataset, and image denoising using fashion MNIST.
arXiv Detail & Related papers (2022-02-09T18:48:02Z) - Leveraging Image Complexity in Macro-Level Neural Network Design for
Medical Image Segmentation [3.974175960216864]
We show that image complexity can be used as a guideline in choosing what is best for a given dataset.
For high-complexity datasets, a shallow network running on the original images may yield better segmentation results than a deep network running on downsampled images.
arXiv Detail & Related papers (2021-12-21T09:49:47Z) - Image Magnification Network for Vessel Segmentation in OCTA Images [5.535039549947958]
We propose a novel image magnification network (IMN) for vessel segmentation in OCTA images.
The experimental results on three open OCTA datasets show that the proposed IMN with an average dice score of 90.2% achieves the best performance.
arXiv Detail & Related papers (2021-10-26T06:18:38Z) - Weakly-supervised Learning For Catheter Segmentation in 3D Frustum
Ultrasound [74.22397862400177]
We propose a novel Frustum ultrasound based catheter segmentation method.
The proposed method achieved the state-of-the-art performance with an efficiency of 0.25 second per volume.
arXiv Detail & Related papers (2020-10-19T13:56:22Z) - Hierarchical Amortized Training for Memory-efficient High Resolution 3D
GAN [52.851990439671475]
We propose a novel end-to-end GAN architecture that can generate high-resolution 3D images.
We achieve this goal by using different configurations between training and inference.
Experiments on 3D thorax CT and brain MRI demonstrate that our approach outperforms state of the art in image generation.
arXiv Detail & Related papers (2020-08-05T02:33:04Z) - Hyperspectral Image Super-resolution via Deep Progressive Zero-centric
Residual Learning [62.52242684874278]
Cross-modality distribution of spatial and spectral information makes the problem challenging.
We propose a novel textitlightweight deep neural network-based framework, namely PZRes-Net.
Our framework learns a high resolution and textitzero-centric residual image, which contains high-frequency spatial details of the scene.
arXiv Detail & Related papers (2020-06-18T06:32:11Z) - Learning Deformable Image Registration from Optimization: Perspective,
Modules, Bilevel Training and Beyond [62.730497582218284]
We develop a new deep learning based framework to optimize a diffeomorphic model via multi-scale propagation.
We conduct two groups of image registration experiments on 3D volume datasets including image-to-atlas registration on brain MRI data and image-to-image registration on liver CT data.
arXiv Detail & Related papers (2020-04-30T03:23:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.