The Little W-Net That Could: State-of-the-Art Retinal Vessel
Segmentation with Minimalistic Models
- URL: http://arxiv.org/abs/2009.01907v1
- Date: Thu, 3 Sep 2020 19:59:51 GMT
- Title: The Little W-Net That Could: State-of-the-Art Retinal Vessel
Segmentation with Minimalistic Models
- Authors: Adrian Galdran, Andr\'e Anjos, Jos\'e Dolz, Hadi Chakor, Herv\'e
Lombaert, Ismail Ben Ayed
- Abstract summary: We show that a minimalistic version of a standard U-Net with several orders of magnitude less parameters closely approximates the performance of current best techniques.
We also propose a simple extension, dubbed W-Net, which reaches outstanding performance on several popular datasets.
We also test our approach on the Artery/Vein segmentation problem, where we again achieve results well-aligned with the state-of-the-art.
- Score: 19.089445797922316
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The segmentation of the retinal vasculature from eye fundus images represents
one of the most fundamental tasks in retinal image analysis. Over recent years,
increasingly complex approaches based on sophisticated Convolutional Neural
Network architectures have been slowly pushing performance on well-established
benchmark datasets. In this paper, we take a step back and analyze the real
need of such complexity. Specifically, we demonstrate that a minimalistic
version of a standard U-Net with several orders of magnitude less parameters,
carefully trained and rigorously evaluated, closely approximates the
performance of current best techniques. In addition, we propose a simple
extension, dubbed W-Net, which reaches outstanding performance on several
popular datasets, still using orders of magnitude less learnable weights than
any previously published approach. Furthermore, we provide the most
comprehensive cross-dataset performance analysis to date, involving up to 10
different databases. Our analysis demonstrates that the retinal vessel
segmentation problem is far from solved when considering test images that
differ substantially from the training data, and that this task represents an
ideal scenario for the exploration of domain adaptation techniques. In this
context, we experiment with a simple self-labeling strategy that allows us to
moderately enhance cross-dataset performance, indicating that there is still
much room for improvement in this area. Finally, we also test our approach on
the Artery/Vein segmentation problem, where we again achieve results
well-aligned with the state-of-the-art, at a fraction of the model complexity
in recent literature. All the code to reproduce the results in this paper is
released.
Related papers
- Few-shot Online Anomaly Detection and Segmentation [29.693357653538474]
This paper focuses on addressing the challenging yet practical few-shot online anomaly detection and segmentation (FOADS) task.
Under the FOADS framework, models are trained on a few-shot normal dataset, followed by inspection and improvement of their capabilities by leveraging unlabeled streaming data containing both normal and abnormal samples simultaneously.
In order to achieve improved performance with limited training samples, we employ multi-scale feature embedding extracted from a CNN pre-trained on ImageNet to obtain a robust representation.
arXiv Detail & Related papers (2024-03-27T02:24:00Z) - Challenging the Myth of Graph Collaborative Filtering: a Reasoned and Reproducibility-driven Analysis [50.972595036856035]
We present a code that successfully replicates results from six popular and recent graph recommendation models.
We compare these graph models with traditional collaborative filtering models that historically performed well in offline evaluations.
By investigating the information flow from users' neighborhoods, we aim to identify which models are influenced by intrinsic features in the dataset structure.
arXiv Detail & Related papers (2023-08-01T09:31:44Z) - Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based
Action Recognition [88.34182299496074]
Action labels are only available on a source dataset, but unavailable on a target dataset in the training stage.
We utilize a self-supervision scheme to reduce the domain shift between two skeleton-based action datasets.
By segmenting and permuting temporal segments or human body parts, we design two self-supervised learning classification tasks.
arXiv Detail & Related papers (2022-07-17T07:05:39Z) - Last Layer Re-Training is Sufficient for Robustness to Spurious
Correlations [51.552870594221865]
We show that last layer retraining can match or outperform state-of-the-art approaches on spurious correlation benchmarks.
We also show that last layer retraining on large ImageNet-trained models can significantly reduce reliance on background and texture information.
arXiv Detail & Related papers (2022-04-06T16:55:41Z) - End-to-end Neuron Instance Segmentation based on Weakly Supervised
Efficient UNet and Morphological Post-processing [0.0]
We present an end-to-end weakly-supervised framework to automatically detect and segment NeuN stained neuronal cells on histological images.
We integrate the state-of-the-art network, EfficientNet, into our U-Net-like architecture.
arXiv Detail & Related papers (2022-02-17T14:35:45Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - The Devil is in Classification: A Simple Framework for Long-tail Object
Detection and Instance Segmentation [93.17367076148348]
We investigate performance drop of the state-of-the-art two-stage instance segmentation model Mask R-CNN on the recent long-tail LVIS dataset.
We unveil that a major cause is the inaccurate classification of object proposals.
We propose a simple calibration framework to more effectively alleviate classification head bias with a bi-level class balanced sampling approach.
arXiv Detail & Related papers (2020-07-23T12:49:07Z) - On Robustness and Transferability of Convolutional Neural Networks [147.71743081671508]
Modern deep convolutional networks (CNNs) are often criticized for not generalizing under distributional shifts.
We study the interplay between out-of-distribution and transfer performance of modern image classification CNNs for the first time.
We find that increasing both the training set and model sizes significantly improve the distributional shift robustness.
arXiv Detail & Related papers (2020-07-16T18:39:04Z) - Segmentation of Macular Edema Datasets with Small Residual 3D U-Net
Architectures [5.881334886616738]
This paper investigates the application of deep convolutional neural networks with prohibitively small datasets to the problem of macular edema segmentation.
We find that, contrary to popular belief, neural architectures within this application setting are able to achieve close to human-level performance on unseen test images without requiring large numbers of training examples.
arXiv Detail & Related papers (2020-05-10T15:34:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.