Context-Aware Optimal Transport Learning for Retinal Fundus Image Enhancement
- URL: http://arxiv.org/abs/2409.07862v1
- Date: Thu, 12 Sep 2024 09:14:37 GMT
- Title: Context-Aware Optimal Transport Learning for Retinal Fundus Image Enhancement
- Authors: Vamsi Krishna Vasa, Peijie Qiu, Wenhui Zhu, Yujian Xiong, Oana Dumitrascu, Yalin Wang,
- Abstract summary: This paper proposes a context-informed optimal transport (OT) learning framework for tackling unpaired fundus image enhancement.
We derive the proposed context-aware OT using the earth's distance mover and show that the proposed context-OT has a solid theoretical guarantee.
Experimental results on a large-scale dataset demonstrate the superiority of the proposed method over several state-of-the-art supervised and unsupervised methods.
- Score: 1.8339026473337505
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Retinal fundus photography offers a non-invasive way to diagnose and monitor a variety of retinal diseases, but is prone to inherent quality glitches arising from systemic imperfections or operator/patient-related factors. However, high-quality retinal images are crucial for carrying out accurate diagnoses and automated analyses. The fundus image enhancement is typically formulated as a distribution alignment problem, by finding a one-to-one mapping between a low-quality image and its high-quality counterpart. This paper proposes a context-informed optimal transport (OT) learning framework for tackling unpaired fundus image enhancement. In contrast to standard generative image enhancement methods, which struggle with handling contextual information (e.g., over-tampered local structures and unwanted artifacts), the proposed context-aware OT learning paradigm better preserves local structures and minimizes unwanted artifacts. Leveraging deep contextual features, we derive the proposed context-aware OT using the earth mover's distance and show that the proposed context-OT has a solid theoretical guarantee. Experimental results on a large-scale dataset demonstrate the superiority of the proposed method over several state-of-the-art supervised and unsupervised methods in terms of signal-to-noise ratio, structural similarity index, as well as two downstream tasks. The code is available at \url{https://github.com/Retinal-Research/Contextual-OT}.
Related papers
- TPOT: Topology Preserving Optimal Transport in Retinal Fundus Image Enhancement [16.84367978693017]
We propose a training paradigm that regularizes blood vessel structures by minimizing the differences of persistence diagrams.
We call the resulting framework Topology Preserving Optimal Transport (TPOT)
Experimental results on a large-scale dataset demonstrate the superiority of the proposed method compared to several state-of-the-art supervised and unsupervised techniques.
arXiv Detail & Related papers (2024-11-03T02:04:35Z) - CUNSB-RFIE: Context-aware Unpaired Neural Schr"{o}dinger Bridge in Retinal Fundus Image Enhancement [15.399449331371402]
We propose an image-to-image translation pipeline for retinal image enhancement.
We name the resulting retinal fundus image enhancement framework the Context-aware Unpaired Neural Schr"odinger Bridge (CUNSB-RFIE)
arXiv Detail & Related papers (2024-09-17T08:07:29Z) - Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis [65.7968515029306]
We propose a novel Coarse-to-Fine Latent Diffusion (CFLD) method for Pose-Guided Person Image Synthesis (PGPIS)
A perception-refined decoder is designed to progressively refine a set of learnable queries and extract semantic understanding of person images as a coarse-grained prompt.
arXiv Detail & Related papers (2024-02-28T06:07:07Z) - DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - OTRE: Where Optimal Transport Guided Unpaired Image-to-Image Translation
Meets Regularization by Enhancing [4.951748109810726]
Optimal retinal image quality is mandated for accurate medical diagnoses and automated analyses.
We propose an unpaired image-to-image translation scheme for mapping low-quality retinal CFPs to high-quality counterparts.
We validated the integrated framework, OTRE, on three publicly available retinal image datasets.
arXiv Detail & Related papers (2023-02-06T18:39:40Z) - Optimal Transport Guided Unsupervised Learning for Enhancing low-quality
Retinal Images [5.4240246179935845]
Real-world non-mydriatic retinal fundus photography is prone to artifacts, imperfections and low-quality.
We propose a simple but effective end-to-end framework for enhancing poor-quality retinal fundus images.
arXiv Detail & Related papers (2023-02-06T18:29:30Z) - Restoring Vision in Hazy Weather with Hierarchical Contrastive Learning [53.85892601302974]
We propose an effective image dehazing method named Hierarchical Contrastive Dehazing (HCD)
HCD consists of a hierarchical dehazing network (HDN) and a novel hierarchical contrastive loss (HCL)
arXiv Detail & Related papers (2022-12-22T03:57:06Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Parametric Scaling of Preprocessing assisted U-net Architecture for
Improvised Retinal Vessel Segmentation [1.3869502085838448]
We present an image enhancement technique based on the morphological preprocessing coupled with a scaled U-net architecture.
A significant improvement as compared to the other algorithms in the domain, in terms of the area under ROC curve (>0.9762) and classification accuracy (>95.47%) are evident from the results.
arXiv Detail & Related papers (2022-03-18T15:26:05Z) - Sharp-GAN: Sharpness Loss Regularized GAN for Histopathology Image
Synthesis [65.47507533905188]
Conditional generative adversarial networks have been applied to generate synthetic histopathology images.
We propose a sharpness loss regularized generative adversarial network to synthesize realistic histopathology images.
arXiv Detail & Related papers (2021-10-27T18:54:25Z) - Modeling and Enhancing Low-quality Retinal Fundus Images [167.02325845822276]
Low-quality fundus images increase uncertainty in clinical observation and lead to the risk of misdiagnosis.
We propose a clinically oriented fundus enhancement network (cofe-Net) to suppress global degradation factors.
Experiments on both synthetic and real images demonstrate that our algorithm effectively corrects low-quality fundus images without losing retinal details.
arXiv Detail & Related papers (2020-05-12T08:01:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.