SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation
- URL: http://arxiv.org/abs/2510.16702v1
- Date: Sun, 19 Oct 2025 04:05:34 GMT
- Title: SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation
- Authors: Huy Minh Nhat Nguyen, Triet Hoang Minh Dao, Chau Vinh Hoang Truong, Cuong Tuan Nguyen,
- Abstract summary: We propose SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation.<n>Our novel approach leverages only noisy OCT images by first generating pseudo-ground-truth images through self-fusion and self-supervised denoising.<n>Performance improvements are validated via metrics such as Contrast-to-Noise Ratio (CNR), Mean Square Ratio (MSR), Texture Preservation (TP), and Edge Preservation (EP) on the real-world dataset from the IEEE Video and Image Processing Cup.
- Score: 1.1123754733827187
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Optical Coherence Tomography (OCT) is a widely used non-invasive imaging technique that provides detailed three-dimensional views of the retina, which are essential for the early and accurate diagnosis of ocular diseases. Consequently, OCT image analysis and processing have emerged as key research areas in biomedical imaging. However, acquiring paired datasets of clean and real-world noisy OCT images for supervised denoising models remains a formidable challenge due to intrinsic speckle noise and practical constraints in clinical imaging environments. To address these issues, we propose SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation. Our novel approach leverages only noisy OCT images by first generating pseudo-ground-truth images through self-fusion and self-supervised denoising. These refined images then serve as targets to train an ensemble of denoising models using a patch-based strategy that effectively enhances image clarity. Performance improvements are validated via metrics such as Contrast-to-Noise Ratio (CNR), Mean Square Ratio (MSR), Texture Preservation (TP), and Edge Preservation (EP) on the real-world dataset from the IEEE SPS Video and Image Processing Cup. Notably, the VIP Cup dataset contains only real-world noisy OCT images without clean references, highlighting our method's potential for improving image quality and diagnostic outcomes in clinical practice.
Related papers
- A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis [82.01597026329158]
We introduce a Correlation-Regulated Alignment Framework for Tissue Synthesis (CRAFTS) for pathology-specific text-to-image synthesis.<n>CRAFTS incorporates a novel alignment mechanism that suppresses semantic drift to ensure biological accuracy.<n>This model generates diverse pathological images spanning 30 cancer types, with quality rigorously validated by objective metrics and pathologist evaluations.
arXiv Detail & Related papers (2025-12-15T10:22:43Z) - A Denoising Framework for Real-World Ultra-Low Dose Lung CT Images Based on an Image Purification Strategy [1.6574413179773761]
Ultra-low dose CT (uLDCT) significantly reduces radiation exposure but introduces severe noise and artifacts.<n>It also leads to substantial spatial misalignment between uLDCT and normal dose CT (NDCT) image pairs.<n>This paper proposes an innovative denoising framework based on an Image Purification (IP) strategy.
arXiv Detail & Related papers (2025-10-08T19:40:38Z) - UOPSL: Unpaired OCT Predilection Sites Learning for Fundus Image Diagnosis Augmentation [47.08936359575974]
We propose a novel unpaired multimodal framework UOPSL that utilizes extensive OCT-derived spatial priors to dynamically identify predilection sites.<n>Our approach bridges unpaired fundus and OCTs via extended disease text descriptions.<n>Experiments conducted on 9 diverse datasets across 28 critical categories demonstrate that our framework outperforms existing benchmarks.
arXiv Detail & Related papers (2025-09-10T14:19:59Z) - Contextual Checkerboard Denoise -- A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising [1.8032335403003321]
We introduce a novel neural network based method -- emphContextual Checkerboard Denoising, that can learn denoising from only a dataset of noisy images.<n>Our proposed method significantly improves image quality, providing clearer and more detailed OCT images, while enhancing diagnostic accuracy.
arXiv Detail & Related papers (2024-11-29T08:51:43Z) - Unsupervised Training of a Dynamic Context-Aware Deep Denoising Framework for Low-Dose Fluoroscopic Imaging [6.130738760059542]
Fluoroscopy is critical for real-time X-ray visualization in medical imaging.
Low-dose images are compromised by noise, potentially affecting diagnostic accuracy.
We propose an unsupervised training framework for dynamic context-aware denoising of fluoroscopy image sequences.
arXiv Detail & Related papers (2024-10-29T13:39:31Z) - DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction [45.00528216648563]
Diffusion Prior Driven Neural Representation (DPER) is an unsupervised framework designed to address the exceptionally ill-posed CT reconstruction inverse problems.
DPER adopts the Half Quadratic Splitting (HQS) algorithm to decompose the inverse problem into data fidelity and distribution prior sub-problems.
We conduct comprehensive experiments to evaluate the performance of DPER on LACT and ultra-SVCT reconstruction with two public datasets.
arXiv Detail & Related papers (2024-04-27T12:55:13Z) - Self-supervised OCT Image Denoising with Slice-to-Slice Registration and
Reconstruction [5.972377737617966]
Learning-based self-supervised methods for structure-preserving noise reduction have demonstrated superior performance over traditional methods.
We introduce a new end-to-end self-supervised learning framework specifically tailored for OCT image denoising.
arXiv Detail & Related papers (2023-11-26T02:45:16Z) - Disruptive Autoencoders: Leveraging Low-level features for 3D Medical
Image Pre-training [51.16994853817024]
This work focuses on designing an effective pre-training framework for 3D radiology images.
We introduce Disruptive Autoencoders, a pre-training framework that attempts to reconstruct the original image from disruptions created by a combination of local masking and low-level perturbations.
The proposed pre-training framework is tested across multiple downstream tasks and achieves state-of-the-art performance.
arXiv Detail & Related papers (2023-07-31T17:59:42Z) - Negligible effect of brain MRI data preprocessing for tumor segmentation [36.89606202543839]
We conduct experiments on three publicly available datasets and evaluate the effect of different preprocessing steps in deep neural networks.
Our results demonstrate that most popular standardization steps add no value to the network performance.
We suggest that image intensity normalization approaches do not contribute to model accuracy because of the reduction of signal variance with image standardization.
arXiv Detail & Related papers (2022-04-11T17:29:36Z) - Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis [68.5287824124996]
We present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images.
We apply the generated images into medical image enhancement and utilize the enhanced results to cope with the low contrast problem.
Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-03-29T08:41:17Z) - InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal
Artifact Reduction in CT Images [53.4351366246531]
We construct a novel interpretable dual domain network, termed InDuDoNet+, into which CT imaging process is finely embedded.
We analyze the CT values among different tissues, and merge the prior observations into a prior network for our InDuDoNet+, which significantly improve its generalization performance.
arXiv Detail & Related papers (2021-12-23T15:52:37Z) - Self-supervised Dynamic CT Perfusion Image Denoising with Deep Neural
Networks [6.167259271197635]
Dynamic computed tomography (CTP) imaging is a promising approach for acute ischemic stroke diagnosis and evaluation.
Hemodynamic parametric maps of cerebral parenchyma are calculated from repeated CT scans of the first pass of iodinated contrast through the brain.
It is necessary to reduce the dose of perfusion for routine applications due to the high radiation exposure from the repeated scans, where image denoising is necessary to achieve a reliable diagnosis.
arXiv Detail & Related papers (2020-05-19T21:44:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.