LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved
Wavelet Attention and Reverse Diffusion
- URL: http://arxiv.org/abs/2307.02452v2
- Date: Sat, 22 Jul 2023 10:08:38 GMT
- Title: LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved
Wavelet Attention and Reverse Diffusion
- Authors: Long Bai, Tong Chen, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren
- Abstract summary: Wireless capsule endoscopy (WCE) is a painless and non-invasive diagnostic tool for gastrointestinal (GI) diseases.
Deep learning-based low-light image enhancement (LLIE) in the medical field gradually attracts researchers.
We introduce a WCE LLIE framework based on the multi-scale convolutional neural network (CNN) and reverse diffusion process.
- Score: 24.560417980602928
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Wireless capsule endoscopy (WCE) is a painless and non-invasive diagnostic
tool for gastrointestinal (GI) diseases. However, due to GI anatomical
constraints and hardware manufacturing limitations, WCE vision signals may
suffer from insufficient illumination, leading to a complicated screening and
examination procedure. Deep learning-based low-light image enhancement (LLIE)
in the medical field gradually attracts researchers. Given the exuberant
development of the denoising diffusion probabilistic model (DDPM) in computer
vision, we introduce a WCE LLIE framework based on the multi-scale
convolutional neural network (CNN) and reverse diffusion process. The
multi-scale design allows models to preserve high-resolution representation and
context information from low-resolution, while the curved wavelet attention
(CWA) block is proposed for high-frequency and local feature learning.
Furthermore, we combine the reverse diffusion procedure to further optimize the
shallow output and generate the most realistic image. The proposed method is
compared with ten state-of-the-art (SOTA) LLIE methods and significantly
outperforms quantitatively and qualitatively. The superior performance on GI
disease segmentation further demonstrates the clinical potential of our
proposed model. Our code is publicly accessible.
Related papers
- CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers [66.15847237150909]
We introduce a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images.
The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism.
We validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms.
arXiv Detail & Related papers (2024-03-21T15:13:36Z) - CV-Attention UNet: Attention-based UNet for 3D Cerebrovascular Segmentation of Enhanced TOF-MRA Images [2.2265536092123006]
We propose the 3D cerebrovascular attention UNet method, named CV-AttentionUNet, for precise extraction of brain vessel images.
To combine the low and high semantics, we applied the attention mechanism.
We believe that the novelty of this algorithm lies in its ability to perform well on both labeled and unlabeled data.
arXiv Detail & Related papers (2023-11-16T22:31:05Z) - Semantic Map Guided Synthesis of Wireless Capsule Endoscopy Images using
Diffusion Models [4.187344935012482]
Wireless capsule endoscopy (WCE) is a non-invasive method for visualizing the gastrointestinal (GI) tract.
We propose a novel approach leveraging generative models, specifically the diffusion model (DM), for generating diverse WCE images.
Our model incorporates semantic map resulted from visualization scale (VS) engine, enhancing the controllability and diversity of generated images.
arXiv Detail & Related papers (2023-11-10T06:16:44Z) - Global Structure-Aware Diffusion Process for Low-Light Image Enhancement [64.69154776202694]
This paper studies a diffusion-based framework to address the low-light image enhancement problem.
We advocate for the regularization of its inherent ODE-trajectory.
Experimental evaluations reveal that the proposed framework attains distinguished performance in low-light enhancement.
arXiv Detail & Related papers (2023-10-26T17:01:52Z) - An Evaluation of Lightweight Deep Learning Techniques in Medical Imaging
for High Precision COVID-19 Diagnostics [0.0]
Decision support systems relax the challenges inherent to the physical examination of images.
Most deep learning algorithms utilised approaches are not amenable to implementation on resource-constrained devices.
This paper presents the development and evaluation of the performance of lightweight deep learning technique for the detection of COVID-19 using the MobileNetV2 model.
arXiv Detail & Related papers (2023-05-30T13:14:03Z) - Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic Model [0.2578242050187029]
We present a diffusion probabilistic model that is fully unsupervised to learn from noise instead of signal.
Our method can significantly improve the image quality with a simple working pipeline and a small amount of training data.
arXiv Detail & Related papers (2022-01-27T19:02:38Z) - Deep AUC Maximization for Medical Image Classification: Challenges and
Opportunities [60.079782224958414]
We will present and discuss opportunities and challenges brought by a new deep learning method by AUC (aka underlinebf Deep underlinebf AUC classification)
arXiv Detail & Related papers (2021-11-01T15:31:32Z) - RIDnet: Radiologist-Inspired Deep Neural Network for Low-dose CT
Denoising [10.101822678034393]
Low-dose computed tomography (LDCT) has been widely adopted in the early screening of lung cancer and COVID-19.
LDCT images inevitably suffer from the degradation problem caused by complex noises.
We propose a novel deep learning model named radiologist-inspired deep denoising network (RIDnet) to imitate the workflow of a radiologist reading LDCT images.
arXiv Detail & Related papers (2021-05-15T05:59:01Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Explaining Clinical Decision Support Systems in Medical Imaging using
Cycle-Consistent Activation Maximization [112.2628296775395]
Clinical decision support using deep neural networks has become a topic of steadily growing interest.
clinicians are often hesitant to adopt the technology because its underlying decision-making process is considered to be intransparent and difficult to comprehend.
We propose a novel decision explanation scheme based on CycleGAN activation which generates high-quality visualizations of classifier decisions even in smaller data sets.
arXiv Detail & Related papers (2020-10-09T14:39:27Z) - Multifold Acceleration of Diffusion MRI via Slice-Interleaved Diffusion
Encoding (SIDE) [50.65891535040752]
We propose a diffusion encoding scheme, called Slice-Interleaved Diffusion.
SIDE, that interleaves each diffusion-weighted (DW) image volume with slices encoded with different diffusion gradients.
We also present a method based on deep learning for effective reconstruction of DW images from the highly slice-undersampled data.
arXiv Detail & Related papers (2020-02-25T14:48:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.