Leveraging a realistic synthetic database to learn Shape-from-Shading
for estimating the colon depth in colonoscopy images
- URL: http://arxiv.org/abs/2311.05021v1
- Date: Wed, 8 Nov 2023 21:14:56 GMT
- Title: Leveraging a realistic synthetic database to learn Shape-from-Shading
for estimating the colon depth in colonoscopy images
- Authors: Josu\'e Ruano, Mart\'in G\'omez, Eduardo Romero, Antoine Manzanera
- Abstract summary: This work introduces a novel methodology to estimate colon depth maps in single frames from monocular colonoscopy videos.
The generated depth map is inferred from the shading variation of the colon wall with respect to the light source.
A classic convolutional neural network architecture is trained from scratch to estimate the depth map.
- Score: 0.20482269513546453
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Colonoscopy is the choice procedure to diagnose colon and rectum cancer, from
early detection of small precancerous lesions (polyps), to confirmation of
malign masses. However, the high variability of the organ appearance and the
complex shape of both the colon wall and structures of interest make this
exploration difficult. Learned visuospatial and perceptual abilities mitigate
technical limitations in clinical practice by proper estimation of the
intestinal depth. This work introduces a novel methodology to estimate colon
depth maps in single frames from monocular colonoscopy videos. The generated
depth map is inferred from the shading variation of the colon wall with respect
to the light source, as learned from a realistic synthetic database. Briefly, a
classic convolutional neural network architecture is trained from scratch to
estimate the depth map, improving sharp depth estimations in haustral folds and
polyps by a custom loss function that minimizes the estimation error in edges
and curvatures. The network was trained by a custom synthetic colonoscopy
database herein constructed and released, composed of 248,400 frames (47
videos), with depth annotations at the level of pixels. This collection
comprehends 5 subsets of videos with progressively higher levels of visual
complexity. Evaluation of the depth estimation with the synthetic database
reached a threshold accuracy of 95.65%, and a mean-RMSE of 0.451 cm, while a
qualitative assessment with a real database showed consistent depth
estimations, visually evaluated by the expert gastroenterologist coauthoring
this paper. Finally, the method achieved competitive performance with respect
to another state-of-the-art method using a public synthetic database and
comparable results in a set of images with other five state-of-the-art methods.
Related papers
- Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video [1.0485739694839669]
We propose a pipeline of structure-preserving synthetic-to-real (sim2real) image translation.
This allows us to generate large quantities of realistic-looking synthetic images for supervised depth estimation.
We also propose a dataset of hand-picked sequences from clinical colonoscopies to improve the image translation process.
arXiv Detail & Related papers (2024-08-19T17:02:16Z) - ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation [67.22294293695255]
We propose a novel reconstruction pipeline with a bi-directional adaptation architecture named ToDER to get precise depth estimations.
Experimental results demonstrate that our approach can precisely predict depth maps in both realistic and synthetic colonoscopy videos.
arXiv Detail & Related papers (2024-07-23T14:24:26Z) - High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces [18.948630080040576]
We introduce a novel method for colon section reconstruction by leveraging NeuS applied to endoscopic images, supplemented by a single frame of depth map.
Our approach demonstrates exceptional accuracy in completely rendering colon sections, even capturing unseen portions of the surface.
This breakthrough opens avenues for achieving stable and consistently scaled reconstructions, promising enhanced quality in cancer screening procedures and treatment interventions.
arXiv Detail & Related papers (2024-04-20T18:06:26Z) - Multi-task learning with cross-task consistency for improved depth
estimation in colonoscopy [0.2995885872626565]
We develop a novel multi-task learning (MTL) approach with a shared encoder and two decoders, namely a surface normal decoder and a depth estimator.
We demonstrate an improvement of 14.17% on relative error and 10.4% on $delta_1$ accuracy over the most accurate baseline state-of-the-art BTS approach.
arXiv Detail & Related papers (2023-11-30T16:13:17Z) - OADAT: Experimental and Synthetic Clinical Optoacoustic Data for
Standardized Image Processing [62.993663757843464]
Optoacoustic (OA) imaging is based on excitation of biological tissues with nanosecond-duration laser pulses followed by detection of ultrasound waves generated via light-absorption-mediated thermoelastic expansion.
OA imaging features a powerful combination between rich optical contrast and high resolution in deep tissues.
No standardized datasets generated with different types of experimental set-up and associated processing methods are available to facilitate advances in broader applications of OA in clinical settings.
arXiv Detail & Related papers (2022-06-17T08:11:26Z) - C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in
Colonoscopy [0.0]
3D colon reconstruction from Optical Colonoscopy (OC) to detect non-examined surfaces remains an unsolved problem.
Recent methods demonstrate compelling results, but suffer from: (1) frangible frame-to-frame (or frame-to-model) pose estimation resulting in many tracking failures; or (2) rely on point-based representations at the cost of scan quality.
We propose a novel reconstruction framework that addresses these issues end to end, which result in both quantitatively and qualitatively accurate and robust 3D colon reconstruction.
arXiv Detail & Related papers (2022-06-04T10:38:19Z) - Learning Topology from Synthetic Data for Unsupervised Depth Completion [66.26787962258346]
We present a method for inferring dense depth maps from images and sparse depth measurements.
We learn the association of sparse point clouds with dense natural shapes, and using the image as evidence to validate the predicted depth map.
arXiv Detail & Related papers (2021-06-06T00:21:12Z) - A parameter refinement method for Ptychography based on Deep Learning
concepts [55.41644538483948]
coarse parametrisation in propagation distance, position errors and partial coherence frequently menaces the experiment viability.
A modern Deep Learning framework is used to correct autonomously the setup incoherences, thus improving the quality of a ptychography reconstruction.
We tested our system on both synthetic datasets and also on real data acquired at the TwinMic beamline of the Elettra synchrotron facility.
arXiv Detail & Related papers (2021-05-18T10:15:17Z) - Chest x-ray automated triage: a semiologic approach designed for
clinical implementation, exploiting different types of labels through a
combination of four Deep Learning architectures [83.48996461770017]
This work presents a Deep Learning method based on the late fusion of different convolutional architectures.
We built four training datasets combining images from public chest x-ray datasets and our institutional archive.
We trained four different Deep Learning architectures and combined their outputs with a late fusion strategy, obtaining a unified tool.
arXiv Detail & Related papers (2020-12-23T14:38:35Z) - Data Consistent CT Reconstruction from Insufficient Data with Learned
Prior Images [70.13735569016752]
We investigate the robustness of deep learning in CT image reconstruction by showing false negative and false positive lesion cases.
We propose a data consistent reconstruction (DCR) method to improve their image quality, which combines the advantages of compressed sensing and deep learning.
The efficacy of the proposed method is demonstrated in cone-beam CT with truncated data, limited-angle data and sparse-view data, respectively.
arXiv Detail & Related papers (2020-05-20T13:30:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.