ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction
- URL: http://arxiv.org/abs/2111.10371v1
- Date: Fri, 19 Nov 2021 04:44:27 GMT
- Title: ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction
- Authors: Yubo Zhang, Jan-Michael Frahm, Samuel Ehrenstein, Sarah K. McGill,
Julian G. Rosenman, Shuxian Wang, Stephen M. Pizer
- Abstract summary: In this work we have designed a set of training losses to deal with the special challenges of colonoscopy data.
With the training losses powerful enough, our self-supervised framework named ColDE is able to produce better depth maps of colonoscopy data.
- Score: 27.793186578742088
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: One of the key elements of reconstructing a 3D mesh from a monocular video is
generating every frame's depth map. However, in the application of colonoscopy
video reconstruction, producing good-quality depth estimation is challenging.
Neural networks can be easily fooled by photometric distractions or fail to
capture the complex shape of the colon surface, predicting defective shapes
that result in broken meshes. Aiming to fundamentally improve the depth
estimation quality for colonoscopy 3D reconstruction, in this work we have
designed a set of training losses to deal with the special challenges of
colonoscopy data. For better training, a set of geometric consistency
objectives was developed, using both depth and surface normal information.
Also, the classic photometric loss was extended with feature matching to
compensate for illumination noise. With the training losses powerful enough,
our self-supervised framework named ColDE is able to produce better depth maps
of colonoscopy data as compared to the previous work utilizing prior depth
knowledge. Used in reconstruction, our network is able to reconstruct
good-quality colon meshes in real-time without any post-processing, making it
the first to be clinically applicable.
Related papers
- Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy [11.798218793025974]
We introduce an innovative pipeline using Neural Radiance Fields (NeRF) for 3D reconstruction.
Our approach utilizes a preliminary NeRF reconstruction that yields a coarse model, then creates a binocular scene within the reconstructed environment.
High-fidelity depth maps are generated from monocular endoscopic video of a realistic cranial phantom.
arXiv Detail & Related papers (2024-10-05T05:26:21Z) - ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation [67.22294293695255]
We propose a novel reconstruction pipeline with a bi-directional adaptation architecture named ToDER to get precise depth estimations.
Experimental results demonstrate that our approach can precisely predict depth maps in both realistic and synthetic colonoscopy videos.
arXiv Detail & Related papers (2024-07-23T14:24:26Z) - High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces [18.948630080040576]
We introduce a novel method for colon section reconstruction by leveraging NeuS applied to endoscopic images, supplemented by a single frame of depth map.
Our approach demonstrates exceptional accuracy in completely rendering colon sections, even capturing unseen portions of the surface.
This breakthrough opens avenues for achieving stable and consistently scaled reconstructions, promising enhanced quality in cancer screening procedures and treatment interventions.
arXiv Detail & Related papers (2024-04-20T18:06:26Z) - Multi-task learning with cross-task consistency for improved depth
estimation in colonoscopy [0.2995885872626565]
We develop a novel multi-task learning (MTL) approach with a shared encoder and two decoders, namely a surface normal decoder and a depth estimator.
We demonstrate an improvement of 14.17% on relative error and 10.4% on $delta_1$ accuracy over the most accurate baseline state-of-the-art BTS approach.
arXiv Detail & Related papers (2023-11-30T16:13:17Z) - AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation [51.143540967290114]
We propose a method that unlocks a wide range of previously-infeasible geometric augmentations for unsupervised depth computation and estimation.
This is achieved by reversing, or undo''-ing, geometric transformations to the coordinates of the output depth, warping the depth map back to the original reference frame.
arXiv Detail & Related papers (2023-10-15T05:15:45Z) - A Surface-normal Based Neural Framework for Colonoscopy Reconstruction [24.467879991609095]
Reconstructing a 3D surface from colonoscopy video is challenging due to illumination and reflectivity variation in the video frame.
We develop a two-step neural framework that significantly improves the colonoscopy reconstruction quality.
arXiv Detail & Related papers (2023-03-13T16:44:15Z) - C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in
Colonoscopy [0.0]
3D colon reconstruction from Optical Colonoscopy (OC) to detect non-examined surfaces remains an unsolved problem.
Recent methods demonstrate compelling results, but suffer from: (1) frangible frame-to-frame (or frame-to-model) pose estimation resulting in many tracking failures; or (2) rely on point-based representations at the cost of scan quality.
We propose a novel reconstruction framework that addresses these issues end to end, which result in both quantitatively and qualitatively accurate and robust 3D colon reconstruction.
arXiv Detail & Related papers (2022-06-04T10:38:19Z) - Is Deep Image Prior in Need of a Good Education? [57.3399060347311]
Deep image prior was introduced as an effective prior for image reconstruction.
Despite its impressive reconstructive properties, the approach is slow when compared to learned or traditional reconstruction techniques.
We develop a two-stage learning paradigm to address the computational challenge.
arXiv Detail & Related papers (2021-11-23T15:08:26Z) - Learning to Recover 3D Scene Shape from a Single Image [98.20106822614392]
We propose a two-stage framework that first predicts depth up to an unknown scale and shift from a single monocular image.
We then use 3D point cloud encoders to predict the missing depth shift and focal length that allow us to recover a realistic 3D scene shape.
arXiv Detail & Related papers (2020-12-17T02:35:13Z) - Consistent Video Depth Estimation [57.712779457632024]
We present an algorithm for reconstructing dense, geometrically consistent depth for all pixels in a monocular video.
We leverage a conventional structure-from-motion reconstruction to establish geometric constraints on pixels in the video.
Our algorithm is able to handle challenging hand-held captured input videos with a moderate degree of dynamic motion.
arXiv Detail & Related papers (2020-04-30T17:59:26Z) - Depth Completion Using a View-constrained Deep Prior [73.21559000917554]
Recent work has shown that the structure of convolutional neural networks (CNNs) induces a strong prior that favors natural images.
This prior, known as a deep image prior (DIP), is an effective regularizer in inverse problems such as image denoising and inpainting.
We extend the concept of the DIP to depth images. Given color images and noisy and incomplete target depth maps, we reconstruct a depth map restored by virtue of using the CNN network structure as a prior.
arXiv Detail & Related papers (2020-01-21T21:56:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.