Motion-based Camera Localization System in Colonoscopy Videos
- URL: http://arxiv.org/abs/2012.01690v3
- Date: Thu, 11 Feb 2021 20:09:22 GMT
- Title: Motion-based Camera Localization System in Colonoscopy Videos
- Authors: Heming Yao, Ryan W. Stidham, Zijun Gao, Jonathan Gryak, Kayvan
Najarian
- Abstract summary: We propose a camera localization system to estimate the relative location of the camera and classify the colon into anatomical segments.
The experimental results show that the performance of the proposed method is superior to other published methods.
- Score: 7.800211144015489
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Optical colonoscopy is an essential diagnostic and prognostic tool for many
gastrointestinal diseases, including cancer screening and staging, intestinal
bleeding, diarrhea, abdominal symptom evaluation, and inflammatory bowel
disease assessment. Automated assessment of colonoscopy is of interest
considering the subjectivity present in qualitative human interpretations of
colonoscopy findings. Localization of the camera is essential to interpreting
the meaning and context of findings for diseases evaluated by colonoscopy. In
this study, we propose a camera localization system to estimate the relative
location of the camera and classify the colon into anatomical segments. The
camera localization system begins with non-informative frame detection and
removal. Then a self-training end-to-end convolutional neural network is built
to estimate the camera motion, where several strategies are proposed to improve
its robustness and generalization on endoscopic videos. Using the estimated
camera motion a camera trajectory can be derived and a relative location index
calculated. Based on the estimated location index, anatomical colon segment
classification is performed by constructing a colon template. The proposed
motion estimation algorithm was evaluated on an external dataset containing the
ground truth for camera pose. The experimental results show that the
performance of the proposed method is superior to other published methods. The
relative location index estimation and anatomical region classification were
further validated using colonoscopy videos collected from routine clinical
practice. This validation yielded an average accuracy in classification of
0.754, which is substantially higher than the performances obtained using
location indices built from other methods.
Related papers
- A novel open-source ultrasound dataset with deep learning benchmarks for
spinal cord injury localization and anatomical segmentation [1.02101998415327]
We present an ultrasound dataset of 10,223-mode (B-mode) images consisting of sagittal slices of porcine spinal cords.
We benchmark the performance metrics of several state-of-the-art object detection algorithms to localize the site of injury.
We evaluate the zero-shot generalization capabilities of the segmentation models on human ultrasound spinal cord images.
arXiv Detail & Related papers (2024-09-24T20:22:59Z) - ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation [67.22294293695255]
We propose a novel reconstruction pipeline with a bi-directional adaptation architecture named ToDER to get precise depth estimations.
Experimental results demonstrate that our approach can precisely predict depth maps in both realistic and synthetic colonoscopy videos.
arXiv Detail & Related papers (2024-07-23T14:24:26Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - Region-based Contrastive Pretraining for Medical Image Retrieval with
Anatomic Query [56.54255735943497]
Region-based contrastive pretraining for Medical Image Retrieval (RegionMIR)
We introduce a novel Region-based contrastive pretraining for Medical Image Retrieval (RegionMIR)
arXiv Detail & Related papers (2023-05-09T16:46:33Z) - Live image-based neurosurgical guidance and roadmap generation using
unsupervised embedding [53.992124594124896]
We present a method for live image-only guidance leveraging a large data set of annotated neurosurgical videos.
A generated roadmap encodes the common anatomical paths taken in surgeries in the training set.
We trained and evaluated the proposed method with a data set of 166 transsphenoidal adenomectomy procedures.
arXiv Detail & Related papers (2023-03-31T12:52:24Z) - A Temporal Learning Approach to Inpainting Endoscopic Specularities and
Its effect on Image Correspondence [13.25903945009516]
We propose using a temporal generative adversarial network (GAN) to inpaint the hidden anatomy under specularities.
This is achieved using in-vivo data of gastric endoscopy (Hyper-Kvasir) in a fully unsupervised manner.
We also assess the effect of our method in computer vision tasks that underpin 3D reconstruction and camera motion estimation.
arXiv Detail & Related papers (2022-03-31T13:14:00Z) - Improving Classification Model Performance on Chest X-Rays through Lung
Segmentation [63.45024974079371]
We propose a deep learning approach to enhance abnormal chest x-ray (CXR) identification performance through segmentations.
Our approach is designed in a cascaded manner and incorporates two modules: a deep neural network with criss-cross attention modules (XLSor) for localizing lung region in CXR images and a CXR classification model with a backbone of a self-supervised momentum contrast (MoCo) model pre-trained on large-scale CXR data sets.
arXiv Detail & Related papers (2022-02-22T15:24:06Z) - Deep Learning-based Biological Anatomical Landmark Detection in
Colonoscopy Videos [21.384094148149003]
We propose a novel deep learning-based approach to detect biological anatomical landmarks in colonoscopy videos.
Average detection accuracy reaches 99.75%, while the average IoU of 0.91 shows a high degree of similarity between our predicted landmark periods and ground truth.
arXiv Detail & Related papers (2021-08-06T05:52:32Z) - NanoNet: Real-Time Polyp Segmentation in Video Capsule Endoscopy and
Colonoscopy [0.6125117548653111]
We propose NanoNet, a novel architecture for the segmentation of video capsule endoscopy and colonoscopy images.
Our proposed architecture allows real-time performance and has higher segmentation accuracy compared to other more complex ones.
arXiv Detail & Related papers (2021-04-22T15:40:28Z) - Malignancy Prediction and Lesion Identification from Clinical
Dermatological Images [65.1629311281062]
We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images.
We first identify all lesions present in the image regardless of sub-type or likelihood of malignancy, then it estimates their likelihood of malignancy, and through aggregation, it also generates an image-level likelihood of malignancy.
arXiv Detail & Related papers (2021-04-02T20:52:05Z) - A deep learning-based framework for segmenting invisible clinical target
volumes with estimated uncertainties for post-operative prostate cancer
radiotherapy [0.6262579136517118]
In post-operative radiotherapy for prostate cancer, the cancerous prostate gland has been surgically removed, so the clinical target volume (CTV) to be irradiated encompasses the microscopic spread of tumor cells.
In current clinical practice, physicians segment CTVs manually based on their relationship with nearby organs and other clinical information.
Here, we propose a deep learning model to overcome this problem by segmenting nearby organs first, then using their relationship with the CTV to assist CTV segmentation.
arXiv Detail & Related papers (2020-04-28T04:29:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.