Combining Deep Learning with Geometric Features for Image based
Localization in the Gastrointestinal Tract
- URL: http://arxiv.org/abs/2005.05481v2
- Date: Wed, 13 May 2020 19:25:49 GMT
- Title: Combining Deep Learning with Geometric Features for Image based
Localization in the Gastrointestinal Tract
- Authors: Jingwei Song, Mitesh Patel, Andreas Girgensohn, Chelhwon Kim
- Abstract summary: We propose a novel approach to combine Deep Learning method with traditional feature based approach to achieve better localization with small training data.
Our method fully exploits the best of both worlds by introducing a Siamese network structure to perform few-shot classification to the closest zone in the segmented training image set.
The accuracy is improved by 28.94% (Position) and 10.97% (Orientation) with respect to state-of-art method.
- Score: 8.510792628268824
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Tracking monocular colonoscope in the Gastrointestinal tract (GI) is a
challenging problem as the images suffer from deformation, blurred textures,
significant changes in appearance. They greatly restrict the tracking ability
of conventional geometry based methods. Even though Deep Learning (DL) can
overcome these issues, limited labeling data is a roadblock to state-of-art DL
method. Considering these, we propose a novel approach to combine DL method
with traditional feature based approach to achieve better localization with
small training data. Our method fully exploits the best of both worlds by
introducing a Siamese network structure to perform few-shot classification to
the closest zone in the segmented training image set. The classified label is
further adopted to initialize the pose of scope. To fully use the training
dataset, a pre-generated triangulated map points within the zone in the
training set are registered with observation and contribute to estimating the
optimal pose of the test image. The proposed hybrid method is extensively
tested and compared with existing methods, and the result shows significant
improvement over traditional geometric based or DL based localization. The
accuracy is improved by 28.94% (Position) and 10.97% (Orientation) with respect
to state-of-art method.
Related papers
- Domain Generalization for Endoscopic Image Segmentation by Disentangling Style-Content Information and SuperPixel Consistency [1.4991956341367338]
We propose an approach for style-content disentanglement using instance normalization and instance selective whitening (ISW) for improved domain generalization.
We evaluate our approach on two datasets: EndoUDA Barrett's Esophagus and EndoUDA polyps, and compare its performance with three state-of-the-art (SOTA) methods.
arXiv Detail & Related papers (2024-09-19T04:10:04Z) - Deep Learning Based Speckle Filtering for Polarimetric SAR Images. Application to Sentinel-1 [51.404644401997736]
We propose a complete framework to remove speckle in polarimetric SAR images using a convolutional neural network.
Experiments show that the proposed approach offers exceptional results in both speckle reduction and resolution preservation.
arXiv Detail & Related papers (2024-08-28T10:07:17Z) - SwIPE: Efficient and Robust Medical Image Segmentation with Implicit Patch Embeddings [12.79344668998054]
We propose SwIPE (Segmentation with Implicit Patch Embeddings) to enable accurate local boundary delineation and global shape coherence.
We show that SwIPE significantly improves over recent implicit approaches and outperforms state-of-the-art discrete methods with over 10x fewer parameters.
arXiv Detail & Related papers (2023-07-23T20:55:11Z) - LatentForensics: Towards frugal deepfake detection in the StyleGAN latent space [2.629091178090276]
We propose a deepfake detection method that operates in the latent space of a state-of-the-art generative adversarial network (GAN) trained on high-quality face images.
Experimental results on standard datasets reveal that the proposed approach outperforms other state-of-the-art deepfake classification methods.
arXiv Detail & Related papers (2023-03-30T08:36:48Z) - Parallax-Tolerant Unsupervised Deep Image Stitching [57.76737888499145]
We propose UDIS++, a parallax-tolerant unsupervised deep image stitching technique.
First, we propose a robust and flexible warp to model the image registration from global homography to local thin-plate spline motion.
To further eliminate the parallax artifacts, we propose to composite the stitched image seamlessly by unsupervised learning for seam-driven composition masks.
arXiv Detail & Related papers (2023-02-16T10:40:55Z) - CAMERAS: Enhanced Resolution And Sanity preserving Class Activation
Mapping for image saliency [61.40511574314069]
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
We propose CAMERAS, a technique to compute high-fidelity backpropagation saliency maps without requiring any external priors.
arXiv Detail & Related papers (2021-06-20T08:20:56Z) - An Adaptive Framework for Learning Unsupervised Depth Completion [59.17364202590475]
We present a method to infer a dense depth map from a color image and associated sparse depth measurements.
We show that regularization and co-visibility are related via the fitness of the model to data and can be unified into a single framework.
arXiv Detail & Related papers (2021-06-06T02:27:55Z) - Scalable Semi-supervised Landmark Localization for X-ray Images using
Few-shot Deep Adaptive Graph [19.588348005574165]
Based on a fully-supervised graph-based method, DAG, we proposed a semi-supervised extension of it, termed few-shot DAG.
It first trains a DAG model on the labeled data and then fine-tunes the pre-trained model on the unlabeled data with a teacher-student SSL mechanism.
We extensively evaluated our method on pelvis, hand and chest landmark detection tasks.
arXiv Detail & Related papers (2021-04-29T19:46:18Z) - Contrastive Rendering for Ultrasound Image Segmentation [59.23915581079123]
The lack of sharp boundaries in US images remains an inherent challenge for segmentation.
We propose a novel and effective framework to improve boundary estimation in US images.
Our proposed method outperforms state-of-the-art methods and has the potential to be used in clinical practice.
arXiv Detail & Related papers (2020-10-10T07:14:03Z) - Attentive CutMix: An Enhanced Data Augmentation Approach for Deep
Learning Based Image Classification [58.20132466198622]
We propose Attentive CutMix, a naturally enhanced augmentation strategy based on CutMix.
In each training iteration, we choose the most descriptive regions based on the intermediate attention maps from a feature extractor.
Our proposed method is simple yet effective, easy to implement and can boost the baseline significantly.
arXiv Detail & Related papers (2020-03-29T15:01:05Z) - Deep Weakly-Supervised Learning Methods for Classification and
Localization in Histology Images: A Survey [25.429124017422385]
Using deep learning models to diagnose cancer presents several challenges.
Deep weakly-supervised object localization (WSOL) methods provide strategies for low-cost training of deep learning models.
This paper provides a review of state-of-art DL methods for WSOL.
arXiv Detail & Related papers (2019-09-08T00:01:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.