A metric learning approach for endoscopic kidney stone identification
- URL: http://arxiv.org/abs/2307.07046v1
- Date: Thu, 13 Jul 2023 20:02:07 GMT
- Title: A metric learning approach for endoscopic kidney stone identification
- Authors: Jorge Gonzalez-Zapata and Francisco Lopez-Tiro and Elias
Villalvazo-Avila and Daniel Flores-Araiza and Jacques Hubert and Andres
Mendez-Vazquez and Gilberto Ochoa-Ruiz and Christian Daul
- Abstract summary: This paper exploits Deep Metric Learning (DML) methods to handle classes with few samples, ii) to generalize well to out of distribution samples, andiii) to cope better with new classes which are added to the database.
The proposed Guided Deep Metric Learning approach is based on a novel architecture which was designed to learn data representations in an improved way.
The teacher model (GEMINI) generates a reduced hypothesis space based on prior knowledge from the labeled data, and is used it as a guide to a student model (i.e., ResNet50) through a Knowledge Distillation scheme.
- Score: 0.879504058268139
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Several Deep Learning (DL) methods have recently been proposed for an
automated identification of kidney stones during an ureteroscopy to enable
rapid therapeutic decisions. Even if these DL approaches led to promising
results, they are mainly appropriate for kidney stone types for which numerous
labelled data are available. However, only few labelled images are available
for some rare kidney stone types. This contribution exploits Deep Metric
Learning (DML) methods i) to handle such classes with few samples, ii) to
generalize well to out of distribution samples, and iii) to cope better with
new classes which are added to the database. The proposed Guided Deep Metric
Learning approach is based on a novel architecture which was designed to learn
data representations in an improved way. The solution was inspired by Few-Shot
Learning (FSL) and makes use of a teacher-student approach. The teacher model
(GEMINI) generates a reduced hypothesis space based on prior knowledge from the
labeled data, and is used it as a guide to a student model (i.e., ResNet50)
through a Knowledge Distillation scheme. Extensive tests were first performed
on two datasets separately used for the recognition, namely a set of images
acquired for the surfaces of the kidney stone fragments, and a set of images of
the fragment sections. The proposed DML-approach improved the identification
accuracy by 10% and 12% in comparison to DL-methods and other DML-approaches,
respectively. Moreover, model embeddings from the two dataset types were merged
in an organized way through a multi-view scheme to simultaneously exploit the
information of surface and section fragments. Test with the resulting mixed
model improves the identification accuracy by at least 3% and up to 30% with
respect to DL-models and shallow machine learning methods, respectively.
Related papers
- Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition [58.41784639847413]
Human emotion is a complex phenomenon conveyed and perceived through facial expressions, vocal tones, body language, and physiological signals.
In this paper, a multi-teacher PKD (MT-PKDOT) method with self-distillation is introduced to align diverse teacher representations before distilling them to the student.
Results indicate that our proposed method can outperform SOTA PKD methods.
arXiv Detail & Related papers (2024-08-16T22:11:01Z) - Deep Prototypical-Parts Ease Morphological Kidney Stone Identification
and are Competitively Robust to Photometric Perturbations [0.9236074230806579]
We learn Prototypical Parts (PPs) per kidney stone subtype to generate an output classification.
Our implementation's average accuracy is lower than state-of-the-art (SOTA) non-interpretable DL models by 1.5 %.
Our models perform 2.8% better on perturbed images with a lower standard deviation, without adversarial training.
arXiv Detail & Related papers (2023-04-08T17:43:31Z) - PCA: Semi-supervised Segmentation with Patch Confidence Adversarial
Training [52.895952593202054]
We propose a new semi-supervised adversarial method called Patch Confidence Adrial Training (PCA) for medical image segmentation.
PCA learns the pixel structure and context information in each patch to get enough gradient feedback, which aids the discriminator in convergent to an optimal state.
Our method outperforms the state-of-the-art semi-supervised methods, which demonstrates its effectiveness for medical image segmentation.
arXiv Detail & Related papers (2022-07-24T07:45:47Z) - SSD-KD: A Self-supervised Diverse Knowledge Distillation Method for
Lightweight Skin Lesion Classification Using Dermoscopic Images [62.60956024215873]
Skin cancer is one of the most common types of malignancy, affecting a large population and causing a heavy economic burden worldwide.
Most studies in skin cancer detection keep pursuing high prediction accuracies without considering the limitation of computing resources on portable devices.
This study specifically proposes a novel method, termed SSD-KD, that unifies diverse knowledge into a generic KD framework for skin diseases classification.
arXiv Detail & Related papers (2022-03-22T06:54:29Z) - A Deep Learning-Based Unified Framework for Red Lesions Detection on
Retinal Fundus Images [3.5557219875516646]
Red-lesions, i.e., microaneurysms (MAs) and hemorrhages (HMs) are the early signs of diabetic retinopathy (DR)
Most of the existing methods detect only MAs or only HMs because of the difference in their texture, sizes, and morphology.
We propose a two-stream red lesions detection system dealing simultaneously with small and large red lesions.
arXiv Detail & Related papers (2021-09-10T00:12:13Z) - About Explicit Variance Minimization: Training Neural Networks for
Medical Imaging With Limited Data Annotations [2.3204178451683264]
Variance Aware Training (VAT) method exploits this property by introducing the variance error into the model loss function.
We validate VAT on three medical imaging datasets from diverse domains and various learning objectives.
arXiv Detail & Related papers (2021-05-28T21:34:04Z) - Relational Subsets Knowledge Distillation for Long-tailed Retinal
Diseases Recognition [65.77962788209103]
We propose class subset learning by dividing the long-tailed data into multiple class subsets according to prior knowledge.
It enforces the model to focus on learning the subset-specific knowledge.
The proposed framework proved to be effective for the long-tailed retinal diseases recognition task.
arXiv Detail & Related papers (2021-04-22T13:39:33Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Deep Semi-supervised Knowledge Distillation for Overlapping Cervical
Cell Instance Segmentation [54.49894381464853]
We propose to leverage both labeled and unlabeled data for instance segmentation with improved accuracy by knowledge distillation.
We propose a novel Mask-guided Mean Teacher framework with Perturbation-sensitive Sample Mining.
Experiments show that the proposed method improves the performance significantly compared with the supervised method learned from labeled data only.
arXiv Detail & Related papers (2020-07-21T13:27:09Z) - Additive Angular Margin for Few Shot Learning to Classify Clinical
Endoscopy Images [42.74958357195011]
We propose a few-shot learning approach that requires less training data and can be used to predict label classes of test samples from an unseen dataset.
We compare our approach to the several established methods on a large cohort of multi-center, multi-organ, and multi-modal endoscopy data.
arXiv Detail & Related papers (2020-03-23T00:20:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.