LoGex: Improved tail detection of extremely rare histopathology classes via guided diffusion
- URL: http://arxiv.org/abs/2409.01317v1
- Date: Mon, 2 Sep 2024 15:18:15 GMT
- Title: LoGex: Improved tail detection of extremely rare histopathology classes via guided diffusion
- Authors: Maximilian Mueller, Matthias Hein,
- Abstract summary: In realistic medical settings, the data are often long-tailed, with most samples concentrated in a few classes and a long tail of rare classes, usually containing just a few samples.
This distribution presents a significant challenge because rare conditions are critical to detect and difficult to classify due to limited data.
In this paper, rather than attempting to classify rare classes, we aim to detect these as out-of-distribution data reliably.
- Score: 36.56346240815833
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In realistic medical settings, the data are often inherently long-tailed, with most samples concentrated in a few classes and a long tail of rare classes, usually containing just a few samples. This distribution presents a significant challenge because rare conditions are critical to detect and difficult to classify due to limited data. In this paper, rather than attempting to classify rare classes, we aim to detect these as out-of-distribution data reliably. We leverage low-rank adaption (LoRA) and diffusion guidance to generate targeted synthetic data for the detection problem. We significantly improve the OOD detection performance on a challenging histopathological task with only ten samples per tail class without losing classification accuracy on the head classes.
Related papers
- One Shot GANs for Long Tail Problem in Skin Lesion Dataset using novel content space assessment metric [1.833650794546064]
Long tail problems frequently arise in the medical field due to the scarcity of medical data for rare conditions.
One Shot GANs was employed to augment the tail class of HAM10000 dataset by generating additional samples.
arXiv Detail & Related papers (2024-09-30T04:51:54Z) - From Uncertainty to Clarity: Uncertainty-Guided Class-Incremental Learning for Limited Biomedical Samples via Semantic Expansion [0.0]
We propose a class-incremental learning method under limited samples in the biomedical field.
Our method achieves optimal performance, surpassing state-of-the-art methods by as much as 53.54% in accuracy.
arXiv Detail & Related papers (2024-09-12T05:22:45Z) - Unlearnable Examples Detection via Iterative Filtering [84.59070204221366]
Deep neural networks are proven to be vulnerable to data poisoning attacks.
It is quite beneficial and challenging to detect poisoned samples from a mixed dataset.
We propose an Iterative Filtering approach for UEs identification.
arXiv Detail & Related papers (2024-08-15T13:26:13Z) - EAT: Towards Long-Tailed Out-of-Distribution Detection [55.380390767978554]
This paper addresses the challenging task of long-tailed OOD detection.
The main difficulty lies in distinguishing OOD data from samples belonging to the tail classes.
We propose two simple ideas: (1) Expanding the in-distribution class space by introducing multiple abstention classes, and (2) Augmenting the context-limited tail classes by overlaying images onto the context-rich OOD data.
arXiv Detail & Related papers (2023-12-14T13:47:13Z) - Leveraging Unlabelled Data in Multiple-Instance Learning Problems for
Improved Detection of Parkinsonian Tremor in Free-Living Conditions [80.88681952022479]
We introduce a new method for combining semi-supervised with multiple-instance learning.
We show that by leveraging the unlabelled data of 454 subjects we can achieve large performance gains in per-subject tremor detection.
arXiv Detail & Related papers (2023-04-29T12:25:10Z) - Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New
Benchmark Study [75.05049024176584]
We present a benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays.
We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes.
The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images.
arXiv Detail & Related papers (2022-08-29T04:34:15Z) - Sample hardness based gradient loss for long-tailed cervical cell
detection [40.503143547742866]
We propose aGrad-Libra Loss to dynamically calibrate the degree of hardness of each sample for different categories.
Our loss can thus help the detector to put more emphasis on those hard samples in both head and tail categories.
arXiv Detail & Related papers (2022-08-07T17:52:29Z) - Knowledge distillation with a class-aware loss for endoscopic disease
detection [1.1470070927586016]
In this work, we leverage deep learning to develop a framework to improve the localization of difficult to detect lesions.
Our model achieves higher performance in terms of mean average precision (mAP) on both endoscopic disease detection challenge and Kvasir-SEG datasets.
arXiv Detail & Related papers (2022-07-19T19:56:12Z) - Relational Subsets Knowledge Distillation for Long-tailed Retinal
Diseases Recognition [65.77962788209103]
We propose class subset learning by dividing the long-tailed data into multiple class subsets according to prior knowledge.
It enforces the model to focus on learning the subset-specific knowledge.
The proposed framework proved to be effective for the long-tailed retinal diseases recognition task.
arXiv Detail & Related papers (2021-04-22T13:39:33Z) - Self-Taught Semi-Supervised Anomaly Detection on Upper Limb X-rays [11.859913430860335]
Supervised deep networks take for granted a large number of annotations by radiologists.
Our approach's rationale is to use task pretext tasks to leverage unlabeled data.
We show that our method outperforms baselines across unsupervised and self-supervised anomaly detection settings.
arXiv Detail & Related papers (2021-02-19T12:32:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.