Leveraging Sparse Annotations for Leukemia Diagnosis on the Large Leukemia Dataset
- URL: http://arxiv.org/abs/2504.02602v1
- Date: Thu, 03 Apr 2025 14:04:02 GMT
- Title: Leveraging Sparse Annotations for Leukemia Diagnosis on the Large Leukemia Dataset
- Authors: Abdul Rehman, Talha Meraj, Aiman Mahmood Minhas, Ayisha Imran, Mohsen Ali, Waqas Sultani, Mubarak Shah,
- Abstract summary: Leukemia is 10th most frequently diagnosed cancer and one of the leading causes of cancer related deaths worldwide.<n>Despite deep learning advances in medical imaging, leukemia analysis lacks a large, diverse multi-task dataset.<n>We present a large scale WBC dataset and novel methods for detecting WBC with their attributes.
- Score: 44.948939549346676
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Leukemia is 10th most frequently diagnosed cancer and one of the leading causes of cancer related deaths worldwide. Realistic analysis of Leukemia requires White Blook Cells (WBC) localization, classification, and morphological assessment. Despite deep learning advances in medical imaging, leukemia analysis lacks a large, diverse multi-task dataset, while existing small datasets lack domain diversity, limiting real world applicability. To overcome dataset challenges, we present a large scale WBC dataset named Large Leukemia Dataset (LLD) and novel methods for detecting WBC with their attributes. Our contribution here is threefold. First, we present a large-scale Leukemia dataset collected through Peripheral Blood Films (PBF) from several patients, through multiple microscopes, multi cameras, and multi magnification. To enhance diagnosis explainability and medical expert acceptance, each leukemia cell is annotated at 100x with 7 morphological attributes, ranging from Cell Size to Nuclear Shape. Secondly, we propose a multi task model that not only detects WBCs but also predicts their attributes, providing an interpretable and clinically meaningful solution. Third, we propose a method for WBC detection with attribute analysis using sparse annotations. This approach reduces the annotation burden on hematologists, requiring them to mark only a small area within the field of view. Our method enables the model to leverage the entire field of view rather than just the annotated regions, enhancing learning efficiency and diagnostic accuracy. From diagnosis explainability to overcoming domain shift challenges, presented datasets could be used for many challenging aspects of microscopic image analysis. The datasets, code, and demo are available at: https://im.itu.edu.pk/sparse-leukemiaattri/
Related papers
- MIL vs. Aggregation: Evaluating Patient-Level Survival Prediction Strategies Using Graph-Based Learning [52.231128973251124]
We compare various strategies for predicting survival at the WSI and patient level.
The former treats each WSI as an independent sample, mimicking the strategy adopted in other works.
The latter comprises methods to either aggregate the predictions of the several WSIs or automatically identify the most relevant slide.
arXiv Detail & Related papers (2025-03-29T11:14:02Z) - A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification [1.024113475677323]
Leukemia is diagnosed by analyzing blood and bone marrow smears under a microscope, with additional cytochemical tests for confirmation.
Deep learning has provided advanced methods for classifying microscopic smear images, aiding in the detection of leukemic cells.
In this research, hybrid deep learning models were implemented to classify Acute lymphoblastic leukemia (ALL)
The proposed method achieved a remarkable detection accuracy rate of 100% on the ALL-IDB1 dataset, 98.07% on the ALL-IDB2 dataset, and 98.64% on the combined dataset.
arXiv Detail & Related papers (2024-10-18T15:23:34Z) - A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability [7.6197612404154285]
Leukemia prognosis is challenging without morphological information of White Blood Cells (WBC)
Deep Learning based methods can be employed to assist hematologists.
We have acquired a realistic, generalized, and large dataset.
arXiv Detail & Related papers (2024-05-17T14:20:02Z) - DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology [1.3551232282678036]
We introduce DinoBloom, the first foundation model for single cell images in hematology.
Our model is built upon an extensive collection of 13 diverse, publicly available datasets of peripheral blood and bone marrow smears.
A family of four DinoBloom models can be adapted for a wide range of downstream applications.
arXiv Detail & Related papers (2024-04-07T17:25:52Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological
Attributes [22.423647778787334]
This paper introduces comprehensive annotations for White Blood Cells (WBCs) images.
We have identified 11 morphological attributes associated with the cell and its components.
We conduct experiments to predict these attributes from images, providing insights beyond basic WBC classification.
arXiv Detail & Related papers (2023-06-23T14:52:37Z) - Pixel-Level Explanation of Multiple Instance Learning Models in
Biomedical Single Cell Images [52.527733226555206]
We investigate the use of four attribution methods to explain a multiple instance learning models.
We study two datasets of acute myeloid leukemia with over 100 000 single cell images.
We compare attribution maps with the annotations of a medical expert to see how the model's decision-making differs from the human standard.
arXiv Detail & Related papers (2023-03-15T14:00:11Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - Classification of White Blood Cell Leukemia with Low Number of
Interpretable and Explainable Features [0.0]
White Blood Cell (WBC) Leukaemia is detected through image-based classification.
Convolutional Neural Networks are used to learn the features needed to classify images of cells a malignant or healthy.
This type of model requires learning a large number of parameters and is difficult to interpret and explain.
We present an XAI model which uses only 24 explainable and interpretable features and is highly competitive to other approaches by outperforming them by about 4.38%.
arXiv Detail & Related papers (2022-01-28T00:08:56Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.