Leveraging Sparse Annotations for Leukemia Diagnosis on the Large Leukemia Dataset
- URL: http://arxiv.org/abs/2504.02602v2
- Date: Fri, 08 Aug 2025 23:06:04 GMT
- Title: Leveraging Sparse Annotations for Leukemia Diagnosis on the Large Leukemia Dataset
- Authors: Abdul Rehman, Talha Meraj, Aiman Mahmood Minhas, Ayisha Imran, Mohsen Ali, Waqas Sultani, Mubarak Shah,
- Abstract summary: Leukemia is the 10th most frequently diagnosed cancer and one of the leading causes of cancer-related deaths worldwide.<n>Despite deep learning advances in medical imaging, leukemia analysis lacks a large, diverse multi-task dataset.<n>We present a large-scale WBC dataset and novel methods for detecting WBC with their attributes.
- Score: 44.948939549346676
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Leukemia is the 10th most frequently diagnosed cancer and one of the leading causes of cancer-related deaths worldwide. Realistic analysis of leukemia requires white blood cell (WBC) localization, classification, and morphological assessment. Despite deep learning advances in medical imaging, leukemia analysis lacks a large, diverse multi-task dataset, while existing small datasets lack domain diversity, limiting real-world applicability. To overcome dataset challenges, we present a large-scale WBC dataset named Large Leukemia Dataset (LLD) and novel methods for detecting WBC with their attributes. Our contribution here is threefold. First, we present a large-scale Leukemia dataset collected through Peripheral Blood Films (PBF) from 48 patients, through multiple microscopes, multi-cameras, and multi-magnification. To enhance diagnosis explainability and medical expert acceptance, each leukemia cell is annotated at 100x with 7 morphological attributes, ranging from Cell Size to Nuclear Shape. Secondly, we propose a multi-task model that not only detects WBCs but also predicts their attributes, providing an interpretable and clinically meaningful solution. Third, we propose a method for WBC detection with attribute analysis using sparse annotations. This approach reduces the annotation burden on hematologists, requiring them to mark only a small area within the field of view. Our method enables the model to leverage the entire field of view rather than just the annotated regions, enhancing learning efficiency and diagnostic accuracy. From diagnosis explainability to overcoming domain-shift challenges, the presented datasets can be used for many challenging aspects of microscopic image analysis. The datasets, code, and demo are available at: https://im.itu.edu.pk/sparse-leukemiaattri/
Related papers
- MIL vs. Aggregation: Evaluating Patient-Level Survival Prediction Strategies Using Graph-Based Learning [52.231128973251124]
We compare various strategies for predicting survival at the WSI and patient level.
The former treats each WSI as an independent sample, mimicking the strategy adopted in other works.
The latter comprises methods to either aggregate the predictions of the several WSIs or automatically identify the most relevant slide.
arXiv Detail & Related papers (2025-03-29T11:14:02Z) - A Graph-Based Framework for Interpretable Whole Slide Image Analysis [86.37618055724441]
We develop a framework that transforms whole-slide images into biologically-informed graph representations.<n>Our approach builds graph nodes from tissue regions that respect natural structures, not arbitrary grids.<n>We demonstrate strong performance on challenging cancer staging and survival prediction tasks.
arXiv Detail & Related papers (2025-03-14T20:15:04Z) - A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification [1.024113475677323]
Leukemia is diagnosed by analyzing blood and bone marrow smears under a microscope, with additional cytochemical tests for confirmation.
Deep learning has provided advanced methods for classifying microscopic smear images, aiding in the detection of leukemic cells.
In this research, hybrid deep learning models were implemented to classify Acute lymphoblastic leukemia (ALL)
The proposed method achieved a remarkable detection accuracy rate of 100% on the ALL-IDB1 dataset, 98.07% on the ALL-IDB2 dataset, and 98.64% on the combined dataset.
arXiv Detail & Related papers (2024-10-18T15:23:34Z) - Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia [0.0]
Acute lymphoblastic leukemia (ALL) is a form of blood cancer that affects the white blood cells.
In this study, we propose a binary image classification model to assist in the diagnostic process of ALL.
arXiv Detail & Related papers (2024-07-14T15:35:39Z) - A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability [7.6197612404154285]
Leukemia prognosis is challenging without morphological information of White Blood Cells (WBC)
Deep Learning based methods can be employed to assist hematologists.
We have acquired a realistic, generalized, and large dataset.
arXiv Detail & Related papers (2024-05-17T14:20:02Z) - DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology [1.3551232282678036]
We introduce DinoBloom, the first foundation model for single cell images in hematology.
Our model is built upon an extensive collection of 13 diverse, publicly available datasets of peripheral blood and bone marrow smears.
A family of four DinoBloom models can be adapted for a wide range of downstream applications.
arXiv Detail & Related papers (2024-04-07T17:25:52Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological
Attributes [22.423647778787334]
This paper introduces comprehensive annotations for White Blood Cells (WBCs) images.
We have identified 11 morphological attributes associated with the cell and its components.
We conduct experiments to predict these attributes from images, providing insights beyond basic WBC classification.
arXiv Detail & Related papers (2023-06-23T14:52:37Z) - Pixel-Level Explanation of Multiple Instance Learning Models in
Biomedical Single Cell Images [52.527733226555206]
We investigate the use of four attribution methods to explain a multiple instance learning models.
We study two datasets of acute myeloid leukemia with over 100 000 single cell images.
We compare attribution maps with the annotations of a medical expert to see how the model's decision-making differs from the human standard.
arXiv Detail & Related papers (2023-03-15T14:00:11Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - Classification of White Blood Cell Leukemia with Low Number of
Interpretable and Explainable Features [0.0]
White Blood Cell (WBC) Leukaemia is detected through image-based classification.
Convolutional Neural Networks are used to learn the features needed to classify images of cells a malignant or healthy.
This type of model requires learning a large number of parameters and is difficult to interpret and explain.
We present an XAI model which uses only 24 explainable and interpretable features and is highly competitive to other approaches by outperforming them by about 4.38%.
arXiv Detail & Related papers (2022-01-28T00:08:56Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.