Enhanced Leukemic Cell Classification Using Attention-Based CNN and Data Augmentation
- URL: http://arxiv.org/abs/2601.01026v1
- Date: Sat, 03 Jan 2026 01:24:11 GMT
- Title: Enhanced Leukemic Cell Classification Using Attention-Based CNN and Data Augmentation
- Authors: Douglas Costa Braga, Daniel Oliveira Dantas,
- Abstract summary: Acute lymphoblastic leukemia (ALL) is the most common childhood cancer, requiring expert microscopic diagnosis.<n>The proposed system integrates an attention-based convolutional neural network combining EfficientNetV2-B3 with Squeeze-and-Excitation mechanisms for automated ALL cell classification.<n>Our approach employs comprehensive data augmentation, focal loss for class imbalance, and patient-wise data splitting to ensure robust and reproducible evaluation.
- Score: 0.5371337604556311
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a reproducible deep learning pipeline for leukemic cell classification, focusing on system architecture, experimental robustness, and software design choices for medical image analysis. Acute lymphoblastic leukemia (ALL) is the most common childhood cancer, requiring expert microscopic diagnosis that suffers from inter-observer variability and time constraints. The proposed system integrates an attention-based convolutional neural network combining EfficientNetV2-B3 with Squeeze-and-Excitation mechanisms for automated ALL cell classification. Our approach employs comprehensive data augmentation, focal loss for class imbalance, and patient-wise data splitting to ensure robust and reproducible evaluation. On the C-NMC 2019 dataset (12,528 original images from 62 patients), the system achieves a 97.89% F1-score and 97.89% accuracy on the test set, with statistical validation through 100-iteration Monte Carlo experiments confirming significant improvements (p < 0.001) over baseline methods. The proposed pipeline outperforms existing approaches by up to 4.67% while using 89% fewer parameters than VGG16 (15.2M vs. 138M). The attention mechanism provides interpretable visualizations of diagnostically relevant cellular features, demonstrating that modern attention-based architectures can improve leukemic cell classification while maintaining computational efficiency suitable for clinical deployment.
Related papers
- Enhancing Generalization in Sickle Cell Disease Diagnosis through Ensemble Methods and Feature Importance Analysis [2.8601718604194706]
This work presents a novel approach for selecting the optimal ensemble-based classification method and features.<n>It provides diagnostic support for Sickle Cell Disease using peripheral blood smear images of red blood cells.
arXiv Detail & Related papers (2026-01-19T13:00:41Z) - Prediction of Cellular Malignancy Using Electrical Impedance Signatures and Supervised Machine Learning [0.0]
This study systematically reviewed 33 scholarly articles to compile datasets of quantitative bioelectric parameters.<n>Three supervised machine learning algorithms- Random Forest (RF), Support Vector Machine (SVM), and K-Nearest Neighbor (KNN) were implemented and tuned.<n>Results demonstrate that Random Forest achieved the highest predictive accuracy of 90% when configured with a maximum depth of 4 and 100 estimators.
arXiv Detail & Related papers (2026-01-08T01:30:52Z) - CytoDINO: Risk-Aware and Biologically-Informed Adaptation of DINOv3 for Bone Marrow Cytomorphology [0.0]
We introduce CytoDINO, a framework that achieves state-of-the-art performance on the Munich Leukemia Laboratory dataset.<n>Our primary contribution is a novel Hierarchical Focal Loss with Critical Penalties, which encodes biological relationships between cell lineages and explicitly penalizes clinically dangerous misclassifications.<n>CytoDINO achieves an 88.2% weighted F1 score and 76.5% macro F1 on a held-out test set of 21 cell classes.
arXiv Detail & Related papers (2025-12-09T23:09:22Z) - Transparent Early ICU Mortality Prediction with Clinical Transformer and Per-Case Modality Attribution [42.85462513661566]
We present a lightweight, transparent multimodal ensemble that fuses physiological time-series measurements with unstructured clinical notes from the first 48 hours of an ICU stay.<n>A logistic regression model combines predictions from two modality-specific models: a bidirectional LSTM for vitals and a finetuned ClinicalModernBERT transformer for notes.<n>On the MIMIC-III benchmark, our late-fusion ensemble improves discrimination over the best single model while maintaining well-calibrated predictions.
arXiv Detail & Related papers (2025-11-19T20:11:49Z) - Deep Learning with Self-Attention and Enhanced Preprocessing for Precise Diagnosis of Acute Lymphoblastic Leukemia from Bone Marrow Smears in Hemato-Oncology [0.0]
Acute lymphoblastic leukemia (ALL) is a prevalent hematological malignancy in both pediatric and adult populations.<n>We present a deep learning framework for automated ALL diagnosis from bone marrow smear images.
arXiv Detail & Related papers (2025-08-24T05:30:02Z) - Skin Cancer Classification: Hybrid CNN-Transformer Models with KAN-Based Fusion [0.0]
We explore Sequential and Parallel Hybrid CNN-Transformer models with Convolutional Kolmogorov-Arnold Network (CKAN)<n>Our approach integrates transfer learning and extensive data augmentation, where CNNs extract local spatial features, Transformers model global dependencies, and CKAN facilitates nonlinear feature fusion for improved representation learning.<n>Our proposed approach achieves competitive performance in skin cancer classification, demonstrating 92.81% accuracy and 92.47% F1-score on the HAM10000 dataset, 97.83% accuracy and 97.83% F1-score on the PAD-UFES dataset, and 91.17% accuracy with 91.79% F1- score on
arXiv Detail & Related papers (2025-08-17T19:57:34Z) - Neural Cellular Automata for Weakly Supervised Segmentation of White Blood Cells [37.639376961641666]
We propose a novel approach for weakly supervised segmentation using neural cellular automata (NCA)<n>We evaluate our method on three white blood cell microscopy datasets and demonstrate that NCA-WSS significantly outperforms existing weakly supervised approaches.
arXiv Detail & Related papers (2025-08-17T10:40:16Z) - Automatic Cough Analysis for Non-Small Cell Lung Cancer Detection [33.37223681850477]
Early detection of non-small cell lung cancer (NSCLC) is critical for improving patient outcomes.<n>We explore the use of automatic cough analysis as a pre-screening tool for distinguishing between NSCLC patients and healthy controls.<n>Recordings were analyzed using machine learning techniques, such as support vector machine (SVM) and XGBoost.
arXiv Detail & Related papers (2025-07-25T11:30:22Z) - HANS-Net: Hyperbolic Convolution and Adaptive Temporal Attention for Accurate and Generalizable Liver and Tumor Segmentation in CT Imaging [2.7205074719266062]
Accurate liver and tumor segmentation on abdominal CT images is critical for reliable diagnosis and treatment planning.<n>We introduce Hyperbolic-convolutions Adaptive-temporal-attention with Neural-representation and Synaptic-plasticity Network (HANS-Net)<n>HANS-Net combines hyperbolic convolutions for hierarchical geometric representation, a wavelet-inspired decomposition module for multi-scale texture learning, and an implicit neural representation branch.
arXiv Detail & Related papers (2025-07-15T13:56:37Z) - CNN-LSTM Hybrid Model for AI-Driven Prediction of COVID-19 Severity from Spike Sequences and Clinical Data [0.0]
We developed a CNN-LSTM hybrid model to predict COVID-19 severity using spike protein sequences and clinical data.<n>The model achieved an F1 score of 82.92%, ROC-AUC of 0.9084, precision of 83.56%, and recall of 82.85%.
arXiv Detail & Related papers (2025-05-29T16:20:54Z) - Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images [40.347953893940044]
We introduce a novel approach for white blood cell classification based on neural cellular automata (NCA)
Our NCA-based method is significantly smaller in terms of parameters and exhibits robustness to domain shifts.
Our results demonstrate that NCA can be used for image classification, and they address key challenges of conventional methods.
arXiv Detail & Related papers (2024-04-08T14:59:53Z) - Anomaly Detection in Electrocardiograms: Advancing Clinical Diagnosis Through Self-Supervised Learning [32.37717219026923]
Existing systems often miss rare cardiac anomalies that could be precursors to serious, life-threatening issues or alterations in the cardiac macro/microstructure.
We focus on self-supervised anomaly detection (AD), training exclusively on normal ECGs to recognize deviations indicating anomalies.
We introduce a novel self-supervised learning framework for ECG AD, utilizing a vast dataset of normal ECGs to autonomously detect and localize cardiac anomalies.
arXiv Detail & Related papers (2024-04-07T12:15:53Z) - CIMIL-CRC: a clinically-informed multiple instance learning framework for patient-level colorectal cancer molecular subtypes classification from H\&E stained images [42.771819949806655]
We introduce CIMIL-CRC', a framework that solves the MSI/MSS MIL problem by efficiently combining a pre-trained feature extraction model with principal component analysis (PCA) to aggregate information from all patches.
We assessed our CIMIL-CRC method using the average area under the curve (AUC) from a 5-fold cross-validation experimental setup for model development on the TCGA-CRC-DX cohort.
arXiv Detail & Related papers (2024-01-29T12:56:11Z) - Improving COVID-19 CT Classification of CNNs by Learning
Parameter-Efficient Representation [31.51725965329019]
Deep learning methods have been proposed to assist clinicians in automatic COVID-19 diagnosis based on computed tomography imaging.
DenseNet121 achieves an average test accuracy of 99.44% in three trials for three-category classification, including normal, non-COVID-19 pneumonia, and COVID-19 pneumonia.
arXiv Detail & Related papers (2022-08-09T12:24:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.