RAA-MIL: A Novel Framework for Classification of Oral Cytology
- URL: http://arxiv.org/abs/2511.12269v1
- Date: Sat, 15 Nov 2025 15:48:36 GMT
- Title: RAA-MIL: A Novel Framework for Classification of Oral Cytology
- Authors: Rupam Mukherjee, Rajkumar Daniel, Soujanya Hazra, Shirin Dasgupta, Subhamoy Mandal,
- Abstract summary: We introduce the first weakly supervised deep learning framework for patient-level diagnosis of oral Cytology whole slide images.<n>This study establishes the first patient-level weakly supervised benchmark for oral Cytology and moves toward reliable AI-assisted digital pathology.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Cytology is a valuable tool for early detection of oral squamous cell carcinoma (OSCC). However, manual examination of cytology whole slide images (WSIs) is slow, subjective, and depends heavily on expert pathologists. To address this, we introduce the first weakly supervised deep learning framework for patient-level diagnosis of oral cytology whole slide images, leveraging the newly released Oral Cytology Dataset [1], which provides annotated cytology WSIs from ten medical centres across India. Each patient case is represented as a bag of cytology patches and assigned a diagnosis label (Healthy, Benign, Oral Potentially Malignant Disorders (OPMD), OSCC) by an in-house expert pathologist. These patient-level weak labels form a new extension to the dataset. We evaluate a baseline multiple-instance learning (MIL) model and a proposed Region-Affinity Attention MIL (RAA-MIL) that models spatial relationships between regions within each slide. The RAA-MIL achieves an average accuracy of 72.7%, weighted F1-score of 0.69 on an unseen test set, outperforming the baseline. This study establishes the first patient-level weakly supervised benchmark for oral cytology and moves toward reliable AI-assisted digital pathology.
Related papers
- Multi-View Stenosis Classification Leveraging Transformer-Based Multiple-Instance Learning Using Real-World Clinical Data [76.89269238957593]
Coronary artery stenosis is a leading cause of cardiovascular disease, diagnosed by analyzing the coronary arteries from multiple angiography views.<n>We propose SegmentMIL, a transformer-based multi-view multiple-instance learning framework for patient-level stenosis classification.
arXiv Detail & Related papers (2026-02-02T13:07:52Z) - An Explainable Hybrid AI Framework for Enhanced Tuberculosis and Symptom Detection [55.35661671061754]
Tuberculosis remains a critical global health issue, particularly in resource-limited and remote areas.<n>We propose a framework which enhances disease and symptom detection on chest X-rays by integrating two supervised heads and a self-supervised head.<n>Our model achieves an accuracy of 98.85% for distinguishing between COVID-19, tuberculosis, and normal cases, and a macro-F1 score of 90.09% for multilabel symptom detection.
arXiv Detail & Related papers (2025-10-21T17:18:55Z) - Towards a Comprehensive Benchmark for Pathological Lymph Node Metastasis in Breast Cancer Sections [21.75452517154339]
We reprocessed 1,399 whole slide images (WSIs) and labels from the Camelyon-16 and Camelyon-17 datasets.
Based on the sizes of re-annotated tumor regions, we upgraded the binary cancer screening task to a four-class task.
arXiv Detail & Related papers (2024-11-16T09:19:24Z) - Large-scale cervical precancerous screening via AI-assisted cytology whole slide image analysis [11.148919818020495]
Cervical Cancer continues to be the leading gynecological malignancy, posing a persistent threat to women's health on a global scale.
Early screening via Whole Slide Image (WSI) diagnosis is critical to prevent this Cancer progression and improve survival rate.
But pathologist's single test suffers inevitable false negative due to the immense number of cells that need to be reviewed within a WSI.
arXiv Detail & Related papers (2024-07-28T15:29:07Z) - CIMIL-CRC: a clinically-informed multiple instance learning framework for patient-level colorectal cancer molecular subtypes classification from H\&E stained images [42.771819949806655]
We introduce CIMIL-CRC', a framework that solves the MSI/MSS MIL problem by efficiently combining a pre-trained feature extraction model with principal component analysis (PCA) to aggregate information from all patches.
We assessed our CIMIL-CRC method using the average area under the curve (AUC) from a 5-fold cross-validation experimental setup for model development on the TCGA-CRC-DX cohort.
arXiv Detail & Related papers (2024-01-29T12:56:11Z) - Active Learning Enhances Classification of Histopathology Whole Slide
Images with Attention-based Multiple Instance Learning [48.02011627390706]
We train an attention-based MIL and calculate a confidence metric for every image in the dataset to select the most uncertain WSIs for expert annotation.
With a novel attention guiding loss, this leads to an accuracy boost of the trained models with few regions annotated for each class.
It may in the future serve as an important contribution to train MIL models in the clinically relevant context of cancer classification in histopathology.
arXiv Detail & Related papers (2023-03-02T15:18:58Z) - Learning to diagnose cirrhosis from radiological and histological labels
with joint self and weakly-supervised pretraining strategies [62.840338941861134]
We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
arXiv Detail & Related papers (2023-02-16T17:06:23Z) - Trustworthy Visual Analytics in Clinical Gait Analysis: A Case Study for
Patients with Cerebral Palsy [43.55994393060723]
gaitXplorer is a visual analytics approach for the classification of CP-related gait patterns.
It integrates Grad-CAM, a well-established explainable artificial intelligence algorithm, for explanations of machine learning classifications.
arXiv Detail & Related papers (2022-08-10T09:21:28Z) - Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis
of Gastric Intestinal Metaplasia [5.64692772904991]
We propose a sparse WSI analysis method for the rapid identification of high-power ROI for WSI-level classification.
We test our method on a common but time-consuming task in pathology - that of diagnosing gastric intestinal metaplasia (GIM) on hematoxylin and eosin slides.
Our method successfully detects GIM in all positive WSI, with a WSI-level classification area under the receiver operating characteristic curve (AUC) of 0.98 and an average precision (AP) of 0.95.
arXiv Detail & Related papers (2022-01-05T04:43:46Z) - Histogram of Cell Types: Deep Learning for Automated Bone Marrow
Cytology [3.8385120184415418]
Histogram of Cell Types (HCT) is a novel representation of bone marrow cell class probability distribution.
HCT has potential to revolutionize hematopathology diagnostic, leading to more cost-effective, accurate diagnosis and opening the door to precision medicine.
arXiv Detail & Related papers (2021-07-05T21:55:00Z) - Deeply supervised UNet for semantic segmentation to assist
dermatopathological assessment of Basal Cell Carcinoma (BCC) [2.031570465477242]
We focus on detecting Basal Cell Carcinoma (BCC) through semantic segmentation using several models based on the UNet architecture.
We analyze two different encoders for the first part of the UNet network and two additional training strategies.
The best model achieves over 96%, accuracy, sensitivity, and specificity on the test set.
arXiv Detail & Related papers (2021-03-05T15:39:55Z) - Co-Heterogeneous and Adaptive Segmentation from Multi-Source and
Multi-Phase CT Imaging Data: A Study on Pathological Liver and Lesion
Segmentation [48.504790189796836]
We present a novel segmentation strategy, co-heterogenous and adaptive segmentation (CHASe)
We propose a versatile framework that fuses appearance based semi-supervision, mask based adversarial domain adaptation, and pseudo-labeling.
CHASe can further improve pathological liver mask Dice-Sorensen coefficients by ranges of $4.2% sim 9.4%$.
arXiv Detail & Related papers (2020-05-27T06:58:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.