IMILIA: interpretable multiple instance learning for inflammation prediction in IBD from H&E whole slide images
- URL: http://arxiv.org/abs/2512.13440v1
- Date: Mon, 15 Dec 2025 15:36:09 GMT
- Title: IMILIA: interpretable multiple instance learning for inflammation prediction in IBD from H&E whole slide images
- Authors: Thalyssa Baiocco-Rodrigues, Antoine Olivier, Reda Belbahri, Thomas Duboudin, Pierre-Antoine Bannier, Benjamin Adjadj, Katharina Von Loga, Nathan Noiry, Maxime Touzot, Hector Roux de Bezieux,
- Abstract summary: We introduce IMILIA, an end-to-end framework for prediction of inflammation presence in IBD digitized slides stained with hematoxylin and eosin (H&E)<n> IMILIA is composed of an inflammation prediction module, and an interpretability module, divided in two blocks: HistoPLUS, for cell instance detection, and EpiSeg, for epithelium segmentation.<n>The interpretability module yields biologically consistent insights: tiles with higher predicted scores show increased densities of immune cells, whereas lower-scored tiles predominantly contain normal epithelial cells.
- Score: 2.5182612992285844
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: As the therapeutic target for Inflammatory Bowel Disease (IBD) shifts toward histologic remission, the accurate assessment of microscopic inflammation has become increasingly central for evaluating disease activity and response to treatment. In this work, we introduce IMILIA (Interpretable Multiple Instance Learning for Inflammation Analysis), an end-to-end framework designed for the prediction of inflammation presence in IBD digitized slides stained with hematoxylin and eosin (H&E), followed by the automated computation of markers characterizing tissue regions driving the predictions. IMILIA is composed of an inflammation prediction module, consisting of a Multiple Instance Learning (MIL) model, and an interpretability module, divided in two blocks: HistoPLUS, for cell instance detection, segmentation and classification; and EpiSeg, for epithelium segmentation. IMILIA achieves a cross-validation ROC-AUC of 0.83 on the discovery cohort, and a ROC-AUC of 0.99 and 0.84 on two external validation cohorts. The interpretability module yields biologically consistent insights: tiles with higher predicted scores show increased densities of immune cells (lymphocytes, plasmocytes, neutrophils and eosinophils), whereas lower-scored tiles predominantly contain normal epithelial cells. Notably, these patterns were consistent across all datasets. Code and models to partially replicate the results on the public IBDColEpi dataset can be found at https://github.com/owkin/imilia.
Related papers
- Investigating the Impact of Histopathological Foundation Models on Regressive Prediction of Homologous Recombination Deficiency [52.50039435394964]
We systematically evaluate foundation models for regression-based tasks.<n>We extract patch-level features from whole slide images (WSI) using five state-of-the-art foundation models.<n>Models are trained to predict continuous HRD scores based on these extracted features across breast, endometrial, and lung cancer cohorts.
arXiv Detail & Related papers (2026-01-29T14:06:50Z) - RAA-MIL: A Novel Framework for Classification of Oral Cytology [0.0]
We introduce the first weakly supervised deep learning framework for patient-level diagnosis of oral Cytology whole slide images.<n>This study establishes the first patient-level weakly supervised benchmark for oral Cytology and moves toward reliable AI-assisted digital pathology.
arXiv Detail & Related papers (2025-11-15T15:48:36Z) - Leveraging Sparse Annotations for Leukemia Diagnosis on the Large Leukemia Dataset [44.948939549346676]
Leukemia is the 10th most frequently diagnosed cancer and one of the leading causes of cancer-related deaths worldwide.<n>Despite deep learning advances in medical imaging, leukemia analysis lacks a large, diverse multi-task dataset.<n>We present a large-scale WBC dataset and novel methods for detecting WBC with their attributes.
arXiv Detail & Related papers (2025-04-03T14:04:02Z) - Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes [37.440449828136586]
Investigation focuses on a novel approach termed DE-ViT.<n>This methodology is employed in a Few-Shot paradigm, wherein training relies on a limited number of images.<n>While DE-ViT has demonstrated state-of-the-art performance on the COCO and LVIS datasets, both baseline models surpassed its performance on the Raabin-WBC dataset.
arXiv Detail & Related papers (2025-03-21T12:46:49Z) - Phenome-wide causal proteomics enhance systemic lupus erythematosus flare prediction: A study in Asian populations [8.003585790309332]
Systemic lupus erythematosus (SLE) is a complex autoimmune disease characterized by unpredictable flares.
This study aimed to develop a novel lupus-based risk prediction model specifically for Asian SLE populations.
arXiv Detail & Related papers (2024-11-18T01:50:36Z) - A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification [1.024113475677323]
Leukemia is diagnosed by analyzing blood and bone marrow smears under a microscope, with additional cytochemical tests for confirmation.
Deep learning has provided advanced methods for classifying microscopic smear images, aiding in the detection of leukemic cells.
In this research, hybrid deep learning models were implemented to classify Acute lymphoblastic leukemia (ALL)
The proposed method achieved a remarkable detection accuracy rate of 100% on the ALL-IDB1 dataset, 98.07% on the ALL-IDB2 dataset, and 98.64% on the combined dataset.
arXiv Detail & Related papers (2024-10-18T15:23:34Z) - TACCO: Task-guided Co-clustering of Clinical Concepts and Patient Visits for Disease Subtyping based on EHR Data [42.96821770394798]
TACCO is a novel framework that jointly discovers clusters of clinical concepts and patient visits based on a hypergraph modeling of EHR data.
We conduct experiments on the public MIMIC-III dataset and Emory internal CRADLE dataset over the downstream clinical tasks of phenotype classification and cardiovascular risk prediction.
In-depth model analysis, clustering results analysis, and clinical case studies further validate the improved utilities and insightful interpretations delivered by TACCO.
arXiv Detail & Related papers (2024-06-14T14:18:38Z) - CIMIL-CRC: a clinically-informed multiple instance learning framework for patient-level colorectal cancer molecular subtypes classification from H\&E stained images [42.771819949806655]
We introduce CIMIL-CRC', a framework that solves the MSI/MSS MIL problem by efficiently combining a pre-trained feature extraction model with principal component analysis (PCA) to aggregate information from all patches.
We assessed our CIMIL-CRC method using the average area under the curve (AUC) from a 5-fold cross-validation experimental setup for model development on the TCGA-CRC-DX cohort.
arXiv Detail & Related papers (2024-01-29T12:56:11Z) - Neural Network-Based Histologic Remission Prediction In Ulcerative
Colitis [38.150634108667774]
Histologic remission is a new therapeutic target in ulcerative colitis (UC)
Endocytoscopy (EC) is a novel ultra-high magnification endoscopic technique.
We propose a neural network model that can assess histological disease activity in EC images.
arXiv Detail & Related papers (2023-08-28T15:54:14Z) - Interpretable histopathology-based prediction of disease relevant
features in Inflammatory Bowel Disease biopsies using weakly-supervised deep
learning [0.8521205677945196]
Crohn's Disease (CD) and Ulcerative Colitis (UC) are the two main Inflammatory Bowel Disease (IBD) types.
We developed deep learning models to identify histological disease features for both CD and UC using only endoscopic labels.
arXiv Detail & Related papers (2023-03-20T15:59:29Z) - Learning to diagnose cirrhosis from radiological and histological labels
with joint self and weakly-supervised pretraining strategies [62.840338941861134]
We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
arXiv Detail & Related papers (2023-02-16T17:06:23Z) - Corneal endothelium assessment in specular microscopy images with Fuchs'
dystrophy via deep regression of signed distance maps [48.498376125522114]
This paper proposes a UNet-based segmentation approach that requires minimal post-processing.
It achieves reliable CE morphometric assessment and guttae identification across all degrees of Fuchs' dystrophy.
arXiv Detail & Related papers (2022-10-13T15:34:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.