Engineering Spatial and Molecular Features from Cellular Niches to Inform Predictions of Inflammatory Bowel Disease
- URL: http://arxiv.org/abs/2509.09923v1
- Date: Fri, 12 Sep 2025 02:10:41 GMT
- Title: Engineering Spatial and Molecular Features from Cellular Niches to Inform Predictions of Inflammatory Bowel Disease
- Authors: Myles Joshua Toledo Tan, Maria Kapetanaki, Panayiotis V. Benos,
- Abstract summary: Differentiating between the two main subtypes of Inflammatory Bowel Disease (IBD): Crohns disease (CD) and ulcerative colitis (UC) is a persistent clinical challenge.<n>This study introduces a novel computational framework that employs spatial transcriptomics (ST) to create an explainable machine learning model for IBD classification.
- Score: 1.043230260556633
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Differentiating between the two main subtypes of Inflammatory Bowel Disease (IBD): Crohns disease (CD) and ulcerative colitis (UC) is a persistent clinical challenge due to overlapping presentations. This study introduces a novel computational framework that employs spatial transcriptomics (ST) to create an explainable machine learning model for IBD classification. We analyzed ST data from the colonic mucosa of healthy controls (HC), UC, and CD patients. Using Non-negative Matrix Factorization (NMF), we first identified four recurring cellular niches, representing distinct functional microenvironments within the tissue. From these niches, we systematically engineered 44 features capturing three key aspects of tissue pathology: niche composition, neighborhood enrichment, and niche-gene signals. A multilayer perceptron (MLP) classifier trained on these features achieved an accuracy of 0.774 +/- 0.161 for the more challenging three-class problem (HC, UC, and CD) and 0.916 +/- 0.118 in the two-class problem of distinguishing IBD from healthy tissue. Crucially, model explainability analysis revealed that disruptions in the spatial organization of niches were the strongest predictors of general inflammation, while the classification between UC and CD relied on specific niche-gene expression signatures. This work provides a robust, proof-of-concept pipeline that transforms descriptive spatial data into an accurate and explainable predictive tool, offering not only a potential new diagnostic paradigm but also deeper insights into the distinct biological mechanisms that drive IBD subtypes.
Related papers
- R-GenIMA: Integrating Neuroimaging and Genetics with Interpretable Multimodal AI for Alzheimer's Disease Progression [63.97617759805451]
Early detection of Alzheimer's disease requires models capable of integrating macro-scale neuroanatomical alterations with micro-scale genetic susceptibility.<n>We introduce R-GenIMA, an interpretable multimodal large language model that couples a novel ROI-wise vision transformer with genetic prompting.<n>R-GenIMA achieves state-of-the-art performance in four-way classification across normal cognition, subjective memory concerns, mild cognitive impairment, and AD.
arXiv Detail & Related papers (2025-12-22T02:54:10Z) - A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis [82.01597026329158]
We introduce a Correlation-Regulated Alignment Framework for Tissue Synthesis (CRAFTS) for pathology-specific text-to-image synthesis.<n>CRAFTS incorporates a novel alignment mechanism that suppresses semantic drift to ensure biological accuracy.<n>This model generates diverse pathological images spanning 30 cancer types, with quality rigorously validated by objective metrics and pathologist evaluations.
arXiv Detail & Related papers (2025-12-15T10:22:43Z) - From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC [37.284994932355865]
We introduce Star-Diff, a structure-aware staining restoration diffusion model that reformulates virtual staining as an image restoration task.<n>By combining residual and noise-based generation pathways, Star-Diff maintains tissue structure while modeling realistic biomarker variability.<n> Experiments on the BCI dataset demonstrate that Star-Diff achieves state-of-the-art (SOTA) performance in both visual fidelity and diagnostic relevance.
arXiv Detail & Related papers (2025-08-04T15:36:58Z) - ROIsGAN: A Region Guided Generative Adversarial Framework for Murine Hippocampal Subregion Segmentation [0.0]
The hippocampus is a critical brain structure involved in memory processing and various neurodegenerative and psychiatric disorders.<n>No existing methods address the automated segmentation of hippocampal subregions from tissue images.<n>We propose ROIsGAN, a region-guided U-Net-based generative adversarial network tailored for hippocampal subregion segmentation.
arXiv Detail & Related papers (2025-05-15T20:11:50Z) - Interpretable Graph Kolmogorov-Arnold Networks for Multi-Cancer Classification and Biomarker Identification using Multi-Omics Data [36.92842246372894]
Multi-Omics Graph Kolmogorov-Arnold Network (MOGKAN) is a deep learning framework that utilizes messenger-RNA, micro-RNA sequences, and DNA methylation samples.<n>By integrating multi-omics data with graph-based deep learning, our proposed approach demonstrates robust predictive performance and interpretability.
arXiv Detail & Related papers (2025-03-29T02:14:05Z) - Querying functional and structural niches on spatial transcriptomics data [7.240034062898855]
spatial transcriptomics enables gene expression profiling in spatial contexts.<n>It has been revealed that spatial niches serve as cohesive and recurrent units in physiological and pathological processes.<n>We defined the Niche Query Task, which is to identify similar niches across ST samples given a niche of interest (NOI)<n>We developed QueST, a specialized method for solving this task.
arXiv Detail & Related papers (2024-10-14T16:01:27Z) - Interpretable histopathology-based prediction of disease relevant
features in Inflammatory Bowel Disease biopsies using weakly-supervised deep
learning [0.8521205677945196]
Crohn's Disease (CD) and Ulcerative Colitis (UC) are the two main Inflammatory Bowel Disease (IBD) types.
We developed deep learning models to identify histological disease features for both CD and UC using only endoscopic labels.
arXiv Detail & Related papers (2023-03-20T15:59:29Z) - Meta-information-aware Dual-path Transformer for Differential Diagnosis
of Multi-type Pancreatic Lesions in Multi-phase CT [41.199716328468895]
We develop a dual-path transformer to exploit the feasibility of classification and segmentation of pancreatic lesions.
The proposed method consists of a CNN-based segmentation path (S-path) and a transformer-based classification path (C-path)
Our results show that our method can enable accurate classification and segmentation of the full taxonomy of pancreatic lesions.
arXiv Detail & Related papers (2023-03-02T03:34:28Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Multimodal Gait Recognition for Neurodegenerative Diseases [38.06704951209703]
We propose a novel hybrid model to learn the gait differences between three neurodegenerative diseases.
A new correlative memory neural network architecture is designed for extracting temporal features.
Compared with several state-of-the-art techniques, our proposed framework shows more accurate classification results.
arXiv Detail & Related papers (2021-01-07T10:17:11Z) - MAGIC: Multi-scale Heterogeneity Analysis and Clustering for Brain
Diseases [3.955454029331185]
We introduce a novel method, MAGIC, to uncover disease heterogeneity by leveraging multi-scale clustering.
We validate MAGIC using simulated heterogeneous neuroanatomical data and demonstrate its clinical potential by exploring the heterogeneity of Alzheimers Disease (AD)
Our results indicate two main subtypes of AD with distinct atrophy patterns that consist of both fine-scale atrophy in the hippocampus as well as large-scale atrophy in cortical regions.
arXiv Detail & Related papers (2020-07-01T23:42:37Z) - Diagnosis of Coronavirus Disease 2019 (COVID-19) with Structured Latent
Multi-View Representation Learning [48.05232274463484]
Recently, the outbreak of Coronavirus Disease 2019 (COVID-19) has spread rapidly across the world.
Due to the large number of affected patients and heavy labor for doctors, computer-aided diagnosis with machine learning algorithm is urgently needed.
In this study, we propose to conduct the diagnosis of COVID-19 with a series of features extracted from CT images.
arXiv Detail & Related papers (2020-05-06T15:19:15Z) - Inflammatory Bowel Disease Biomarkers of Human Gut Microbiota Selected
via Ensemble Feature Selection Methods [0.0]
Inflammatory Bowel Diseases (IBD), diabetes, and cancer can cause several diseases such as Inflammatory Bowel Diseases (IBD), diabetes, and cancer.
IBD, is a gut related disorder where the deviations from the healthy gut microbiome are considered to be associated with IBD.
This study utilizes both supervised and unsupervised machine learning algorithms to generate a classification model that aids IBD diagnosis.
arXiv Detail & Related papers (2020-01-08T13:17:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.