Early Detection of Pancreatic Cancer Using Multimodal Learning on Electronic Health Records
- URL: http://arxiv.org/abs/2508.06627v3
- Date: Mon, 18 Aug 2025 23:48:23 GMT
- Title: Early Detection of Pancreatic Cancer Using Multimodal Learning on Electronic Health Records
- Authors: Mosbah Aouad, Anirudh Choudhary, Awais Farooq, Steven Nevers, Lusine Demirkhanyan, Bhrandon Harris, Suguna Pappu, Christopher Gondi, Ravishankar Iyer,
- Abstract summary: Pancreatic ductal adenocarcinoma (PDAC) is one of the deadliest cancers.<n>Early detection remains a major clinical challenge.<n>We develop and evaluate our approach on a real-world dataset of nearly 4,700 patients.
- Score: 0.7770975325131922
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Pancreatic ductal adenocarcinoma (PDAC) is one of the deadliest cancers, and early detection remains a major clinical challenge due to the absence of specific symptoms and reliable biomarkers. In this work, we propose a new multimodal approach that integrates longitudinal diagnosis code histories and routinely collected laboratory measurements from electronic health records to detect PDAC up to one year prior to clinical diagnosis. Our method combines neural controlled differential equations to model irregular lab time series, pretrained language models and recurrent networks to learn diagnosis code trajectory representations, and cross-attention mechanisms to capture interactions between the two modalities. We develop and evaluate our approach on a real-world dataset of nearly 4,700 patients and achieve significant improvements in AUC ranging from 6.5% to 15.5% over state-of-the-art methods. Furthermore, our model identifies diagnosis codes and laboratory panels associated with elevated PDAC risk, including both established and new biomarkers. Our code is available at https://github.com/MosbahAouad/EarlyPDAC-MML.
Related papers
- One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training [45.49415063761575]
EndoRare is a one-shot, retraining-free generative framework that synthesizes diverse, high-fidelity lesion exemplars from a single reference image.<n>We validated the framework across four rare pathologies.<n>These results establish a practical, data-efficient pathway to bridge the rare-disease gap in both computer-aided diagnostics and clinical education.
arXiv Detail & Related papers (2025-12-30T15:07:09Z) - Early Lung Cancer Diagnosis from Virtual Follow-up LDCT Generation via Correlational Autoencoder and Latent Flow Matching [18.151404268026244]
In clinical practice, a patient with a high risk may need to undergo an initial baseline and several annual follow-up examinations.<n>Recently, Artificial Intelligence (AI) methods have been increasingly used for early diagnosis of lung cancer.<n>This paper proposes a generative method, named CorrFlowNet, which creates a virtual, one-year follow-up CT scan after the initial baseline scan.
arXiv Detail & Related papers (2025-11-22T20:40:05Z) - Deep Learning Framework for Early Detection of Pancreatic Cancer Using Multi-Modal Medical Imaging Analysis [0.7161783472741748]
Pacreatic ductal adenocarcinoma (PDAC) remains one of the most lethal forms of cancer.<n>This research develops and validates a deep learning framework for early PDAC detection.
arXiv Detail & Related papers (2025-08-28T15:07:04Z) - A Novel Multimodal Framework for Early Detection of Alzheimers Disease Using Deep Learning [0.0]
Alzheimers Disease (AD) is a progressive neurodegenerative disorder that poses significant challenges in its early diagnosis.<n>Traditional diagnostic methods fall short of capturing the multifaceted nature of the disease.<n>We propose a novel framework for the early detection of AD that integrates data from three primary sources: MRI imaging, cognitive assessments, and biomarkers.
arXiv Detail & Related papers (2025-08-05T03:46:59Z) - ColonScopeX: Leveraging Explainable Expert Systems with Multimodal Data for Improved Early Diagnosis of Colorectal Cancer [3.541280502270993]
Colorectal cancer (CRC) ranks as the second leading cause of cancer-related deaths and the third most prevalent malignant tumour worldwide.<n>Early detection of CRC remains problematic due to its non-specific and often embarrassing symptoms.<n>We propose ColonScopeX, a machine learning framework utilizing explainable AI (XAI) methodologies to enhance the early detection of CRC and pre-cancerous lesions.
arXiv Detail & Related papers (2025-04-09T20:45:11Z) - Multimodal AI-driven Biomarker for Early Detection of Cancer Cachexia [14.27396467108753]
Cancer cachexia is a multifactorial syndrome characterized by progressive muscle wasting, metabolic dysfunction, and systemic inflammation.<n>There is no single definitive biomarker for cachexia.<n>This study proposes a multimodal AI-based biomarker for early cancer cachexia detection.
arXiv Detail & Related papers (2025-03-09T22:32:37Z) - Multi-modal Medical Image Fusion For Non-Small Cell Lung Cancer Classification [7.002657345547741]
Non-small cell lung cancer (NSCLC) is a predominant cause of cancer mortality worldwide.
In this paper, we introduce an innovative integration of multi-modal data, synthesizing fused medical imaging (CT and PET scans) with clinical health records and genomic data.
Our research surpasses existing approaches, as evidenced by a substantial enhancement in NSCLC detection and classification precision.
arXiv Detail & Related papers (2024-09-27T12:59:29Z) - CIResDiff: A Clinically-Informed Residual Diffusion Model for Predicting Idiopathic Pulmonary Fibrosis Progression [38.14873567230233]
Idiopathic Pulmonary Fibrosis (IPF) significantly correlates with higher patient mortality rates.
Current clinical criteria define disease progression requiring two CT scans with a one-year interval.
We develop a novel diffusion model to accurately predict the progression of IPF by generating patient's follow-up CT scan.
arXiv Detail & Related papers (2024-08-01T22:01:42Z) - Combining low-dose CT-based radiomics and metabolomics for early lung
cancer screening support [32.586316762855944]
Lung cancer is often diagnosed in advanced stages, resulting in poorer survival rates for patients.
Early diagnosis can be facilitated through screening programs designed to detect lung tissue tumors when they are still small, typically around 3mm in size.
arXiv Detail & Related papers (2023-09-20T12:07:16Z) - Open-Set Recognition of Breast Cancer Treatments [91.3247063132127]
Open-set recognition generalizes a classification task by classifying test samples as one of the known classes from training or "unknown"
We apply a recent existing Gaussian mixture variational autoencoder model, which achieves state-of-the-art results for image datasets, to breast cancer patient data.
Not only do we obtain more accurate and robust classification results, with a 24.5% average F1 increase compared to a recent method, but we also reexamine open-set recognition in terms of deployability to a clinical setting.
arXiv Detail & Related papers (2022-01-09T04:35:55Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Inheritance-guided Hierarchical Assignment for Clinical Automatic
Diagnosis [50.15205065710629]
Clinical diagnosis, which aims to assign diagnosis codes for a patient based on the clinical note, plays an essential role in clinical decision-making.
We propose a novel framework to combine the inheritance-guided hierarchical assignment and co-occurrence graph propagation for clinical automatic diagnosis.
arXiv Detail & Related papers (2021-01-27T13:16:51Z) - Hierarchical Classification of Pulmonary Lesions: A Large-Scale
Radio-Pathomics Study [38.78350161086617]
Diagnosis of pulmonary lesions from computed tomography (CT) is important but challenging for clinical decision making in lung cancer related diseases.
Deep learning has achieved great success in computer aided diagnosis (CADx) area for lung cancer, whereas it suffers from label ambiguity due to the difficulty in the radiological diagnosis.
Considering that invasive pathological analysis serves as the clinical golden standard of lung cancer diagnosis, in this study, we solve the label ambiguity issue via a large-scale radio-pathomics dataset.
This retrospective dataset, named Pulmonary-RadPath, enables development and validation of accurate deep learning systems to predict invasive pathological labels with a non-
arXiv Detail & Related papers (2020-10-08T15:14:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.