Related papers: Multimodal Forecasting of Sparse Intraoperative Hypotension Events Powered by Language Model

Multimodal Forecasting of Sparse Intraoperative Hypotension Events Powered by Language Model

URL: http://arxiv.org/abs/2505.22116v3
Date: Tue, 22 Jul 2025 09:34:56 GMT
Title: Multimodal Forecasting of Sparse Intraoperative Hypotension Events Powered by Language Model
Authors: Jintao Zhang, Zirui Liu, Mingyue Cheng, Shilong Zhang, Tingyue Pan, Yitong zhou, Qi Liu, Yanhu Xie,
Abstract summary: Intraoperative hypotension (IOH) frequently occurs under general anesthesia and is strongly linked to adverse outcomes such as myocardial injury and increased mortality.<n>Despite its significance, IOH prediction is hindered by event sparsity and the challenge of integrating static and dynamic data across diverse patients.<n>We propose textbfIOHFuseLM, a multimodal language model framework to accurately identify and differentiate sparse hypotensive events.
Score: 14.69824092898171
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Intraoperative hypotension (IOH) frequently occurs under general anesthesia and is strongly linked to adverse outcomes such as myocardial injury and increased mortality. Despite its significance, IOH prediction is hindered by event sparsity and the challenge of integrating static and dynamic data across diverse patients. In this paper, we propose \textbf{IOHFuseLM}, a multimodal language model framework. To accurately identify and differentiate sparse hypotensive events, we leverage a two-stage training strategy. The first stage involves domain adaptive pretraining on IOH physiological time series augmented through diffusion methods, thereby enhancing the model sensitivity to patterns associated with hypotension. Subsequently, task fine-tuning is performed on the original clinical dataset to further enhance the ability to distinguish normotensive from hypotensive states. To enable multimodal fusion for each patient, we align structured clinical descriptions with the corresponding physiological time series at the token level. Such alignment enables the model to capture individualized temporal patterns alongside their corresponding clinical semantics. In addition, we convert static patient attributes into structured text to enrich personalized information. Experimental evaluations on two intraoperative datasets demonstrate that IOHFuseLM outperforms established baselines in accurately identifying IOH events, highlighting its applicability in clinical decision support scenarios. Our code is publicly available to promote reproducibility at https://github.com/zjt-gpu/IOHFuseLM.

Related papers

Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models [70.64969663547703]
AdaCVD is an adaptable CVD risk prediction framework built on large language models extensively fine-tuned on over half a million participants from the UK Biobank.<n>It addresses key clinical challenges across three dimensions: it flexibly incorporates comprehensive yet variable patient information; it seamlessly integrates both structured data and unstructured text; and it rapidly adapts to new patient populations using minimal additional data.
arXiv Detail & Related papers (2025-05-30T14:42:02Z)
Aneumo: A Large-Scale Multimodal Aneurysm Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks [16.753219355754222]
Intracranial aneurysms (IAs) are serious cerebrovascular lesions found in approximately 5% of the general population.<n>Their rupture may lead to high mortality.<n>Current methods for assessing IA risk focus on morphological and patient-specific factors, but the hemodynamic influences on IA development and rupture remain unclear.<n>This dataset aims to advance aneurysm research and promote data-driven approaches in biofluids, biomedical engineering, and clinical risk assessment.
arXiv Detail & Related papers (2025-05-19T09:32:09Z)
A Hybrid Multi-Factor Network with Dynamic Sequence Modeling for Early Warning of Intraoperative Hypotension [2.9833446079112473]
Intraoperative hypotension (IOH) prediction using past physiological signals is crucial.<n>We propose a Hybrid Multi-Factor network that formulates IOH prediction as a dynamic sequence forecasting task.<n> Experiments on both public and real-world clinical datasets show that HMF significantly outperforms competitive baselines.
arXiv Detail & Related papers (2024-09-17T10:46:41Z)
Deep State-Space Generative Model For Correlated Time-to-Event Predictions [54.3637600983898]
We propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events. Our method also uncovers meaningful insights about the latent correlations among mortality and different types of organ failures.
arXiv Detail & Related papers (2024-07-28T02:42:36Z)
Fusing Echocardiography Images and Medical Records for Continuous Patient Stratification [16.93115087698284]
We propose a method to learn the representation of a cardiovascular pathology with a difficult-to-characterize continuum, namely hypertension. Our method first projects each variable into its own representation space using modality-specific approaches. These standardized representations of multimodal data are then fed to a transformer encoder, which learns to merge them into a comprehensive representation of the patient through the task of predicting a clinical rating. We observe the major trends along this continuum on a cohort of 239 hypertensive patients, providing unprecedented details in the description of hypertension's impact on various cardiac function descriptors.
arXiv Detail & Related papers (2024-01-15T16:04:46Z)
SHAPE: A Sample-adaptive Hierarchical Prediction Network for Medication Recommendation [22.899946140205962]
We propose a novel Sample-adaptive Hierarchical medicAtion Prediction nEtwork, termed SHAPE, to tackle the challenges in the medication recommendation task. Specifically, we design a compact intra-visit set encoder to encode the relationship in the medical event for obtaining visit-level representation. To endow the model with the capability of modeling the variable visit length, we introduce a soft curriculum learning method to assign the difficulty of each sample automatically by the visit length.
arXiv Detail & Related papers (2023-09-09T08:28:04Z)
TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment. In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials. We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z)
A Conditional Flow Variational Autoencoder for Controllable Synthesis of Virtual Populations of Anatomy [76.20367415712867]
We propose a conditional variational autoencoder (cVAE) with normalising flows to boost the flexibility and complexity of the approximate posterior learnt. We demonstrate the performance of our conditional flow VAE using a data set of cardiac left ventricles acquired from 2360 patients.
arXiv Detail & Related papers (2023-06-26T13:23:52Z)
Individualized Dosing Dynamics via Neural Eigen Decomposition [51.62933814971523]
We introduce the Neural Eigen Differential Equation algorithm (NESDE) NESDE provides individualized modeling, tunable generalization to new treatment policies, and fast, continuous, closed-form prediction. We demonstrate the robustness of NESDE in both synthetic and real medical problems, and use the learned dynamics to publish simulated medical gym environments.
arXiv Detail & Related papers (2023-06-24T17:01:51Z)
Tissue Classification During Needle Insertion Using Self-Supervised Contrastive Learning and Optical Coherence Tomography [53.38589633687604]
We propose a deep neural network that classifies the tissues from the phase and intensity data of complex OCT signals acquired at the needle tip. We show that with 10% of the training set, our proposed pretraining strategy helps the model achieve an F1 score of 0.84 whereas the model achieves an F1 score of 0.60 without it.
arXiv Detail & Related papers (2023-04-26T14:11:04Z)
Epileptic Seizure Classification with Symmetric and Hybrid Bilinear Models [20.376912072606412]
This paper proposes a novel hybrid bilinear deep learning network with an application in the clinical procedures of epilepsy classification diagnosis. The accuracy of the diagnosis is also complicated by overlapping medical symptoms, varying levels of experience and inter-ob variability among clinical professions.
arXiv Detail & Related papers (2020-01-15T03:22:10Z)

This list is automatically generated from the titles and abstracts of the papers in this site.