Machine Learning for Multimodal Electronic Health Records-based
Research: Challenges and Perspectives
- URL: http://arxiv.org/abs/2111.04898v1
- Date: Tue, 9 Nov 2021 01:19:11 GMT
- Title: Machine Learning for Multimodal Electronic Health Records-based
Research: Challenges and Perspectives
- Authors: Ziyi Liu, Jiaqi Zhang, Yongshuai Hou, Xinran Zhang, Ge Li, Yang Xiang
- Abstract summary: Electronic Health Records contain rich information of patients' health history.
relying on structured data only might be insufficient in reflecting patients' comprehensive information.
An increasing number of studies seek to obtain more accurate results by incorporating unstructured free-text data as well.
- Score: 22.230972071321357
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Background: Electronic Health Records (EHRs) contain rich information of
patients' health history, which usually include both structured and
unstructured data. There have been many studies focusing on distilling valuable
information from structured data, such as disease codes, laboratory test
results, and treatments. However, relying on structured data only might be
insufficient in reflecting patients' comprehensive information and such data
may occasionally contain erroneous records. Objective: With the recent advances
of machine learning (ML) and deep learning (DL) techniques, an increasing
number of studies seek to obtain more accurate results by incorporating
unstructured free-text data as well. This paper reviews studies that use
multimodal data, i.e. a combination of structured and unstructured data, from
EHRs as input for conventional ML or DL models to address the targeted tasks.
Materials and Methods: We searched in the Institute of Electrical and
Electronics Engineers (IEEE) Digital Library, PubMed, and Association for
Computing Machinery (ACM) Digital Library for articles related to ML-based
multimodal EHR studies. Results and Discussion: With the final 94 included
studies, we focus on how data from different modalities were combined and
interacted using conventional ML and DL techniques, and how these algorithms
were applied in EHR-related tasks. Further, we investigate the advantages and
limitations of these fusion methods and indicate future directions for ML-based
multimodal EHR research.
Related papers
- When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications? [8.89829757177796]
We examine the effectiveness of vector representations from last hidden states of Large Language Models for medical diagnostics and prognostics.
We focus on instruction-tuned LLMs in a zero-shot setting to represent abnormal physiological data and evaluate their utilities as feature extractors.
Although findings suggest the raw data features still prevails in medical ML tasks, zero-shot LLM embeddings demonstrate competitive results.
arXiv Detail & Related papers (2024-08-15T03:56:40Z) - Machine Learning Techniques for MRI Data Processing at Expanding Scale [0.5221459608786241]
Imaging sites around the world generate growing amounts of medical scan data with ever more versatile and affordable technology.
These large datasets encode substantial information about human health and hold considerable potential for machine learning training and analysis.
This chapter examines ongoing large-scale studies and the challenge of distribution shifts between them.
arXiv Detail & Related papers (2024-04-22T16:38:41Z) - Interpretable Medical Diagnostics with Structured Data Extraction by
Large Language Models [59.89454513692417]
Tabular data is often hidden in text, particularly in medical diagnostic reports.
We propose a novel, simple, and effective methodology for extracting structured tabular data from textual medical reports, called TEMED-LLM.
We demonstrate that our approach significantly outperforms state-of-the-art text classification models in medical diagnostics.
arXiv Detail & Related papers (2023-06-08T09:12:28Z) - Incomplete Multimodal Learning for Complex Brain Disorders Prediction [65.95783479249745]
We propose a new incomplete multimodal data integration approach that employs transformers and generative adversarial networks.
We apply our new method to predict cognitive degeneration and disease outcomes using the multimodal imaging genetic data from Alzheimer's Disease Neuroimaging Initiative cohort.
arXiv Detail & Related papers (2023-05-25T16:29:16Z) - SEMPAI: a Self-Enhancing Multi-Photon Artificial Intelligence for
prior-informed assessment of muscle function and pathology [48.54269377408277]
We introduce the Self-Enhancing Multi-Photon Artificial Intelligence (SEMPAI), that integrates hypothesis-driven priors in a data-driven Deep Learning approach.
SEMPAI performs joint learning of several tasks to enable prediction for small datasets.
SEMPAI outperforms state-of-the-art biomarkers in six of seven predictive tasks, including those with scarce data.
arXiv Detail & Related papers (2022-10-28T17:03:04Z) - Artificial Intelligence-Based Methods for Fusion of Electronic Health
Records and Imaging Data [0.9749560288448113]
We focus on synthesizing and analyzing the literature that uses AI techniques to fuse multimodal medical data for different clinical applications.
We present a comprehensive analysis of the various fusion strategies, the diseases and clinical outcomes for which multimodal fusion was used, and the available multimodal medical datasets.
arXiv Detail & Related papers (2022-10-23T07:13:37Z) - Multimodal Machine Learning in Precision Health [10.068890037410316]
This review was conducted to summarize this field and identify topics ripe for future research.
We used a combination of content analysis and literature searches to establish search strings and databases of PubMed, Google Scholar, and IEEEXplore from 2011 to 2021.
The most common form of information fusion was early fusion. Notably, there was an improvement in predictive performance performing heterogeneous data fusion.
arXiv Detail & Related papers (2022-04-10T21:56:07Z) - Deep learning for temporal data representation in electronic health
records: A systematic review of challenges and methodologies [11.584972135829199]
Temporal electronic health records can be a wealth of information for secondary uses, such as clinical events prediction or chronic disease management.
We sought articles that reported deep learning methodologies on temporal data representation in structured EHR data from January 1, 2010, to August 30, 2020.
Four major challenges were identified, including data irregularity, data heterogeneity, data sparsity, and model opacity.
arXiv Detail & Related papers (2021-07-21T09:00:40Z) - Uncovering the structure of clinical EEG signals with self-supervised
learning [64.4754948595556]
Supervised learning paradigms are often limited by the amount of labeled data that is available.
This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG)
By extracting information from unlabeled data, it might be possible to reach competitive performance with deep neural networks.
arXiv Detail & Related papers (2020-07-31T14:34:47Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z) - Opportunities and Challenges of Deep Learning Methods for
Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare.
Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals.
This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.