WellFactor: Patient Profiling using Integrative Embedding of Healthcare
Data
- URL: http://arxiv.org/abs/2312.14129v1
- Date: Thu, 21 Dec 2023 18:49:22 GMT
- Title: WellFactor: Patient Profiling using Integrative Embedding of Healthcare
Data
- Authors: Dongjin Choi, Andy Xiang, Ozgur Ozturk, Deep Shrestha, Barry Drake,
Hamid Haidarian, Faizan Javed, Haesun Park
- Abstract summary: WellFactor is a method that derives patient profiles by integrating information from different sources.
WellFactor is optimized to handle the sparsity that is often inherent in healthcare data.
It produces better results compared to other existing methods in classification performance.
- Score: 2.370296071691569
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the rapidly evolving healthcare industry, platforms now have access to not
only traditional medical records, but also diverse data sets encompassing
various patient interactions, such as those from healthcare web portals. To
address this rich diversity of data, we introduce WellFactor: a method that
derives patient profiles by integrating information from these sources. Central
to our approach is the utilization of constrained low-rank approximation.
WellFactor is optimized to handle the sparsity that is often inherent in
healthcare data. Moreover, by incorporating task-specific label information,
our method refines the embedding results, offering a more informed perspective
on patients. One important feature of WellFactor is its ability to compute
embeddings for new, previously unobserved patient data instantaneously,
eliminating the need to revisit the entire data set or recomputing the
embedding. Comprehensive evaluations on real-world healthcare data demonstrate
WellFactor's effectiveness. It produces better results compared to other
existing methods in classification performance, yields meaningful clustering of
patients, and delivers consistent results in patient similarity searches and
predictions.
Related papers
- Cross-patient Seizure Onset Zone Classification by Patient-Dependent Weight [7.773508953474537]
We propose a method to fine-tune a pretrained model using patient-specific weights for every new test patient to improve diagnostic performance.<n>Results show improved classification accuracy for every test patient, with an average improvement of more than 10%.
arXiv Detail & Related papers (2025-08-05T16:50:50Z) - Gaussian Random Fields as an Abstract Representation of Patient Metadata for Multimodal Medical Image Segmentation [6.531186135660947]
Chronic wounds can have devastating consequences for the patient.
Deep learning methods for the detection and monitoring of such wounds have the potential to reduce the impact to both patient and clinician.
We present a novel multimodal segmentation method which allows for the introduction of patient metadata into the training workflow.
arXiv Detail & Related papers (2025-03-07T08:09:59Z) - Enhancing Performance for Highly Imbalanced Medical Data via Data Regularization in a Federated Learning Setting [6.22153888560487]
The goal of the proposed method is to enhance model performance for cardiovascular disease prediction.
The method is evaluated across four datasets for cardiovascular disease prediction, which are scattered across different clients.
arXiv Detail & Related papers (2024-05-30T19:15:38Z) - Medical Federated Model with Mixture of Personalized and Sharing
Components [31.068735334318088]
We propose a new personalized framework of federated learning to handle the problem.
It successfully yields personalized models based on awareness of similarity between local data.
Also, we propose an effective method to reduce the computational cost, which improves computation efficiency significantly.
arXiv Detail & Related papers (2023-06-26T07:50:32Z) - SPeC: A Soft Prompt-Based Calibration on Performance Variability of
Large Language Model in Clinical Notes Summarization [50.01382938451978]
We introduce a model-agnostic pipeline that employs soft prompts to diminish variance while preserving the advantages of prompt-based summarization.
Experimental findings indicate that our method not only bolsters performance but also effectively curbs variance for various language models.
arXiv Detail & Related papers (2023-03-23T04:47:46Z) - Reliable and Trustworthy Machine Learning for Health Using Dataset Shift
Detection [7.263558963357268]
Unpredictable ML model behavior on unseen data, especially in the health domain, raises serious concerns about its safety.
We show that Mahalanobis distance- and Gram matrices-based out-of-distribution detection methods are able to detect out-of-distribution data with high accuracy.
We then translate the out-of-distribution score into a human interpretable CONFIDENCE SCORE to investigate its effect on the users' interaction with health ML applications.
arXiv Detail & Related papers (2021-10-26T20:49:01Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z) - Using Deep Learning and Explainable Artificial Intelligence in Patients'
Choices of Hospital Levels [10.985001960872264]
This study used nationwide insurance data, accumulated possible features discussed in existing literature, and used a deep neural network to predict the patients choices of hospital levels.
The results showed that the model was able to predict with high area under the receiver operating characteristics curve (AUC) (0.90), accuracy (0.90), sensitivity (0.94), and specificity (0.97) with highly imbalanced label.
arXiv Detail & Related papers (2020-06-24T02:15:15Z) - Temporal Phenotyping using Deep Predictive Clustering of Disease
Progression [97.88605060346455]
We develop a deep learning approach for clustering time-series data, where each cluster comprises patients who share similar future outcomes of interest.
Experiments on two real-world datasets show that our model achieves superior clustering performance over state-of-the-art benchmarks.
arXiv Detail & Related papers (2020-06-15T20:48:43Z) - Causal Feature Selection for Algorithmic Fairness [61.767399505764736]
We consider fairness in the integration component of data management.
We propose an approach to identify a sub-collection of features that ensure the fairness of the dataset.
arXiv Detail & Related papers (2020-06-10T20:20:10Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.