Mining Social Determinants of Health for Heart Failure Patient 30-Day Readmission via Large Language Model
- URL: http://arxiv.org/abs/2502.12158v1
- Date: Thu, 23 Jan 2025 23:05:53 GMT
- Title: Mining Social Determinants of Health for Heart Failure Patient 30-Day Readmission via Large Language Model
- Authors: Mingchen Shao, Youjeong Kang, Xiao Hu, Hyunjung Gloria Kwak, Carl Yang, Jiaying Lu,
- Abstract summary: Heart Failure (HF) affects millions of Americans and leads to high readmission rates.<n>Social Determinants of Health (SDOH) such as socioeconomic status and housing stability play critical roles in health outcomes.<n>This study leverages advanced large language models (LLMs) to extract SDOHs from clinical text and uses logistic regression to analyze their association with HF readmissions.
- Score: 20.890596696992727
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Heart Failure (HF) affects millions of Americans and leads to high readmission rates, posing significant healthcare challenges. While Social Determinants of Health (SDOH) such as socioeconomic status and housing stability play critical roles in health outcomes, they are often underrepresented in structured EHRs and hidden in unstructured clinical notes. This study leverages advanced large language models (LLMs) to extract SDOHs from clinical text and uses logistic regression to analyze their association with HF readmissions. By identifying key SDOHs (e.g. tobacco usage, limited transportation) linked to readmission risk, this work also offers actionable insights for reducing readmissions and improving patient care.
Related papers
- Spurious Correlations and Beyond: Understanding and Mitigating Shortcut Learning in SDOH Extraction with Large Language Models [3.3408746880885003]
Large language models (LLMs) have shown promise, but they may rely on superficial cues leading to spurious predictions.<n>We demonstrate that mentions of alcohol or smoking can falsely induce models to predict current/past drug use where none is present.<n>We evaluate mitigation strategies - such as prompt engineering and chain-of-thought reasoning - to reduce these false positives.
arXiv Detail & Related papers (2025-05-30T18:11:33Z) - Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models [70.64969663547703]
AdaCVD is an adaptable CVD risk prediction framework built on large language models extensively fine-tuned on over half a million participants from the UK Biobank.<n>It addresses key clinical challenges across three dimensions: it flexibly incorporates comprehensive yet variable patient information; it seamlessly integrates both structured data and unstructured text; and it rapidly adapts to new patient populations using minimal additional data.
arXiv Detail & Related papers (2025-05-30T14:42:02Z) - Patients Speak, AI Listens: LLM-based Analysis of Online Reviews Uncovers Key Drivers for Urgent Care Satisfaction [26.391629320279904]
This study collects Google Maps reviews across the DMV and Florida areas.
We first analyze the geospatial patterns of various aspects, including population density, median income, GINI Index, rent-to-income ratio, household below poverty rate, no insurance rate, and unemployment rate.
arXiv Detail & Related papers (2025-03-26T20:45:01Z) - Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare [47.23120247002356]
Social determinants of health (SDoH) play a crucial role in patient health outcomes, yet their integration into biomedical knowledge graphs remains underexplored.<n>This study addresses this gap by constructing an SDoH-enriched knowledge graph using the MIMIC-III dataset and PrimeKG.
arXiv Detail & Related papers (2024-11-29T20:35:01Z) - FedCVD: The First Real-World Federated Learning Benchmark on Cardiovascular Disease Data [52.55123685248105]
Cardiovascular diseases (CVDs) are currently the leading cause of death worldwide, highlighting the critical need for early diagnosis and treatment.
Machine learning (ML) methods can help diagnose CVDs early, but their performance relies on access to substantial data with high quality.
This paper presents the first real-world FL benchmark for cardiovascular disease detection, named FedCVD.
arXiv Detail & Related papers (2024-10-28T02:24:01Z) - SDOH-NLI: a Dataset for Inferring Social Determinants of Health from
Clinical Notes [13.991819517682574]
Social and behavioral determinants of health (SDOH) play a significant role in shaping health outcomes.
Progress on using NLP methods for this task has been hindered by the lack of high-quality publicly available labeled data.
This paper introduces a new dataset, SDOH-NLI, that is based on publicly available notes and which we release publicly.
arXiv Detail & Related papers (2023-10-27T19:09:30Z) - Interpretable Survival Analysis for Heart Failure Risk Prediction [50.64739292687567]
We propose a novel survival analysis pipeline that is both interpretable and competitive with state-of-the-art survival models.
Our pipeline achieves state-of-the-art performance and provides interesting and novel insights about risk factors for heart failure.
arXiv Detail & Related papers (2023-10-24T02:56:05Z) - Evaluating the Impact of Social Determinants on Health Prediction in the
Intensive Care Unit [10.764842579064636]
Social determinants of health (SDOH) play a crucial role in a person's health and well-being.
Most risk prediction models based on electronic health records do not incorporate a comprehensive set of SDOH features.
Our work links a publicly available EHR database, MIMIC-IV, to well-documented SDOH features.
arXiv Detail & Related papers (2023-05-22T01:27:51Z) - SPeC: A Soft Prompt-Based Calibration on Performance Variability of
Large Language Model in Clinical Notes Summarization [50.01382938451978]
We introduce a model-agnostic pipeline that employs soft prompts to diminish variance while preserving the advantages of prompt-based summarization.
Experimental findings indicate that our method not only bolsters performance but also effectively curbs variance for various language models.
arXiv Detail & Related papers (2023-03-23T04:47:46Z) - Systematic Design and Evaluation of Social Determinants of Health
Ontology (SDoHO) [19.90090257979115]
Social determinants of health (SDoH) have a significant impact on health outcomes and well-being.
We propose an SDoH ontology (SDoHO) which represents fundamental SDoH factors and their relationships in a standardized and measurable way.
arXiv Detail & Related papers (2022-12-04T22:23:30Z) - Predicting Patient Readmission Risk from Medical Text via Knowledge
Graph Enhanced Multiview Graph Convolution [67.72545656557858]
We propose a new method that uses medical text of Electronic Health Records for prediction.
We represent discharge summaries of patients with multiview graphs enhanced by an external knowledge graph.
Experimental results prove the effectiveness of our method, yielding state-of-the-art performance.
arXiv Detail & Related papers (2021-12-19T01:45:57Z) - Assessing the Severity of Health States based on Social Media Posts [62.52087340582502]
We propose a multiview learning framework that models both the textual content as well as contextual-information to assess the severity of the user's health state.
The diverse NLU views demonstrate its effectiveness on both the tasks and as well as on the individual disease to assess a user's health.
arXiv Detail & Related papers (2020-09-21T03:45:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.