Leveraging Natural Language Processing to Augment Structured Social
Determinants of Health Data in the Electronic Health Record
- URL: http://arxiv.org/abs/2212.07538v2
- Date: Fri, 14 Apr 2023 17:04:35 GMT
- Title: Leveraging Natural Language Processing to Augment Structured Social
Determinants of Health Data in the Electronic Health Record
- Authors: Kevin Lybarger, Nicholas J Dobbins, Ritche Long, Angad Singh, Patrick
Wedgeworth, Ozlem Ozuner, Meliha Yetisgen
- Abstract summary: Social determinants of health (SDOH) impact health outcomes.
Clinical notes often contain more comprehensive SDOH information.
We developed a novel SDOH extractor using a deep learning entity and relation extraction architecture.
- Score: 1.7812428873698403
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Objective: Social determinants of health (SDOH) impact health outcomes and
are documented in the electronic health record (EHR) through structured data
and unstructured clinical notes. However, clinical notes often contain more
comprehensive SDOH information, detailing aspects such as status, severity, and
temporality. This work has two primary objectives: i) develop a natural
language processing (NLP) information extraction model to capture detailed SDOH
information and ii) evaluate the information gain achieved by applying the SDOH
extractor to clinical narratives and combining the extracted representations
with existing structured data.
Materials and Methods: We developed a novel SDOH extractor using a deep
learning entity and relation extraction architecture to characterize SDOH
across various dimensions. In an EHR case study, we applied the SDOH extractor
to a large clinical data set with 225,089 patients and 430,406 notes with
social history sections and compared the extracted SDOH information with
existing structured data.
Results: The SDOH extractor achieved 0.86 F1 on a withheld test set. In the
EHR case study, we found extracted SDOH information complements existing
structured data with 32% of homeless patients, 19% of current tobacco users,
and 10% of drug users only having these health risk factors documented in the
clinical narrative.
Conclusions: Utilizing EHR data to identify SDOH health risk factors and
social needs may improve patient care and outcomes. Semantic representations of
text-encoded SDOH information can augment existing structured data, and this
more comprehensive SDOH representation can assist health systems in identifying
and addressing these social needs.
Related papers
- De-identification of clinical free text using natural language
processing: A systematic review of current approaches [48.343430343213896]
Natural language processing has repeatedly demonstrated its feasibility in automating the de-identification process.
Our study aims to provide systematic evidence on how the de-identification of clinical free text has evolved in the last thirteen years.
arXiv Detail & Related papers (2023-11-28T13:20:41Z) - SDOH-NLI: a Dataset for Inferring Social Determinants of Health from
Clinical Notes [13.991819517682574]
Social and behavioral determinants of health (SDOH) play a significant role in shaping health outcomes.
Progress on using NLP methods for this task has been hindered by the lack of high-quality publicly available labeled data.
This paper introduces a new dataset, SDOH-NLI, that is based on publicly available notes and which we release publicly.
arXiv Detail & Related papers (2023-10-27T19:09:30Z) - TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic
Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment.
In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials.
We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z) - Evaluating the Impact of Social Determinants on Health Prediction in the
Intensive Care Unit [10.764842579064636]
Social determinants of health (SDOH) play a crucial role in a person's health and well-being.
Most risk prediction models based on electronic health records do not incorporate a comprehensive set of SDOH features.
Our work links a publicly available EHR database, MIMIC-IV, to well-documented SDOH features.
arXiv Detail & Related papers (2023-05-22T01:27:51Z) - Question-Answering System Extracts Information on Injection Drug Use
from Clinical Notes [4.537953996010351]
Injection drug use (IDU) is a dangerous health behavior that increases mortality and morbidity.
The only place IDU information can be indicated is unstructured free-text clinical notes.
We design and demonstrate a question-answering (QA) framework to extract information on IDU from clinical notes.
arXiv Detail & Related papers (2023-05-15T16:37:00Z) - Development and validation of a natural language processing algorithm to
pseudonymize documents in the context of a clinical data warehouse [53.797797404164946]
The study highlights the difficulties faced in sharing tools and resources in this domain.
We annotated a corpus of clinical documents according to 12 types of identifying entities.
We build a hybrid system, merging the results of a deep learning model as well as manual rules.
arXiv Detail & Related papers (2023-03-23T17:17:46Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - HINT: Hierarchical Interaction Network for Trial Outcome Prediction
Leveraging Web Data [56.53715632642495]
Clinical trials face uncertain outcomes due to issues with efficacy, safety, or problems with patient recruitment.
In this paper, we propose Hierarchical INteraction Network (HINT) for more general, clinical trial outcome predictions.
arXiv Detail & Related papers (2021-02-08T15:09:07Z) - Social determinants of health in the era of artificial intelligence with
electronic health records: A systematic review [12.944415086215708]
How to make the best of social determinant of health (SDOH) information from electronic health records ( EHRs) is yet to be studied.
A total of 1250 articles were retrieved from the literature between 2010 and 2020, and 74 papers were included in this review.
We conclude this paper with discussion on the current trends, challenges, and future directions on using SDOH from EHRs.
arXiv Detail & Related papers (2021-01-22T09:03:39Z) - Annotating Social Determinants of Health Using Active Learning, and
Characterizing Determinants Using Neural Event Extraction [11.845850292404768]
Social determinants of health (SDOH) affect health outcomes, and knowledge of SDOH can inform clinical decision-making.
This work presents a new corpus with SDOH annotations, a novel active learning framework, and the first extraction results on the new corpus.
arXiv Detail & Related papers (2020-04-11T16:19:02Z) - DeepEnroll: Patient-Trial Matching with Deep Embedding and Entailment
Prediction [67.91606509226132]
Clinical trials are essential for drug development but often suffer from expensive, inaccurate and insufficient patient recruitment.
DeepEnroll is a cross-modal inference learning model to jointly encode enrollment criteria (tabular data) into a shared latent space for matching inference.
arXiv Detail & Related papers (2020-01-22T17:51:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.