Leveraging Large Language Models for Analyzing Blood Pressure Variations
Across Biological Sex from Scientific Literature
- URL: http://arxiv.org/abs/2402.01826v1
- Date: Fri, 2 Feb 2024 18:15:51 GMT
- Title: Leveraging Large Language Models for Analyzing Blood Pressure Variations
Across Biological Sex from Scientific Literature
- Authors: Yuting Guo, Seyedeh Somayyeh Mousavi, Reza Sameni, Abeed Sarker
- Abstract summary: Hypertension, defined as blood pressure (BP) that is above normal, holds paramount significance in the realm of public health.
Existing BP measurement technologies and standards might be biased because they do not consider clinical outcomes, comorbidities, or demographic factors.
We employed GPT-35-turbo, a large language model (LLM), to automatically extract the mean and standard deviation values of BP for both males and females from a dataset comprising 25 million abstracts sourced from PubMed.
- Score: 3.731841514150172
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Hypertension, defined as blood pressure (BP) that is above normal, holds
paramount significance in the realm of public health, as it serves as a
critical precursor to various cardiovascular diseases (CVDs) and significantly
contributes to elevated mortality rates worldwide. However, many existing BP
measurement technologies and standards might be biased because they do not
consider clinical outcomes, comorbidities, or demographic factors, making them
inconclusive for diagnostic purposes. There is limited data-driven research
focused on studying the variance in BP measurements across these variables. In
this work, we employed GPT-35-turbo, a large language model (LLM), to
automatically extract the mean and standard deviation values of BP for both
males and females from a dataset comprising 25 million abstracts sourced from
PubMed. 993 article abstracts met our predefined inclusion criteria (i.e.,
presence of references to blood pressure, units of blood pressure such as mmHg,
and mention of biological sex). Based on the automatically-extracted
information from these articles, we conducted an analysis of the variations of
BP values across biological sex. Our results showed the viability of utilizing
LLMs to study the BP variations across different demographic factors.
Related papers
- A Multi-scenario Attention-based Generative Model for Personalized Blood Pressure Time Series Forecasting [6.311504297463515]
Continuous blood pressure (BP) monitoring is essential for timely diagnosis and intervention in critical care settings.
In this work, we propose a personalized BP forecasting model mainly using electrocardiogram (ECG) and photoplethysmogram ( photoplethysmogram) signals.
Experiments are conducted on datasets collected from three diverse scenarios with BP measurements from 60 subjects total.
arXiv Detail & Related papers (2024-09-07T04:24:15Z) - Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments [67.80453452949303]
Estimating the conditional average treatment effect (CATE) from observational data is relevant for many applications such as personalized medicine.
Here, we focus on the widespread setting where the observational data come from multiple environments.
We propose different model-agnostic learners (so-called meta-learners) to estimate the bounds that can be used in combination with arbitrary machine learning models.
arXiv Detail & Related papers (2024-06-04T16:31:43Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Learning from Two Decades of Blood Pressure Data: Demography-Specific Patterns Across 75 Million Patient Encounters [3.416097871805964]
Hypertension is a global health concern with an increasing prevalence, underscoring the need for effective monitoring and analysis of blood pressure dynamics.
We analyzed a substantial BP dataset comprising 75,636,128 records from 2,054,462 unique patients collected between 2000 and 2022 at Emory Healthcare in Georgia, USA.
arXiv Detail & Related papers (2024-02-02T17:51:49Z) - Simulation-based Inference for Cardiovascular Models [57.92535897767929]
We use simulation-based inference to solve the inverse problem of mapping waveforms back to plausible physiological parameters.
We perform an in-silico uncertainty analysis of five biomarkers of clinical interest.
We study the gap between in-vivo and in-silico with the MIMIC-III waveform database.
arXiv Detail & Related papers (2023-07-26T02:34:57Z) - A Survey on Blood Pressure Measurement Technologies: Addressing
Potential Sources of Bias [4.0527913281804135]
Blood pressure (BP) monitoring plays a crucial role in the prevention, diagnosis, treatment, and management of cardiovascular diseases.
Recent guidelines advocate for regular BP monitoring as part of regular clinical visits or even at home.
BP measurements can demonstrate substantial biases and variances due to factors such as measurement and device errors, demographics, and body habitus.
arXiv Detail & Related papers (2023-06-14T11:51:11Z) - A Shallow U-Net Architecture for Reliably Predicting Blood Pressure (BP)
from Photoplethysmogram (PPG) and Electrocardiogram (ECG) Signals [1.1695966610359496]
Most existing methods used in the hospitals for continuous monitoring of Blood Pressure (BP) are invasive.
In this study, we explored the applicability of autoencoders in predicting BP from non-invasively collectible signals such as Photoplethysmogram ( PPG) and ECG signals.
It was found that a very shallow, one-dimensional autoencoder can extract the relevant features to predict the SBP and DBP with the state-of-the-art performance on a very large dataset.
arXiv Detail & Related papers (2021-11-12T19:34:20Z) - Label scarcity in biomedicine: Data-rich latent factor discovery
enhances phenotype prediction [102.23901690661916]
Low-dimensional embedding spaces can be derived from the UK Biobank population dataset to enhance data-scarce prediction of health indicators, lifestyle and demographic characteristics.
Performances gains from semisupervison approaches will probably become an important ingredient for various medical data science applications.
arXiv Detail & Related papers (2021-10-12T16:25:50Z) - Ambulatory blood pressure monitoring versus office blood pressure
measurement: Are there sex differences? [55.41644538483948]
Office Blood Pressure Measurement (OBP) is a technique performed in-office with the sphygmomanometer, while Ambulatory Blood Pressure Monitoring (ABPM) is a technique that measures blood pressure during 24h.
The aim of this study is to examine the possible influence of sex on the discrepancies between OBP and ABPM in 872 subjects with known or suspected hypertension.
arXiv Detail & Related papers (2021-06-04T10:09:44Z) - Personalized pathology test for Cardio-vascular disease: Approximate
Bayesian computation with discriminative summary statistics learning [48.7576911714538]
We propose a platelet deposition model and an inferential scheme to estimate the biologically meaningful parameters using approximate computation.
This work opens up an unprecedented opportunity of personalized pathology test for CVD detection and medical treatment.
arXiv Detail & Related papers (2020-10-13T15:20:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.