Co-occurrence of medical conditions: Exposing patterns through
probabilistic topic modeling of SNOMED codes
- URL: http://arxiv.org/abs/2109.09199v1
- Date: Sun, 19 Sep 2021 19:34:21 GMT
- Title: Co-occurrence of medical conditions: Exposing patterns through
probabilistic topic modeling of SNOMED codes
- Authors: Moumita Bhattacharya, Claudine Jurkovitz, Hagit Shatkay
- Abstract summary: Co-occurring conditions are especially prevalent among individuals suffering from kidney disease.
This study aims to identify and characterize patterns of co-occurring medical conditions in patients employing a probabilistic framework.
- Score: 0.3867363075280544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Patients associated with multiple co-occurring health conditions often face
aggravated complications and less favorable outcomes. Co-occurring conditions
are especially prevalent among individuals suffering from kidney disease, an
increasingly widespread condition affecting 13% of the general population in
the US. This study aims to identify and characterize patterns of co-occurring
medical conditions in patients employing a probabilistic framework.
Specifically, we apply topic modeling in a non-traditional way to find
associations across SNOMEDCT codes assigned and recorded in the EHRs of>13,000
patients diagnosed with kidney disease. Unlike most prior work on topic
modeling, we apply the method to codes rather than to natural language.
Moreover, we quantitatively evaluate the topics, assessing their tightness and
distinctiveness, and also assess the medical validity of our results. Our
experiments show that each topic is succinctly characterized by a few highly
probable and unique disease codes, indicating that the topics are tight.
Furthermore, inter-topic distance between each pair of topics is typically
high, illustrating distinctiveness. Last, most coded conditions grouped
together within a topic, are indeed reported to co-occur in the medical
literature. Notably, our results uncover a few indirect associations among
conditions that have hitherto not been reported as correlated in the medical
literature.
Related papers
- Learning Disease Progression Models That Capture Health Disparities [2.5576848352640087]
We develop an interpretable Bayesian disease progression model that captures three key health disparities.
We show theoretically and empirically that failing to account for disparities produces biased estimates of severity.
Our model can identify groups that face each type of health disparity, and that accounting for these disparities meaningfully shifts which patients are considered high-risk.
arXiv Detail & Related papers (2024-12-20T23:56:37Z) - DisEmbed: Transforming Disease Understanding through Embeddings [0.0]
DisEmbed is a disease-focused embedding model.
DisEmbed is trained on a synthetic dataset curated to include disease descriptions, symptoms, and disease-related Q&A pairs.
arXiv Detail & Related papers (2024-12-16T12:04:22Z) - Deep State-Space Generative Model For Correlated Time-to-Event Predictions [54.3637600983898]
We propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events.
Our method also uncovers meaningful insights about the latent correlations among mortality and different types of organ failures.
arXiv Detail & Related papers (2024-07-28T02:42:36Z) - Estimating the severity of dental and oral problems via sentiment
classification over clinical reports [0.8287206589886879]
Analyzing authors' sentiments in texts can be practical and useful in various fields, including medicine and dentistry.
Deep learning model based on Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) network architecture, known as CNN-LSTM, was developed to detect severity level of patient's problem.
arXiv Detail & Related papers (2024-01-17T14:33:13Z) - Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge [59.323306639144526]
Many real-world image recognition problems, such as diagnostic medical imaging exams, are emerging.
Diagnose is both a long-tailed and multi-label problem, as patients often present with multiple findings.
We synthesize common themes, providing recommendations for long-tailed, multi-label medical image classification.
arXiv Detail & Related papers (2023-10-24T18:26:22Z) - Unifying Relational Sentence Generation and Retrieval for Medical Image
Report Composition [142.42920413017163]
Current methods often generate the most common sentences due to dataset bias for individual case.
We propose a novel framework that unifies template retrieval and sentence generation to handle both common and rare abnormality.
arXiv Detail & Related papers (2021-01-09T04:33:27Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z) - Mining Misdiagnosis Patterns from Biomedical Literature [8.534433954411409]
We find that the most commonly misdiagnosed diseases were often misdiagnosed as many different diseases.
While a misdiagnosis relationship may generally exist, the relationship was often found to be one-sided.
arXiv Detail & Related papers (2020-06-24T13:34:43Z) - A General Framework for Survival Analysis and Multi-State Modelling [70.31153478610229]
We use neural ordinary differential equations as a flexible and general method for estimating multi-state survival models.
We show that our model exhibits state-of-the-art performance on popular survival data sets and demonstrate its efficacy in a multi-state setting.
arXiv Detail & Related papers (2020-06-08T19:24:54Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.