Neural Language Models with Distant Supervision to Identify Major
Depressive Disorder from Clinical Notes
- URL: http://arxiv.org/abs/2104.09644v1
- Date: Mon, 19 Apr 2021 21:11:41 GMT
- Title: Neural Language Models with Distant Supervision to Identify Major
Depressive Disorder from Clinical Notes
- Authors: Bhavani Singh Agnikula Kshatriya, Nicolas A Nunez, Manuel Gardea-
Resendez, Euijung Ryu, Brandon J Coombes, Sunyang Fu, Mark A Frye, Joanna M
Biernacka, Yanshan Wang
- Abstract summary: Major depressive disorder (MDD) is a prevalent psychiatric disorder that is associated with significant healthcare burden worldwide.
Recent advancements in neural language models, such as Bidirectional Representations for Transformers (BERT) model, resulted in state-of-the-art neural language models.
We propose to leverage the neural language models in a distant supervision paradigm to identify MDD phenotypes from clinical notes.
- Score: 2.1060613825447407
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Major depressive disorder (MDD) is a prevalent psychiatric disorder that is
associated with significant healthcare burden worldwide. Phenotyping of MDD can
help early diagnosis and consequently may have significant advantages in
patient management. In prior research MDD phenotypes have been extracted from
structured Electronic Health Records (EHR) or using Electroencephalographic
(EEG) data with traditional machine learning models to predict MDD phenotypes.
However, MDD phenotypic information is also documented in free-text EHR data,
such as clinical notes. While clinical notes may provide more accurate
phenotyping information, natural language processing (NLP) algorithms must be
developed to abstract such information. Recent advancements in NLP resulted in
state-of-the-art neural language models, such as Bidirectional Encoder
Representations for Transformers (BERT) model, which is a transformer-based
model that can be pre-trained from a corpus of unsupervised text data and then
fine-tuned on specific tasks. However, such neural language models have been
underutilized in clinical NLP tasks due to the lack of large training datasets.
In the literature, researchers have utilized the distant supervision paradigm
to train machine learning models on clinical text classification tasks to
mitigate the issue of lacking annotated training data. It is still unknown
whether the paradigm is effective for neural language models. In this paper, we
propose to leverage the neural language models in a distant supervision
paradigm to identify MDD phenotypes from clinical notes. The experimental
results indicate that our proposed approach is effective in identifying MDD
phenotypes and that the Bio- Clinical BERT, a specific BERT model for clinical
data, achieved the best performance in comparison with conventional machine
learning models.
Related papers
- EEG-Language Modeling for Pathology Detection [0.0]
This study pioneers EEG-language models trained on clinical reports and 15000 EEGs.
Our results indicate that models learn richer representations from being exposed to a variety of report segments.
representations of EEG-language models can significantly improve pathology detection compared to those of EEG-only models.
arXiv Detail & Related papers (2024-09-02T10:03:03Z) - Interpretable Medical Diagnostics with Structured Data Extraction by
Large Language Models [59.89454513692417]
Tabular data is often hidden in text, particularly in medical diagnostic reports.
We propose a novel, simple, and effective methodology for extracting structured tabular data from textual medical reports, called TEMED-LLM.
We demonstrate that our approach significantly outperforms state-of-the-art text classification models in medical diagnostics.
arXiv Detail & Related papers (2023-06-08T09:12:28Z) - Mitigating Data Scarcity for Large Language Models [7.259279261659759]
In recent years, pretrained neural language models (PNLMs) have taken the field of natural language processing by storm.
Data scarcity are commonly found in specialized domains, such as medical, or in low-resource languages that are underexplored by AI research.
In this dissertation, we focus on mitigating data scarcity using data augmentation and neural ensemble learning techniques.
arXiv Detail & Related papers (2023-02-03T15:17:53Z) - Clinical Deterioration Prediction in Brazilian Hospitals Based on
Artificial Neural Networks and Tree Decision Models [56.93322937189087]
An extremely boosted neural network (XBNet) is used to predict clinical deterioration (CD)
The XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
arXiv Detail & Related papers (2022-12-17T23:29:14Z) - DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language
Processing [5.022185333260402]
Diagnostic Reasoning Benchmarks, DR.BENCH, is a new benchmark for developing and evaluating cNLP models with clinical diagnostic reasoning ability.
DR.BENCH is the first clinical suite of tasks designed to be a natural language generation framework to evaluate pre-trained language models.
arXiv Detail & Related papers (2022-09-29T16:05:53Z) - Neural Language Models are not Born Equal to Fit Brain Data, but
Training Helps [75.84770193489639]
We examine the impact of test loss, training corpus and model architecture on the prediction of functional Magnetic Resonance Imaging timecourses of participants listening to an audiobook.
We find that untrained versions of each model already explain significant amount of signal in the brain by capturing similarity in brain responses across identical words.
We suggest good practices for future studies aiming at explaining the human language system using neural language models.
arXiv Detail & Related papers (2022-07-07T15:37:17Z) - EINNs: Epidemiologically-Informed Neural Networks [75.34199997857341]
We introduce a new class of physics-informed neural networks-EINN-crafted for epidemic forecasting.
We investigate how to leverage both the theoretical flexibility provided by mechanistic models as well as the data-driven expressability afforded by AI models.
arXiv Detail & Related papers (2022-02-21T18:59:03Z) - Rationale production to support clinical decision-making [31.66739991129112]
We apply InfoCal to the task of predicting hospital readmission using hospital discharge notes.
We find each presented model with selected interpretability or feature importance methods yield varying results.
arXiv Detail & Related papers (2021-11-15T09:02:10Z) - Hybrid deep learning methods for phenotype prediction from clinical
notes [4.866431869728018]
This paper proposes a novel hybrid model for automatically extracting patient phenotypes using natural language processing and deep learning models.
The proposed hybrid model is based on a neural bidirectional sequence model (BiLSTM or BiGRU) and a Convolutional Neural Network (CNN) for identifying patient's phenotypes in discharge reports.
arXiv Detail & Related papers (2021-08-16T05:57:28Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.