Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning
- URL: http://arxiv.org/abs/2412.20741v1
- Date: Mon, 30 Dec 2024 06:33:39 GMT
- Title: Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning
- Authors: Tomasz Rutowski, Elizabeth Shriberg, Amir Harati, Yang Lu, Piotr Chlebek, Ricardo Oliveira,
- Abstract summary: We explore deep language models for detecting depression, anxiety, and their co-occurrence from conversational speech collected during 16k user interactions with an application.<n>We find that results for binary classification range from 0.86 to 0.79 AUC, depending on condition and co-occurrence.
- Score: 8.677511646372086
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Digital screening and monitoring applications can aid providers in the management of behavioral health conditions. We explore deep language models for detecting depression, anxiety, and their co-occurrence from conversational speech collected during 16k user interactions with an application. Labels come from PHQ-8 and GAD-7 results also collected by the application. We find that results for binary classification range from 0.86 to 0.79 AUC, depending on condition and co-occurrence. Best performance is achieved when a user has either both or neither condition, and we show that this result is not attributable to data skew. Finally, we find evidence suggesting that underlying word sequence cues may be more salient for depression than for anxiety.
Related papers
- Development and Evaluation of HopeBot: an LLM-based chatbot for structured and interactive PHQ-9 depression screening [48.355615275247786]
HopeBot administers the Patient Health Questionnaire-9 (PHQ-9) using retrieval-augmented generation and real-time clarification.<n>In a within-subject study, 132 adults in the United Kingdom and China completed both self-administered and chatbots versions.<n>Overall, 87.1% expressed willingness to reuse or recommend HopeBot.
arXiv Detail & Related papers (2025-07-08T13:41:22Z) - Cross-platform Prediction of Depression Treatment Outcome Using Location Sensory Data on Smartphones [55.992010576087424]
We explore using location sensory data collected passively on smartphones to predict treatment outcome.
Our results show that using location features and baseline self-reported questionnaire score can lead to F1 score up to 0.67, comparable to that obtained using periodic self-reported questionnaires.
arXiv Detail & Related papers (2025-03-10T22:00:07Z) - Large-scale digital phenotyping: identifying depression and anxiety indicators in a general UK population with over 10,000 participants [2.2909783327197393]
We conducted a cross-sectional analysis of data from 10,129 participants recruited from a UK-based general population.
Participants shared wearable (Fitbit) data and self-reported questionnaires on depression (PHQ-8), anxiety (GAD-7), and mood via a study app.
We observed significant associations between the severity of depression and anxiety with several factors, including mood, age, gender, BMI, sleep patterns, physical activity, and heart rate.
arXiv Detail & Related papers (2024-09-24T16:05:17Z) - Enhancing Depression Diagnosis with Chain-of-Thought Prompting [1.8532406942078647]
We theorize that using chain-of-thought (CoT) prompting to evaluate Patient Health Questionnaire-8 (PHQ-8) scores will improve the accuracy of the scores determined by AI models.
Our goal is to expand upon AI models' understanding of the intricacies of human conversation, allowing them to more effectively assess a patient's feelings and tone.
arXiv Detail & Related papers (2024-08-26T07:19:07Z) - We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation [41.09752906121257]
We present the Extended D-vlog dataset, encompassing a collection of 1, 261 YouTube vlogs.
We introduce a virtual agent serving as an initial contact for mental health patients, offering Cognitive Behavioral Therapy (CBT)-based responses.
Our Mistral model achieved impressive scores of 70.1% and 30.9% for distortion assessment and classification, along with a Bert score of 88.7%.
arXiv Detail & Related papers (2024-06-15T08:41:46Z) - DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews [39.08557916089242]
Recent studies have reported enhanced performance when incorporating interviewer's prompts into the model.
We discover that models using interviewer's prompts learn to focus on a specific region of the interviews, where questions about past experiences with mental health issues are asked.
We achieve a 0.90 F1 score by intentionally exploiting it, the highest result reported to date on this dataset using only textual information.
arXiv Detail & Related papers (2024-04-22T09:07:50Z) - The Relationship Between Speech Features Changes When You Get Depressed:
Feature Correlations for Improving Speed and Performance of Depression
Detection [69.88072583383085]
This work shows that depression changes the correlation between features extracted from speech.
Using such an insight can improve the training speed and performance of depression detectors based on SVMs and LSTMs.
arXiv Detail & Related papers (2023-07-06T09:54:35Z) - Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting [47.93070579578704]
We explore the potential of using actively-collected data to predict and forecast daily PHQ-2 scores on a newly-collected longitudinal dataset.
We obtain a best MAE of 1.417 for daily prediction of PHQ-2 scores, which specifically in the used dataset have a range of 0 to 12.
This illustrates the additive value that can be obtained by incorporating actively-collected data in a depression monitoring application.
arXiv Detail & Related papers (2022-05-06T17:47:05Z) - Learning Language and Multimodal Privacy-Preserving Markers of Mood from
Mobile Data [74.60507696087966]
Mental health conditions remain underdiagnosed even in countries with common access to advanced medical care.
One promising data source to help monitor human behavior is daily smartphone usage.
We study behavioral markers of daily mood using a recent dataset of mobile behaviors from adolescent populations at high risk of suicidal behaviors.
arXiv Detail & Related papers (2021-06-24T17:46:03Z) - NUVA: A Naming Utterance Verifier for Aphasia Treatment [49.114436579008476]
Assessment of speech performance using picture naming tasks is a key method for both diagnosis and monitoring of responses to treatment interventions by people with aphasia (PWA)
Here we present NUVA, an utterance verification system incorporating a deep learning element that classifies 'correct' versus'incorrect' naming attempts from aphasic stroke patients.
When tested on eight native British-English speaking PWA the system's performance accuracy ranged between 83.6% to 93.6%, with a 10-fold cross-validation mean of 89.5%.
arXiv Detail & Related papers (2021-02-10T13:00:29Z) - Deep Multi-task Learning for Depression Detection and Prediction in
Longitudinal Data [50.02223091927777]
Depression is among the most prevalent mental disorders, affecting millions of people of all ages globally.
Machine learning techniques have shown effective in enabling automated detection and prediction of depression for early intervention and treatment.
We introduce a novel deep multi-task recurrent neural network to tackle this challenge, in which depression classification is jointly optimized with two auxiliary tasks.
arXiv Detail & Related papers (2020-12-05T05:14:14Z) - Towards Preemptive Detection of Depression and Anxiety in Twitter [14.877991297466174]
Depression and anxiety are psychiatric disorders that are observed in many areas of everyday life.
We develop a dataset designed to foster research in depression and anxiety detection in Twitter.
arXiv Detail & Related papers (2020-11-10T17:17:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.