Generalized Dilated CNN Models for Depression Detection Using Inverted
Vocal Tract Variables
- URL: http://arxiv.org/abs/2011.06739v3
- Date: Fri, 9 Apr 2021 04:29:46 GMT
- Title: Generalized Dilated CNN Models for Depression Detection Using Inverted
Vocal Tract Variables
- Authors: Nadee Seneviratne, Carol Espy-Wilson
- Abstract summary: Depression detection using vocal biomarkers is a highly researched area.
Findings of existing studies are mostly validated on a single database which limits the generalizability of results.
We propose to develop a generalized classifier for depression detection using a dilated Coniculaal Neural Network.
- Score: 4.050982413149992
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Depression detection using vocal biomarkers is a highly researched area.
Articulatory coordination features (ACFs) are developed based on the changes in
neuromotor coordination due to psychomotor slowing, a key feature of Major
Depressive Disorder. However findings of existing studies are mostly validated
on a single database which limits the generalizability of results. Variability
across different depression databases adversely affects the results in cross
corpus evaluations (CCEs). We propose to develop a generalized classifier for
depression detection using a dilated Convolutional Neural Network which is
trained on ACFs extracted from two depression databases. We show that ACFs
derived from Vocal Tract Variables (TVs) show promise as a robust set of
features for depression detection. Our model achieves relative accuracy
improvements of ~10% compared to CCEs performed on models trained on a single
database. We extend the study to show that fusing TVs and Mel-Frequency
Cepstral Coefficients can further improve the performance of this classifier.
Related papers
- STANet: A Novel Spatio-Temporal Aggregation Network for Depression Classification with Small and Unbalanced FMRI Data [12.344849949026989]
We propose the Spatio-Temporal Aggregation Network (STANet) for diagnosing depression by integrating CNN and RNN to capture both temporal and spatial features.
Experiments demonstrate that STANet superior depression diagnostic performance with 82.38% accuracy and a 90.72% AUC.
arXiv Detail & Related papers (2024-07-31T04:06:47Z) - How Does Pruning Impact Long-Tailed Multi-Label Medical Image
Classifiers? [49.35105290167996]
Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance.
This work represents a first step toward understanding the impact of pruning on model behavior in deep long-tailed, multi-label medical image classification.
arXiv Detail & Related papers (2023-08-17T20:40:30Z) - Automatic diagnosis of knee osteoarthritis severity using Swin
transformer [55.01037422579516]
Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint.
We propose an automated approach that employs the Swin Transformer to predict the severity of KOA.
arXiv Detail & Related papers (2023-07-10T09:49:30Z) - The Relationship Between Speech Features Changes When You Get Depressed:
Feature Correlations for Improving Speed and Performance of Depression
Detection [69.88072583383085]
This work shows that depression changes the correlation between features extracted from speech.
Using such an insight can improve the training speed and performance of depression detectors based on SVMs and LSTMs.
arXiv Detail & Related papers (2023-07-06T09:54:35Z) - Bayesian Networks for the robust and unbiased prediction of depression
and its symptoms utilizing speech and multimodal data [65.28160163774274]
We apply a Bayesian framework to capture the relationships between depression, depression symptoms, and features derived from speech, facial expression and cognitive game data collected at thymia.
arXiv Detail & Related papers (2022-11-09T14:48:13Z) - Deep Temporal Modelling of Clinical Depression through Social Media Text [1.513693945164213]
We develop a model to detect user-level clinical depression based on a user's temporal social media posts.
Our model uses a Depression Detection (DSD) classifier, which is trained on the largest existing samples of clinician annotated tweets for clinical depression symptoms.
arXiv Detail & Related papers (2022-10-28T18:31:52Z) - DECK: Behavioral Tests to Improve Interpretability and Generalizability
of BERT Models Detecting Depression from Text [4.269268432906194]
Models that accurately detect depression from text are important tools for addressing the post-pandemic mental health crisis.
BERT-based classifiers' promising performance and the off-the-shelf availability make them great candidates for this task.
We introduce the DECK (DEpression ChecKlist), depression-specific model behavioural tests that allow better interpretability.
arXiv Detail & Related papers (2022-09-12T14:39:46Z) - Speech based Depression Severity Level Classification Using a
Multi-Stage Dilated CNN-LSTM Model [5.419077350924331]
We formulate the depression classification task as a severity level classification problem to provide more granularity to the classification outcomes.
We use articulatory coordination features (ACFs) developed to capture the changes of neuromotor coordination that happens as a result of psychomotor slowing.
arXiv Detail & Related papers (2021-04-09T05:10:08Z) - Deep Multi-task Learning for Depression Detection and Prediction in
Longitudinal Data [50.02223091927777]
Depression is among the most prevalent mental disorders, affecting millions of people of all ages globally.
Machine learning techniques have shown effective in enabling automated detection and prediction of depression for early intervention and treatment.
We introduce a novel deep multi-task recurrent neural network to tackle this challenge, in which depression classification is jointly optimized with two auxiliary tasks.
arXiv Detail & Related papers (2020-12-05T05:14:14Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.