Multi-modal Depression Estimation based on Sub-attentional Fusion
- URL: http://arxiv.org/abs/2207.06180v1
- Date: Wed, 13 Jul 2022 13:19:32 GMT
- Title: Multi-modal Depression Estimation based on Sub-attentional Fusion
- Authors: Ping-Cheng Wei, Kunyu Peng, Alina Roitberg, Kailun Yang, Jiaming
Zhang, Rainer Stiefelhagen
- Abstract summary: Failure to diagnose depression leads to over 280 million people suffering from this psychological disorder worldwide.
We tackle the task of automatically identifying depression from multi-modal data.
We introduce a sub-attention mechanism for linking heterogeneous information.
- Score: 29.74171323437029
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Failure to timely diagnose and effectively treat depression leads to over 280
million people suffering from this psychological disorder worldwide. The
information cues of depression can be harvested from diverse heterogeneous
resources, e.g., audio, visual, and textual data, raising demand for new
effective multi-modal fusion approaches for its automatic estimation. In this
work, we tackle the task of automatically identifying depression from
multi-modal data and introduce a sub-attention mechanism for linking
heterogeneous information while leveraging Convolutional Bidirectional LSTM as
our backbone. To validate this idea, we conduct extensive experiments on the
public DAIC-WOZ benchmark for depression assessment featuring different
evaluation modes and taking gender-specific biases into account. The proposed
model yields effective results with 0.89 precision and 0.70 F1-score in
detecting major depression and 4.92 MAE in estimating the severity. Our
attention-based fusion module consistently outperforms conventional late fusion
approaches and achieves a competitive performance compared to the previously
published depression estimation frameworks, while learning to diagnose the
disorder end-to-end and relying on far less preprocessing steps.
Related papers
- Sentiment Informed Sentence BERT-Ensemble Algorithm for Depression Detection [0.0]
WHO revealed approximately 280 million people in the world suffer from depression.
Our paper examined the performance of several ML algorithms for early-stage depression detection using two benchmark social media datasets.
arXiv Detail & Related papers (2024-09-07T07:47:55Z) - Improving Bias Correction Standards by Quantifying its Effects on Treatment Outcomes [54.18828236350544]
Propensity score matching (PSM) addresses selection biases by selecting comparable populations for analysis.
Different matching methods can produce significantly different Average Treatment Effects (ATE) for the same task, even when meeting all validation criteria.
To address this issue, we introduce a novel metric, A2A, to reduce the number of valid matches.
arXiv Detail & Related papers (2024-07-20T12:42:24Z) - A Depression Detection Method Based on Multi-Modal Feature Fusion Using Cross-Attention [3.4872769952628926]
Depression affects approximately 3.8% of the global population.
Over 75% of individuals in low- and middle-income countries remain untreated.
This paper introduces a novel method for detecting depression based on multi-modal feature fusion utilizing cross-attention.
arXiv Detail & Related papers (2024-07-02T13:13:35Z) - MOGAM: A Multimodal Object-oriented Graph Attention Model for Depression Detection [5.506046101113427]
We introduce a Multimodal Object-Oriented Graph Attention Model (MOGAM) for detecting depression in social media.
To ensure that our model can capture authentic symptoms of depression, we only include vlogs from users with a clinical diagnosis.
MOGAM achieved an accuracy of 0.871 and an F1-score of 0.888.
arXiv Detail & Related papers (2024-03-21T07:45:58Z) - Automatic diagnosis of knee osteoarthritis severity using Swin
transformer [55.01037422579516]
Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint.
We propose an automated approach that employs the Swin Transformer to predict the severity of KOA.
arXiv Detail & Related papers (2023-07-10T09:49:30Z) - Cross-Attention is Not Enough: Incongruity-Aware Dynamic Hierarchical
Fusion for Multimodal Affect Recognition [69.32305810128994]
Incongruity between modalities poses a challenge for multimodal fusion, especially in affect recognition.
We propose the Hierarchical Crossmodal Transformer with Dynamic Modality Gating (HCT-DMG), a lightweight incongruity-aware model.
HCT-DMG: 1) outperforms previous multimodal models with a reduced size of approximately 0.8M parameters; 2) recognizes hard samples where incongruity makes affect recognition difficult; 3) mitigates the incongruity at the latent level in crossmodal attention.
arXiv Detail & Related papers (2023-05-23T01:24:15Z) - Depression Diagnosis and Analysis via Multimodal Multi-order Factor
Fusion [4.991507302519828]
Depression is a leading cause of death worldwide, and the diagnosis of depression is nontrivial.
We propose a multimodal multi-order factor fusion (MMFF) method for automatic diagnosis of depression.
arXiv Detail & Related papers (2022-12-31T17:13:06Z) - Am I No Good? Towards Detecting Perceived Burdensomeness and Thwarted
Belongingness from Suicide Notes [51.378225388679425]
We present an end-to-end multitask system to address a novel task of detection of Perceived Burdensomeness (PB) and Thwarted Belongingness (TB) from suicide notes.
We also introduce a manually translated code-mixed suicide notes corpus, CoMCEASE-v2.0, based on the benchmark CEASE-v2.0 dataset.
We exploit the temporal orientation and emotion information in the suicide notes to boost overall performance.
arXiv Detail & Related papers (2022-05-20T06:31:08Z) - Deep Multi-task Learning for Depression Detection and Prediction in
Longitudinal Data [50.02223091927777]
Depression is among the most prevalent mental disorders, affecting millions of people of all ages globally.
Machine learning techniques have shown effective in enabling automated detection and prediction of depression for early intervention and treatment.
We introduce a novel deep multi-task recurrent neural network to tackle this challenge, in which depression classification is jointly optimized with two auxiliary tasks.
arXiv Detail & Related papers (2020-12-05T05:14:14Z) - Predicting Parkinson's Disease with Multimodal Irregularly Collected
Longitudinal Smartphone Data [75.23250968928578]
Parkinsons Disease is a neurological disorder and prevalent in elderly people.
Traditional ways to diagnose the disease rely on in-person subjective clinical evaluations on the quality of a set of activity tests.
We propose a novel time-series based approach to predicting Parkinson's Disease with raw activity test data collected by smartphones in the wild.
arXiv Detail & Related papers (2020-09-25T01:50:15Z) - Multimodal Depression Severity Prediction from medical bio-markers using
Machine Learning Tools and Technologies [0.0]
Depression has been a leading cause of mental-health illnesses across the world.
Using behavioural cues to automate depression diagnosis and stage prediction in recent years has relatively increased.
The absence of labelled behavioural datasets and a vast amount of possible variations prove to be a major challenge in accomplishing the task.
arXiv Detail & Related papers (2020-09-11T20:44:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.