Related papers: Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN

URL: http://arxiv.org/abs/2302.02759v2
Date: Mon, 29 Jan 2024 16:59:09 GMT
Title: Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN
Authors: Ziyi Chen, Ren Yang, Sunyang Fu, Nansu Zong, Hongfang Liu, Ming Huang
Abstract summary: Depression is a widespread mental health issue, affecting an estimated 3.8% of the global population. We propose a hybrid deep learning model that combines a pretrained sentence BERT (SBERT) and a convolutional neural network (CNN) to detect individuals with depression from their Reddit posts. The model achieved an accuracy of 0.86 and an F1 score of 0.86, outperforming the best previously documented result in the literature (F1 score of 0.79) obtained with other machine learning models.
Score: 18.32536789799511
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Depression is a widespread mental health issue, affecting an estimated 3.8% of the global population. It is also one of the main contributors to disability worldwide. It has recently become popular for individuals to use social media platforms (e.g., Reddit) to express their difficulties and health issues (e.g., depression) and to seek support from other users in online communities. This opens great opportunities to automatically identify social media users with depression by parsing millions of posts for potential interventions. Deep learning methods have begun to dominate the fields of machine learning and natural language processing (NLP) because of their ease of use, efficient processing, and state-of-the-art results on many NLP tasks. In this work, we propose a hybrid deep learning model that combines a pretrained sentence BERT (SBERT) and a convolutional neural network (CNN) to detect individuals with depression from their Reddit posts. The sentence BERT is used to learn a meaningful representation of the semantic information in each post. The CNN enables the further transformation of those embeddings and the temporal identification of users' behavioral patterns. We trained the model and evaluated its performance in identifying Reddit users with depression using the Self-reported Mental Health Diagnoses (SMHD) data. The hybrid deep learning model achieved an accuracy of 0.86 and an F1 score of 0.86, outperforming the best previously documented result in the literature (F1 score of 0.79) obtained with other machine learning models. The results show the feasibility of the hybrid model for identifying individuals with depression. Although the hybrid model was validated on detecting depression from Reddit posts, it can be easily tuned and applied to other text classification tasks and different clinical applications.
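
The data flow described in the abstract (one embedding per post, a convolution across consecutive posts, then a user-level prediction) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: `embed_posts` is a hypothetical placeholder that returns random vectors instead of real SBERT embeddings, the weights are random and untrained, and the embedding size, kernel width, number of filters, and sigmoid output layer are all assumptions made for illustration.

```python
import math
import random


def embed_posts(posts, dim=8, seed=0):
    """Hypothetical stand-in for SBERT: one fixed-size vector per post.

    A real pipeline would call a pretrained sentence encoder here;
    random vectors are used only so the sketch runs without downloads.
    """
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in posts]


def conv_max_pool(embeddings, kernels, biases):
    """1D convolution over the post sequence, ReLU, then max-over-time pooling.

    Each kernel spans `width` consecutive posts, so a filter can respond to
    patterns across a user's posting history. Returns one value per filter.
    """
    features = []
    for kernel, bias in zip(kernels, biases):
        width = len(kernel)
        best = 0.0  # ReLU floor: the pooled activation is never negative
        for start in range(len(embeddings) - width + 1):
            activation = bias
            for offset in range(width):
                post_vec = embeddings[start + offset]
                activation += sum(w * x for w, x in zip(kernel[offset], post_vec))
            best = max(best, activation)
        features.append(best)
    return features


def predict_proba(features, weights, bias):
    """Assumed output layer: logistic regression on the pooled features."""
    z = bias + sum(w * f for w, f in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


# Illustrative sizes only; none of these values come from the paper.
rng = random.Random(1)
dim, width, n_filters = 8, 3, 4
kernels = [
    [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(width)]
    for _ in range(n_filters)
]
biases = [0.0] * n_filters
weights = [rng.uniform(-0.5, 0.5) for _ in range(n_filters)]

posts = ["post one", "post two", "post three", "post four", "post five"]
embeddings = embed_posts(posts, dim=dim)
features = conv_max_pool(embeddings, kernels, biases)
prob = predict_proba(features, weights, 0.0)
print(len(features), round(prob, 3))
```

Because the weights are untrained, the printed probability carries no meaning; the sketch only shows how a variable-length sequence of post embeddings is reduced to a fixed-size feature vector for a per-user decision.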

Related papers

DepressionEmo: A novel dataset for multilabel classification of depression emotions [6.26397257917403]
DepressionEmo is a dataset designed to detect 8 emotions associated with depression, built from 6037 examples of long Reddit user posts. This dataset was created through a majority vote over inputs by zero-shot classifications from pre-trained models. We provide several text classification methods classified into two groups: machine learning methods such as SVM, XGBoost, and LightGBM; and deep learning methods such as BERT, GAN-BERT, and BART.
arXiv Detail & Related papers (2024-01-09T16:25:31Z)
Depression detection in social media posts using affective and social norm features [84.12658971655253]
We propose a deep architecture for depression detection from social media posts. We incorporate profanity and morality features of posts and words in our architecture using a late fusion scheme. The inclusion of the proposed features yields state-of-the-art results in both settings.
arXiv Detail & Related papers (2023-03-24T21:26:27Z)
Semantic Similarity Models for Depression Severity Estimation [53.72188878602294]
This paper presents an efficient semantic pipeline to study depression severity in individuals based on their social media writings. We use test user sentences for producing semantic rankings over an index of representative training sentences corresponding to depressive symptoms and severity levels. We evaluate our methods on two Reddit-based benchmarks, achieving 30% improvement over state of the art in terms of measuring depression severity.
arXiv Detail & Related papers (2022-11-14T18:47:26Z)
SERCNN: Stacked Embedding Recurrent Convolutional Neural Network in Detecting Depression on Twitter [2.535271349350579]
We propose SERCNN, which improves user representation by stacking two pretrained embeddings from different domains. Our SERCNN outperforms the state of the art and other baselines, achieving 93.7% accuracy in a 5-fold cross-validation setting. With as few as 10 posts per user, SERCNN performed exceptionally well with an 87% accuracy, which is on par with the BERT model.
arXiv Detail & Related papers (2022-07-29T08:08:15Z)
Mental Illness Classification on Social Media Texts using Deep Learning and Transfer Learning [55.653944436488786]
According to the World Health Organization (WHO), approximately 450 million people are affected by mental illnesses such as depression, anxiety, bipolar disorder, ADHD, and PTSD. This study analyzes unstructured user data on the Reddit platform and classifies five common mental illnesses: depression, anxiety, bipolar disorder, ADHD, and PTSD.
arXiv Detail & Related papers (2022-07-03T11:33:52Z)
Data set creation and empirical analysis for detecting signs of depression from social media postings [0.0]
Depression is a common mental illness that has to be detected and treated at an early stage to avoid serious consequences. We developed a gold standard data set that labels the level of depression in social media postings as 'not depressed', 'moderately depressed', or 'severely depressed'.
arXiv Detail & Related papers (2022-02-07T10:24:33Z)
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data [74.60507696087966]
Mental health conditions remain underdiagnosed even in countries with common access to advanced medical care. One promising data source to help monitor human behavior is daily smartphone usage. We study behavioral markers of daily mood using a recent dataset of mobile behaviors from adolescent populations at high risk of suicidal behaviors.
arXiv Detail & Related papers (2021-06-24T17:46:03Z)
DepressionNet: A Novel Summarization Boosted Deep Framework for Depression Detection on Social Media [12.820775223409857]
Twitter is a popular online social media platform which allows users to share their user-generated content. One of the applications is in automatically discovering mental health problems, e.g., depression. Previous studies to automatically detect a depressed user on online social media have largely relied upon the user behaviour and their linguistic patterns.
arXiv Detail & Related papers (2021-05-23T08:05:53Z)
Deep Multi-task Learning for Depression Detection and Prediction in Longitudinal Data [50.02223091927777]
Depression is among the most prevalent mental disorders, affecting millions of people of all ages globally. Machine learning techniques have proven effective in enabling automated detection and prediction of depression for early intervention and treatment. We introduce a novel deep multi-task recurrent neural network to tackle this challenge, in which depression classification is jointly optimized with two auxiliary tasks.
arXiv Detail & Related papers (2020-12-05T05:14:14Z)
A Multitask Deep Learning Approach for User Depression Detection on Sina Weibo [6.899536164312357]
We build a large dataset on Sina Weibo (a leading OSN with the largest number of active users in the Chinese community). By analyzing users' text, social behavior, and posted pictures, ten statistical features are derived and proposed. A novel deep neural network classification model, i.e., FusionNet, is proposed and simultaneously trained with the above-extracted features.
arXiv Detail & Related papers (2020-08-26T17:53:17Z)
Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training [118.10946662410639]
We propose a novel Self-PU learning framework, which seamlessly integrates PU learning and self-training. Self-PU highlights three "self"-oriented building blocks: a self-paced training algorithm that adaptively discovers and augments confident examples as the training proceeds. We study a real-world application of PU learning, i.e., classifying brain images of Alzheimer's Disease.
arXiv Detail & Related papers (2020-06-22T17:53:59Z)

This list is automatically generated from the titles and abstracts of the papers on this site.