Mining Mental Health Signals: A Comparative Study of Four Machine Learning Methods for Depression Detection from Social Media Posts in Sorani Kurdish
- URL: http://arxiv.org/abs/2508.15829v1
- Date: Mon, 18 Aug 2025 16:27:31 GMT
- Title: Mining Mental Health Signals: A Comparative Study of Four Machine Learning Methods for Depression Detection from Social Media Posts in Sorani Kurdish
- Authors: Idrees Mohammed, Hossein Hassani,
- Abstract summary: Depression is a common mental health condition that can lead to hopelessness, loss of interest, self-harm, and even suicide.<n>With the rise of social media, users increasingly express emotions online, offering new opportunities for detection through text analysis.<n>This work presents a machine learning and Natural Language Processing (NLP) approach to detect depression in Sorani Kurdish tweets.
- Score: 1.3464152928754487
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Depression is a common mental health condition that can lead to hopelessness, loss of interest, self-harm, and even suicide. Early detection is challenging due to individuals not self-reporting or seeking timely clinical help. With the rise of social media, users increasingly express emotions online, offering new opportunities for detection through text analysis. While prior research has focused on languages such as English, no studies exist for Sorani Kurdish. This work presents a machine learning and Natural Language Processing (NLP) approach to detect depression in Sorani tweets. A set of depression-related keywords was developed with expert input to collect 960 public tweets from X (Twitter platform). The dataset was annotated into three classes: Shows depression, Not-show depression, and Suspicious by academics and final year medical students at the University of Kurdistan Hewl\^er. Four supervised models, including Support Vector Machines, Multinomial Naive Bayes, Logistic Regression, and Random Forest, were trained and evaluated, with Random Forest achieving the highest performance accuracy and F1-score of 80%. This study establishes a baseline for automated depression detection in Kurdish language contexts.
Related papers
- DepFlow: Disentangled Speech Generation to Mitigate Semantic Bias in Depression Detection [54.209716321122194]
We present DepFlow, a depression-conditioned text-to-speech framework.<n>A Depression Acoustic Camouflage learns speaker- and content-invariant depression embeddings through adversarial training.<n>A flow-matching TTS model with FiLM modulation injects these embeddings into synthesis, enabling control over depressive severity.<n>A prototype-based severity mapping mechanism provides smooth and interpretable manipulation across the depression continuum.
arXiv Detail & Related papers (2026-01-01T10:44:38Z) - ReDepress: A Cognitive Framework for Detecting Depression Relapse from Social Media [48.56586765769052]
We present ReDepress, the first clinically validated social media dataset focused on relapse.<n>Our framework draws on cognitive theories of depression, incorporating constructs such as attention bias, interpretation bias, memory bias and rumination.<n>Our findings validate psychological theories in real-world textual data and underscore the potential of cognitive-informed computational methods for early relapse detection.
arXiv Detail & Related papers (2025-09-22T16:33:59Z) - Depression detection in social media posts using affective and social
norm features [84.12658971655253]
We propose a deep architecture for depression detection from social media posts.
We incorporate profanity and morality features of posts and words in our architecture using a late fusion scheme.
The inclusion of the proposed features yields state-of-the-art results in both settings.
arXiv Detail & Related papers (2023-03-24T21:26:27Z) - Handwriting and Drawing for Depression Detection: A Preliminary Study [53.11777541341063]
Short-term covid effects on mental health were a significant increase in anxiety and depressive symptoms.
The aim of this study is to use a new tool, the online handwriting and drawing analysis, to discriminate between healthy individuals and depressed patients.
arXiv Detail & Related papers (2023-02-05T22:33:49Z) - Semantic Similarity Models for Depression Severity Estimation [53.72188878602294]
This paper presents an efficient semantic pipeline to study depression severity in individuals based on their social media writings.
We use test user sentences for producing semantic rankings over an index of representative training sentences corresponding to depressive symptoms and severity levels.
We evaluate our methods on two Reddit-based benchmarks, achieving 30% improvement over state of the art in terms of measuring depression severity.
arXiv Detail & Related papers (2022-11-14T18:47:26Z) - Mental Illness Classification on Social Media Texts using Deep Learning
and Transfer Learning [55.653944436488786]
According to the World health organization (WHO), approximately 450 million people are affected.
Mental illnesses, such as depression, anxiety, bipolar disorder, ADHD, and PTSD.
This study analyzes unstructured user data on Reddit platform and classifies five common mental illnesses: depression, anxiety, bipolar disorder, ADHD, and PTSD.
arXiv Detail & Related papers (2022-07-03T11:33:52Z) - Automatic Depression Detection: An Emotional Audio-Textual Corpus and a
GRU/BiLSTM-based Model [17.83052349861756]
Depression is a global mental health problem, the worst case of which can lead to suicide.
We propose a novel depression detection approach utilizing speech characteristics and linguistic contents from participants' interviews.
We establish an Emotional Audio-Textual Depression Corpus (EATD-Corpus) which contains audios and extracted transcripts of responses from depressed and non-depressed volunteers.
arXiv Detail & Related papers (2022-02-15T03:29:39Z) - Data set creation and empirical analysis for detecting signs of
depression from social media postings [0.0]
Depression is a common mental illness that has to be detected and treated at an early stage to avoid serious consequences.
We developed a gold standard data set that detects the levels of depression as not depressed', moderately depressed' and severely depressed' from the social media postings.
arXiv Detail & Related papers (2022-02-07T10:24:33Z) - An Exploratory Analysis of the Relation Between Offensive Language and
Mental Health [3.333967282951668]
We train computational models to compare the use of offensive language in social media posts written by groups of individuals with and without self-reported depression diagnosis.
Our analysis indicates that offensive language is more frequently used in the samples written by individuals with self-reported depression as well as individuals showing signs of depression.
arXiv Detail & Related papers (2021-05-31T11:25:07Z) - Deep Multi-task Learning for Depression Detection and Prediction in
Longitudinal Data [50.02223091927777]
Depression is among the most prevalent mental disorders, affecting millions of people of all ages globally.
Machine learning techniques have shown effective in enabling automated detection and prediction of depression for early intervention and treatment.
We introduce a novel deep multi-task recurrent neural network to tackle this challenge, in which depression classification is jointly optimized with two auxiliary tasks.
arXiv Detail & Related papers (2020-12-05T05:14:14Z) - Detecting Early Onset of Depression from Social Media Text using Learned
Confidence Scores [19.86148958828238]
Depression is the second leading cause of death for young adults.
In this work, we focus on methods for detecting the early onset of depression from social media texts.
arXiv Detail & Related papers (2020-11-03T13:34:04Z) - Affective Conditioning on Hierarchical Networks applied to Depression
Detection from Transcribed Clinical Interviews [0.0]
Depression is a mental disorder that impacts not only the subject's mood but also the use of language.
We use a Hierarchical Attention Network to classify interviews of depressed subjects.
We augment the attention layer of our model with a conditioning mechanism on linguistic features, extracted from affective lexica.
arXiv Detail & Related papers (2020-06-04T20:55:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.