Enhanced Labeling Technique for Reddit Text and Fine-Tuned Longformer
Models for Classifying Depression Severity in English and Luganda
- URL: http://arxiv.org/abs/2401.14240v1
- Date: Thu, 25 Jan 2024 15:28:07 GMT
- Title: Enhanced Labeling Technique for Reddit Text and Fine-Tuned Longformer
Models for Classifying Depression Severity in English and Luganda
- Authors: Richard Kimera, Daniela N. Rim, Joseph Kirabira, Ubong Godwin Udomah,
Heeyoul Choi
- Abstract summary: This research extracts text from Reddit to facilitate the diagnostic process.
It employs a proposed labeling approach to categorize the text and subsequently fine-tunes the Longformer model.
Our findings reveal that the Longformer model outperforms the baseline models in both English (48%) and Luganda (45%) languages.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Depression is a global burden and one of the most challenging mental health
conditions to control. Experts can detect its severity early using the Beck
Depression Inventory (BDI) questionnaire, administer appropriate medication to
patients, and impede its progression. Due to the fear of potential
stigmatization, many patients turn to social media platforms like Reddit for
advice and assistance at various stages of their journey. This research
extracts text from Reddit to facilitate the diagnostic process. It employs a
proposed labeling approach to categorize the text and subsequently fine-tunes
the Longformer model. The model's performance is compared against baseline
models, including Naive Bayes, Random Forest, Support Vector Machines, and
Gradient Boosting. Our findings reveal that the Longformer model outperforms
the baseline models in both English (48%) and Luganda (45%) languages on a
custom-made dataset.
Related papers
- LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment [75.44934940580112]
This study introduces LlaMADRS, a novel framework leveraging open-source Large Language Models (LLMs) to automate depression severity assessment.
We employ a zero-shot prompting strategy with carefully designed cues to guide the model in interpreting and scoring transcribed clinical interviews.
Our approach, tested on 236 real-world interviews, demonstrates strong correlations with clinician assessments.
arXiv Detail & Related papers (2025-01-07T08:49:04Z) - Detecting anxiety and depression in dialogues: a multi-label and explainable approach [5.635300481123079]
Anxiety and depression are the most common mental health issues worldwide, affecting a non-negligible part of the population.
In this work, an entirely novel system for the multi-label classification of anxiety and depression is proposed.
arXiv Detail & Related papers (2024-12-23T15:29:46Z) - MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders [59.515827458631975]
Mental health disorders are one of the most serious diseases in the world.
Privacy concerns limit the accessibility of personalized treatment data.
MentalArena is a self-play framework to train language models.
arXiv Detail & Related papers (2024-10-09T13:06:40Z) - They Look Like Each Other: Case-based Reasoning for Explainable Depression Detection on Twitter using Large Language Models [3.5904920375592098]
We introduce ProtoDep, a novel, explainable framework for Twitter-based depression detection.
ProtoDep provides transparent explanations at three levels: (i) symptom-level explanations for each tweet and user, (ii) case-based explanations comparing the user to similar individuals, and (iii) transparent decision-making through classification weights.
arXiv Detail & Related papers (2024-07-21T20:13:50Z) - Medical Vision-Language Pre-Training for Brain Abnormalities [96.1408455065347]
We show how to automatically collect medical image-text aligned data for pretraining from public resources such as PubMed.
In particular, we present a pipeline that streamlines the pre-training process by initially collecting a large brain image-text dataset.
We also investigate the unique challenge of mapping subfigures to subcaptions in the medical domain.
arXiv Detail & Related papers (2024-04-27T05:03:42Z) - Multi-class Categorization of Reasons behind Mental Disturbance in Long
Texts [0.0]
We use Longformer to handle the problem of finding causal indicators behind mental illness in self-reported text.
Experiments show that Longformer achieves new state-of-the-art results on M-CAMS, a publicly available dataset with 62% F1-score.
We believe our work facilitates causal analysis of depression and suicide risk on social media data, and shows potential for application on other mental health conditions.
arXiv Detail & Related papers (2023-04-08T22:44:32Z) - PAL: Persona-Augmented Emotional Support Conversation Generation [54.069321178816686]
Due to the lack of human resources for mental health support, there is an increasing demand for employing conversational agents for support.
Recent work has demonstrated the effectiveness of dialogue models in providing emotional support.
We propose a framework for dynamically inferring and modeling seekers' persona.
arXiv Detail & Related papers (2022-12-19T04:12:54Z) - Semantic Similarity Models for Depression Severity Estimation [53.72188878602294]
This paper presents an efficient semantic pipeline to study depression severity in individuals based on their social media writings.
We use test user sentences for producing semantic rankings over an index of representative training sentences corresponding to depressive symptoms and severity levels.
We evaluate our methods on two Reddit-based benchmarks, achieving 30% improvement over state of the art in terms of measuring depression severity.
arXiv Detail & Related papers (2022-11-14T18:47:26Z) - Semi-Supervised Variational Reasoning for Medical Dialogue Generation [70.838542865384]
Two key characteristics are relevant for medical dialogue generation: patient states and physician actions.
We propose an end-to-end variational reasoning approach to medical dialogue generation.
A physician policy network composed of an action-classifier and two reasoning detectors is proposed for augmented reasoning ability.
arXiv Detail & Related papers (2021-05-13T04:14:35Z) - A Novel Sentiment Analysis Engine for Preliminary Depression Status
Estimation on Social Media [0.0]
We propose a cloud-based smartphone application, with a deep learning-based backend to primarily perform depression detection on Twitter social media.
A psychologist could leverage the application to assess the patient's depression status prior to counseling, which provides better insight into the mental health status of a patient.
The model achieved pinnacle results, with a testing accuracy of 87.23% and an AUC of 0.8621.
arXiv Detail & Related papers (2020-11-29T04:42:53Z) - Text Mining to Identify and Extract Novel Disease Treatments From
Unstructured Datasets [56.38623317907416]
We use Google Cloud to transcribe podcast episodes of an NPR radio show.
We then build a pipeline for systematically pre-processing the text.
Our model successfully identified that Omeprazole can help treat heartburn.
arXiv Detail & Related papers (2020-10-22T19:52:49Z) - Multimodal Depression Severity Prediction from medical bio-markers using
Machine Learning Tools and Technologies [0.0]
Depression has been a leading cause of mental-health illnesses across the world.
Using behavioural cues to automate depression diagnosis and stage prediction in recent years has relatively increased.
The absence of labelled behavioural datasets and a vast amount of possible variations prove to be a major challenge in accomplishing the task.
arXiv Detail & Related papers (2020-09-11T20:44:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.