Deep Learning for Suicide and Depression Identification with
Unsupervised Label Correction
- URL: http://arxiv.org/abs/2102.09427v1
- Date: Thu, 18 Feb 2021 15:40:07 GMT
- Title: Deep Learning for Suicide and Depression Identification with
Unsupervised Label Correction
- Authors: Ayaan Haque, Viraaj Reddi, and Tyler Giallanza
- Abstract summary: Early detection of suicidal ideation in depressed individuals can allow for adequate medical attention and support.
Recent NLP research focuses on classifying, from a given piece of text, if an individual is suicidal or clinically healthy.
We propose SDCNL, a suicide versus classification method through a deep learning approach.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Early detection of suicidal ideation in depressed individuals can allow for
adequate medical attention and support, which in many cases is life-saving.
Recent NLP research focuses on classifying, from a given piece of text, if an
individual is suicidal or clinically healthy. However, there have been no major
attempts to differentiate between depression and suicidal ideation, which is an
important clinical challenge. Due to the scarce availability of EHR data,
suicide notes, or other similar verified sources, web query data has emerged as
a promising alternative. Online sources, such as Reddit, allow for anonymity
that prompts honest disclosure of symptoms, making it a plausible source even
in a clinical setting. However, these online datasets also result in lower
performance, which can be attributed to the inherent noise in web-scraped
labels, which necessitates a noise-removal process. Thus, we propose SDCNL, a
suicide versus depression classification method through a deep learning
approach. We utilize online content from Reddit to train our algorithm, and to
verify and correct noisy labels, we propose a novel unsupervised label
correction method which, unlike previous work, does not require prior noise
distribution information. Our extensive experimentation with multiple deep word
embedding models and classifiers display the strong performance of the method
in anew, challenging classification application. We make our code and dataset
available at https://github.com/ayaanzhaque/SDCNL
Related papers
- Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing [15.1082838217225]
Barriers to people receiving support include social stigma and lack of access to mental health care.
With the popularity of social media, people have turned to online forums, such as Reddit to express their feelings and seek support.
Social media posts can be classified, using text classification, to help connect people with professional help.
These systems fail to account for the inherent uncertainty in classifying mental health conditions.
arXiv Detail & Related papers (2024-05-09T14:25:25Z) - SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis [22.709733830774788]
This study presents a Chinese social media dataset designed for fine-grained suicide risk classification.
Seven pre-trained models were evaluated in two tasks: high and low suicide risk, and fine-grained suicide risk classification on a level of 0 to 10.
Deep learning models show good performance in distinguishing between high and low suicide risk, with the best model achieving an F1 score of 88.39%.
arXiv Detail & Related papers (2024-04-19T06:58:51Z) - Non-Invasive Suicide Risk Prediction Through Speech Analysis [74.8396086718266]
We present a non-invasive, speech-based approach for automatic suicide risk assessment.
We extract three sets of features, including wav2vec, interpretable speech and acoustic features, and deep learning-based spectral representations.
Our most effective speech model achieves a balanced accuracy of $66.2,%$.
arXiv Detail & Related papers (2024-04-18T12:33:57Z) - A noisy elephant in the room: Is your out-of-distribution detector robust to label noise? [49.88894124047644]
We take a closer look at 20 state-of-the-art OOD detection methods.
We show that poor separation between incorrectly classified ID samples vs. OOD samples is an overlooked yet important limitation of existing methods.
arXiv Detail & Related papers (2024-04-02T09:40:22Z) - Combating Label Noise With A General Surrogate Model For Sample
Selection [84.61367781175984]
We propose to leverage the vision-language surrogate model CLIP to filter noisy samples automatically.
We validate the effectiveness of our proposed method on both real-world and synthetic noisy datasets.
arXiv Detail & Related papers (2023-10-16T14:43:27Z) - Leveraging Contextual Relatedness to Identify Suicide Documentation in
Clinical Notes through Zero Shot Learning [8.57098973963918]
This paper describes a novel methodology that identifies suicidality in clinical notes by addressing this data sparsity issue through zero-shot learning.
A deep neural network was trained by mapping the training documents contents to a semantic space.
In applying a 0.90 probability threshold, the methodology identified notes not associated with a relevant ICD 10 CM code that documented suicidality, with 94 percent accuracy.
arXiv Detail & Related papers (2023-01-09T17:26:07Z) - Transductive Linear Probing: A Novel Framework for Few-Shot Node
Classification [56.17097897754628]
We show that transductive linear probing with self-supervised graph contrastive pretraining can outperform the state-of-the-art fully supervised meta-learning based methods under the same protocol.
We hope this work can shed new light on few-shot node classification problems and foster future research on learning from scarcely labeled instances on graphs.
arXiv Detail & Related papers (2022-12-11T21:10:34Z) - An ensemble deep learning technique for detecting suicidal ideation from
posts in social media platforms [0.0]
This paper proposes a LSTM-Attention-CNN combined model to analyze social media submissions to detect suicidal intentions.
The proposed model demonstrated an accuracy of 90.3 percent and an F1-score of 92.6 percent.
arXiv Detail & Related papers (2021-12-17T15:34:03Z) - Detecting Potentially Harmful and Protective Suicide-related Content on
Twitter: A Machine Learning Approach [0.1582078748632554]
We apply machine learning methods to automatically label large quantities of Twitter data.
Two deep learning models achieved the best performance in two classification tasks.
This work enables future large-scale investigations on harmful and protective effects of various kinds of social media content on suicide rates and on help-seeking behavior.
arXiv Detail & Related papers (2021-12-09T09:35:48Z) - Tackling Instance-Dependent Label Noise via a Universal Probabilistic
Model [80.91927573604438]
This paper proposes a simple yet universal probabilistic model, which explicitly relates noisy labels to their instances.
Experiments on datasets with both synthetic and real-world label noise verify that the proposed method yields significant improvements on robustness.
arXiv Detail & Related papers (2021-01-14T05:43:51Z) - A Second-Order Approach to Learning with Instance-Dependent Label Noise [58.555527517928596]
The presence of label noise often misleads the training of deep neural networks.
We show that the errors in human-annotated labels are more likely to be dependent on the difficulty levels of tasks.
arXiv Detail & Related papers (2020-12-22T06:36:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.