ScAN: Suicide Attempt and Ideation Events Dataset
- URL: http://arxiv.org/abs/2205.07872v1
- Date: Thu, 12 May 2022 17:11:07 GMT
- Title: ScAN: Suicide Attempt and Ideation Events Dataset
- Authors: Bhanu Pratap Singh Rawat, Samuel Kovaly, Wilfred R. Pigeon, Hong Yu
- Abstract summary: Suicidal behaviors, including suicide attempts (SA) and suicide ideations (SI) are leading risk factors for death by suicide.
Accurate detection of such documentation may help improve surveillance and predictions of patients' suicidal behaviors.
- Score: 4.905488376442885
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Suicide is an important public health concern and one of the leading causes
of death worldwide. Suicidal behaviors, including suicide attempts (SA) and
suicide ideations (SI), are leading risk factors for death by suicide.
Information related to patients' previous and current SA and SI are frequently
documented in the electronic health record (EHR) notes. Accurate detection of
such documentation may help improve surveillance and predictions of patients'
suicidal behaviors and alert medical professionals for suicide prevention
efforts. In this study, we first built Suicide Attempt and Ideation Events
(ScAN) dataset, a subset of the publicly available MIMIC III dataset spanning
over 12k+ EHR notes with 19k+ annotated SA and SI events information. The
annotations also contain attributes such as method of suicide attempt. We also
provide a strong baseline model ScANER (Suicide Attempt and Ideation Events
Retriever), a multi-task RoBERTa-based model with a retrieval module to extract
all the relevant suicidal behavioral evidences from EHR notes of an
hospital-stay and, and a prediction module to identify the type of suicidal
behavior (SA and SI) concluded during the patient's stay at the hospital.
ScANER achieved a macro-weighted F1-score of 0.83 for identifying suicidal
behavioral evidences and a macro F1-score of 0.78 and 0.60 for classification
of SA and SI for the patient's hospital-stay, respectively. ScAN and ScANER are
publicly available.
Related papers
- SepsisLab: Early Sepsis Prediction with Uncertainty Quantification and Active Sensing [67.8991481023825]
Sepsis is the leading cause of in-hospital mortality in the USA.
Existing predictive models are usually trained on high-quality data with few missing information.
For the potential high-risk patients with low confidence due to limited observations, we propose a robust active sensing algorithm.
arXiv Detail & Related papers (2024-07-24T04:47:36Z) - Bored to Death: Artificial Intelligence Research Reveals the Role of Boredom in Suicide Behavior [10.83380478033686]
This study aimed to harness AI methodologies to uncover hidden risk factors that trigger or aggravate suicide behaviors.
An almost fully automated, AI-guided research pipeline resulted in four Facebook topics that predicted the risk of suicide.
The study signals boredom as a maladaptive 'ingredient' that might trigger suicide behaviors, regardless of depression.
arXiv Detail & Related papers (2024-04-22T10:16:02Z) - SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis [22.709733830774788]
This study presents a Chinese social media dataset designed for fine-grained suicide risk classification.
Seven pre-trained models were evaluated in two tasks: high and low suicide risk, and fine-grained suicide risk classification on a level of 0 to 10.
Deep learning models show good performance in distinguishing between high and low suicide risk, with the best model achieving an F1 score of 88.39%.
arXiv Detail & Related papers (2024-04-19T06:58:51Z) - Non-Invasive Suicide Risk Prediction Through Speech Analysis [74.8396086718266]
We present a non-invasive, speech-based approach for automatic suicide risk assessment.
We extract three sets of features, including wav2vec, interpretable speech and acoustic features, and deep learning-based spectral representations.
Our most effective speech model achieves a balanced accuracy of $66.2,%$.
arXiv Detail & Related papers (2024-04-18T12:33:57Z) - Conceptualizing Suicidal Behavior: Utilizing Explanations of Predicted
Outcomes to Analyze Longitudinal Social Media Data [2.76101452577748]
The COVID-19 pandemic has escalated mental health crises worldwide.
Suicide can result from social factors such as shame, abuse, abandonment, and mental health conditions like depression.
As these conditions develop, signs of suicidal ideation may manifest in social media interactions.
arXiv Detail & Related papers (2023-12-13T17:15:12Z) - Multimodal Pretraining of Medical Time Series and Notes [45.89025874396911]
Deep learning models show promise in extracting meaningful patterns, but they require extensive labeled data.
We propose a novel approach employing self-supervised pretraining, focusing on the alignment of clinical measurements and notes.
In downstream tasks, including in-hospital mortality prediction and phenotyping, our model outperforms baselines in settings where only a fraction of the data is labeled.
arXiv Detail & Related papers (2023-12-11T21:53:40Z) - Exploring Social Media for Early Detection of Depression in COVID-19
Patients [44.76299288962596]
Detection and intervention at an early stage can reduce the risk of depression in COVID-19 patients.
We managed a dataset of COVID-19 patients that contains information about their social media activity both before and after infection.
We conducted an extensive analysis of this dataset to investigate the characteristic of COVID-19 patients with a higher risk of depression.
arXiv Detail & Related papers (2023-02-23T14:13:52Z) - Am I No Good? Towards Detecting Perceived Burdensomeness and Thwarted
Belongingness from Suicide Notes [51.378225388679425]
We present an end-to-end multitask system to address a novel task of detection of Perceived Burdensomeness (PB) and Thwarted Belongingness (TB) from suicide notes.
We also introduce a manually translated code-mixed suicide notes corpus, CoMCEASE-v2.0, based on the benchmark CEASE-v2.0 dataset.
We exploit the temporal orientation and emotion information in the suicide notes to boost overall performance.
arXiv Detail & Related papers (2022-05-20T06:31:08Z) - Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting [47.93070579578704]
We explore the potential of using actively-collected data to predict and forecast daily PHQ-2 scores on a newly-collected longitudinal dataset.
We obtain a best MAE of 1.417 for daily prediction of PHQ-2 scores, which specifically in the used dataset have a range of 0 to 12.
This illustrates the additive value that can be obtained by incorporating actively-collected data in a depression monitoring application.
arXiv Detail & Related papers (2022-05-06T17:47:05Z) - Characterization of Time-variant and Time-invariant Assessment of
Suicidality on Reddit using C-SSRS [9.424631103856596]
We develop deep learning algorithms to assess suicide risk in terms of severity and temporality from Reddit data.
Our results suggest that the time-variant approach outperforms the time-invariant method in the assessment of suicide-related ideations and supportive behaviors.
The proposed approach can be integrated with clinical diagnostic interviews for improving suicide risk assessments.
arXiv Detail & Related papers (2021-04-09T01:39:41Z) - Survival Modeling of Suicide Risk with Rare and Uncertain Diagnoses [15.732431764583323]
We use medical claims data to study the risk of subsequent suicide attempts for patients who were hospitalized due to suicide attempts and later discharged.
Almost 20% of "suspected" suicide attempts are identified from diagnosis codes indicating external causes of injury and poisoning with undermined intent.
arXiv Detail & Related papers (2020-09-05T20:47:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.