Identifying social isolation themes in NVDRS text narratives using topic modeling and text-classification methods
- URL: http://arxiv.org/abs/2506.15030v1
- Date: Wed, 18 Jun 2025 00:13:42 GMT
- Title: Identifying social isolation themes in NVDRS text narratives using topic modeling and text-classification methods
- Authors: Drew Walker, Swati Rajwal, Sudeshna Das, Snigdha Peddireddy, Abeed Sarker,
- Abstract summary: Social isolation and loneliness are not currently recorded within the US National Violent Death Reporting System.<n>Natural language processing (NLP) techniques can be used to identify these constructs in law enforcement and coroner medical examiner narratives.
- Score: 2.2822409889718798
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Social isolation and loneliness, which have been increasing in recent years strongly contribute toward suicide rates. Although social isolation and loneliness are not currently recorded within the US National Violent Death Reporting System's (NVDRS) structured variables, natural language processing (NLP) techniques can be used to identify these constructs in law enforcement and coroner medical examiner narratives. Using topic modeling to generate lexicon development and supervised learning classifiers, we developed high-quality classifiers (average F1: .86, accuracy: .82). Evaluating over 300,000 suicides from 2002 to 2020, we identified 1,198 mentioning chronic social isolation. Decedents had higher odds of chronic social isolation classification if they were men (OR = 1.44; CI: 1.24, 1.69, p<.0001), gay (OR = 3.68; 1.97, 6.33, p<.0001), or were divorced (OR = 3.34; 2.68, 4.19, p<.0001). We found significant predictors for other social isolation topics of recent or impending divorce, child custody loss, eviction or recent move, and break-up. Our methods can improve surveillance and prevention of social isolation and loneliness in the United States.
Related papers
- Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in LLM Multi-Agent Systems [0.0]
In perpetrator treatment, offenders articulate remorse yet behavioral change does not follow.<n>We show that alignment interventions produce a structurally analogous phenomenon: surface safety that masks or generates collective pathology and internal dissociation.<n>These findings reframe alignment as a behavioral intervention subject to risk homeostasis and iatrogenesis.
arXiv Detail & Related papers (2026-03-05T07:46:59Z) - Spiral of Silence in Large Language Model Agents [44.98734791415891]
The Spiral of Silence (SoS) theory holds that individuals with minority views often refrain from speaking out for fear of social isolation.<n>This raises a central question: can SoS-like dynamics emerge from purely statistical language generation in large language models?<n>We consider four controlled conditions that systematically vary the availability of 'History' and 'Persona' signals.
arXiv Detail & Related papers (2025-09-28T08:59:54Z) - Generative Exaggeration in LLM Social Agents: Consistency, Bias, and Toxicity [2.3997896447030653]
We investigate how Large Language Models (LLMs) behave when simulating political discourse on social media.<n>We construct LLM agents based on 1,186 real users, prompting them to reply to politically salient tweets under controlled conditions.<n>We find that richer contextualization improves internal consistency but also amplifies polarization, stylized signals, and harmful language.
arXiv Detail & Related papers (2025-07-01T10:54:51Z) - Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown [55.91887554462312]
We investigate the factuality of long-form text generation across various large language models (LLMs)
Our analysis reveals that factuality scores tend to decline in later sentences of the generated text, accompanied by a rise in the number of unsupported claims.
We find a correlation between higher Self-Known scores and improved factuality, while higher Self-Unknown scores are associated with lower factuality.
arXiv Detail & Related papers (2024-11-24T22:06:26Z) - Causal Micro-Narratives [62.47217054314046]
We present a novel approach to classify causal micro-narratives from text.
These narratives are sentence-level explanations of the cause(s) and/or effect(s) of a target subject.
arXiv Detail & Related papers (2024-10-07T17:55:10Z) - SS-GEN: A Social Story Generation Framework with Large Language Models [87.11067593512716]
Children with Autism Spectrum Disorder (ASD) often misunderstand social situations and struggle to participate in daily routines.
Social Stories are traditionally crafted by psychology experts under strict constraints to address these challenges.
We propose textbfSS-GEN, a framework to generate Social Stories in real-time with broad coverage.
arXiv Detail & Related papers (2024-06-22T00:14:48Z) - The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models [78.69526166193236]
Pre-trained Language models (PLMs) have been acknowledged to contain harmful information, such as social biases.
We propose sc Social Bias Neurons to accurately pinpoint units (i.e., neurons) in a language model that can be attributed to undesirable behavior, such as social bias.
As measured by prior metrics from StereoSet, our model achieves a higher degree of fairness while maintaining language modeling ability with low cost.
arXiv Detail & Related papers (2024-06-14T15:41:06Z) - White Men Lead, Black Women Help? Benchmarking and Mitigating Language Agency Social Biases in LLMs [58.27353205269664]
Social biases can manifest in language agency in Large Language Model (LLM)-generated content.<n>We introduce the Language Agency Bias Evaluation benchmark, which comprehensively evaluates biases in LLMs.<n>Using LABE, we unveil language agency social biases in 3 recent LLMs: ChatGPT, Llama3, and Mistral.
arXiv Detail & Related papers (2024-04-16T12:27:54Z) - Reliability Analysis of Psychological Concept Extraction and
Classification in User-penned Text [9.26840677406494]
We use the LoST dataset to capture nuanced textual cues that suggest the presence of low self-esteem in the posts of Reddit users.
Our findings suggest the need of shifting the focus of PLMs from Trigger and Consequences to a more comprehensive explanation.
arXiv Detail & Related papers (2024-01-12T17:19:14Z) - Conceptualizing Suicidal Behavior: Utilizing Explanations of Predicted
Outcomes to Analyze Longitudinal Social Media Data [2.76101452577748]
The COVID-19 pandemic has escalated mental health crises worldwide.
Suicide can result from social factors such as shame, abuse, abandonment, and mental health conditions like depression.
As these conditions develop, signs of suicidal ideation may manifest in social media interactions.
arXiv Detail & Related papers (2023-12-13T17:15:12Z) - LonXplain: Lonesomeness as a Consequence of Mental Disturbance in Reddit
Posts [0.41998444721319217]
Social media is a potential source of information that infers latent mental states through Natural Language Processing (NLP)
Existing literature on psychological theories points to loneliness as the major consequence of interpersonal risk factors.
We formulate lonesomeness detection in social media posts as an explainable binary classification problem.
arXiv Detail & Related papers (2023-05-30T04:21:24Z) - Stable Bias: Analyzing Societal Representations in Diffusion Models [72.27121528451528]
We propose a new method for exploring the social biases in Text-to-Image (TTI) systems.
Our approach relies on characterizing the variation in generated images triggered by enumerating gender and ethnicity markers in the prompts.
We leverage this method to analyze images generated by 3 popular TTI systems and find that while all of their outputs show correlations with US labor demographics, they also consistently under-represent marginalized identities to different extents.
arXiv Detail & Related papers (2023-03-20T19:32:49Z) - A Hierarchical Regression Chain Framework for Affective Vocal Burst
Recognition [72.36055502078193]
We propose a hierarchical framework, based on chain regression models, for affective recognition from vocal bursts.
To address the challenge of data sparsity, we also use self-supervised learning (SSL) representations with layer-wise and temporal aggregation modules.
The proposed systems participated in the ACII Affective Vocal Burst (A-VB) Challenge 2022 and ranked first in the "TWO'' and "CULTURE" tasks.
arXiv Detail & Related papers (2023-03-14T16:08:45Z) - Topic Modelling of Swedish Newspaper Articles about Coronavirus: a Case
Study using Latent Dirichlet Allocation Method [8.405827390095064]
Topic Modelling (TM) is from the research branches of natural language understanding (NLU) and natural language processing (NLP)
In this study, we apply popular Latent Dirichlet Allocation (LDA) methods to model the topic changes in Swedish newspaper articles about Coronavirus.
We describe the corpus we created including 6515 articles, methods applied, and statistics on topic changes over approximately 1 year and two months period of time from 17th January 2020 to 13th March 2021.
arXiv Detail & Related papers (2023-01-08T12:33:58Z) - An ensemble deep learning technique for detecting suicidal ideation from
posts in social media platforms [0.0]
This paper proposes a LSTM-Attention-CNN combined model to analyze social media submissions to detect suicidal intentions.
The proposed model demonstrated an accuracy of 90.3 percent and an F1-score of 92.6 percent.
arXiv Detail & Related papers (2021-12-17T15:34:03Z) - Quantifying the Suicidal Tendency on Social Media: A Survey [0.0]
Suicide is one of the leading cause of premature but preventable death.
Recent studies have shown that mining social media data has helped in quantifying the suicidal tendency of users at risk.
This manuscript elucidates the taxonomy of mental healthcare and highlights some recent attempts in examining the potential of quantifying suicidal tendency on social media data.
arXiv Detail & Related papers (2021-10-04T12:26:14Z) - Suicidal Ideation and Mental Disorder Detection with Attentive Relation
Networks [43.2802002858859]
This paper enhances text representation with lexicon-based sentiment scores and latent topics.
It proposes using relation networks to detect suicidal ideation and mental disorders with related risk indicators.
arXiv Detail & Related papers (2020-04-16T11:18:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.