Discovering Mental Health Research Topics with Topic Modeling
- URL: http://arxiv.org/abs/2308.13569v1
- Date: Fri, 25 Aug 2023 05:25:05 GMT
- Title: Discovering Mental Health Research Topics with Topic Modeling
- Authors: Xin Gao, Cem Sazara
- Abstract summary: This study aims to identify general trends in the field and pinpoint high-impact research topics by analyzing a large dataset of mental health research papers.
Our dataset comprises 96,676 research papers pertaining to mental health, enabling us to examine the relationships between different topics using their abstracts.
To enhance our analysis, we also generated word clouds to provide a comprehensive overview of the machine learning models applied in mental health research.
- Score: 13.651763262606782
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Mental health significantly influences various aspects of our daily lives,
and its importance has been increasingly recognized by the research community
and the general public, particularly in the wake of the COVID-19 pandemic. This
heightened interest is evident in the growing number of publications dedicated
to mental health in the past decade. In this study, our goal is to identify
general trends in the field and pinpoint high-impact research topics by
analyzing a large dataset of mental health research papers. To accomplish this,
we collected abstracts from various databases and trained a customized
Sentence-BERT based embedding model leveraging the BERTopic framework. Our
dataset comprises 96,676 research papers pertaining to mental health, enabling
us to examine the relationships between different topics using their abstracts.
To evaluate the effectiveness of the model, we compared it against two other
state-of-the-art methods: Top2Vec model and LDA-BERT model. The model
demonstrated superior performance in metrics that measure topic diversity and
coherence. To enhance our analysis, we also generated word clouds to provide a
comprehensive overview of the machine learning models applied in mental health
research, shedding light on commonly utilized techniques and emerging trends.
Furthermore, we provide a GitHub link* to the dataset used in this paper,
ensuring its accessibility for further research endeavors.
Related papers
- Lessons Learned on Information Retrieval in Electronic Health Records: A Comparison of Embedding Models and Pooling Strategies [8.822087602255504]
Applying large language models to the clinical domain is challenging due to the context-heavy nature of processing medical records.
This paper explores how different embedding models and pooling methods affect information retrieval for the clinical domain.
arXiv Detail & Related papers (2024-09-23T16:16:08Z) - Rapid Biomedical Research Classification: The Pandemic PACT Advanced Categorisation Engine [10.692728349388297]
Pandemic PACT project aims to track and analyse research funding and clinical evidence for a wide range of diseases with outbreak potential.
This paper introduces the Pandemic PACT Advanced Categorisation Engine (PPACE) along with its associated dataset.
arXiv Detail & Related papers (2024-07-14T05:22:53Z) - Explainable AI for Mental Disorder Detection via Social Media: A survey and outlook [0.7689629183085726]
We conduct a thorough survey to explore the intersection of data science, artificial intelligence, and mental healthcare.
A significant portion of the population actively engages in online social media platforms, creating a vast repository of personal data.
The paper navigates through traditional diagnostic methods, state-of-the-art data- and AI-driven research studies, and the emergence of explainable AI (XAI) models for mental healthcare.
arXiv Detail & Related papers (2024-06-10T02:51:16Z) - Question-Answering Model for Schizophrenia Symptoms and Their Impact on
Daily Life using Mental Health Forums Data [0.0]
The Mental Health'' forum was used, a forum dedicated to people suffering from schizophrenia and different mental disorders.
It is shown how to pre-process the dataset to convert it into a QA dataset.
The BiBERT, DistilBERT, RoBERTa, and BioBERT models were fine-tuned and evaluated via F1-Score, Exact Match, Precision and Recall.
arXiv Detail & Related papers (2023-09-30T17:50:50Z) - UniBrain: Universal Brain MRI Diagnosis with Hierarchical
Knowledge-enhanced Pre-training [66.16134293168535]
We propose a hierarchical knowledge-enhanced pre-training framework for the universal brain MRI diagnosis, termed as UniBrain.
Specifically, UniBrain leverages a large-scale dataset of 24,770 imaging-report pairs from routine diagnostics.
arXiv Detail & Related papers (2023-09-13T09:22:49Z) - Application of targeted maximum likelihood estimation in public health
and epidemiological studies: a systematic review [0.0]
The Targeted Maximum Likelihood Estimation framework integrates machine learning, statistical theory, and statistical inference.
We conduct a systematic literature review in PubMed for articles that applied any form of TMLE in observational studies.
Of the 81 publications included, 25% originated from the University of California at Berkeley, where the framework was first developed.
arXiv Detail & Related papers (2023-03-13T17:50:03Z) - MET: Multimodal Perception of Engagement for Telehealth [52.54282887530756]
We present MET, a learning-based algorithm for perceiving a human's level of engagement from videos.
We release a new dataset, MEDICA, for mental health patient engagement detection.
arXiv Detail & Related papers (2020-11-17T15:18:38Z) - Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model [124.26611454540813]
Social behaviour analysis of mice is an invaluable tool to assess therapeutic efficacy of neurodegenerative diseases.
Because of the potential to create rich descriptions of mouse social behaviors, the use of multi-view video recordings for rodent observations is increasingly receiving much attention.
We propose a novel multiview latent-attention and dynamic discriminative model that jointly learns view-specific and view-shared sub-structures.
arXiv Detail & Related papers (2020-11-04T18:09:58Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z) - Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z) - Opportunities and Challenges of Deep Learning Methods for
Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare.
Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals.
This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.