What's Race Got to do with it? Predicting Youth Depression Across Racial
Groups Using Machine and Deep Learning
- URL: http://arxiv.org/abs/2308.11591v1
- Date: Mon, 21 Aug 2023 13:59:50 GMT
- Title: What's Race Got to do with it? Predicting Youth Depression Across Racial
Groups Using Machine and Deep Learning
- Authors: Nathan Zhong and Nikhil Yadav
- Abstract summary: Depression is a common yet serious mental disorder that affects millions of U.S. high schoolers every year.
This study proposes a similar approach, utilizing machine learning (ML) and artificial neural network (ANN) models to classify depression in a student.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Depression is a common yet serious mental disorder that affects millions of
U.S. high schoolers every year. Still, accurate diagnosis and early detection
remain significant challenges. In the field of public health, research shows
that neural networks produce promising results in identifying other diseases
such as cancer and HIV. This study proposes a similar approach, utilizing
machine learning (ML) and artificial neural network (ANN) models to classify
depression in a student. Additionally, the study highlights the differences in
relevant factors for race subgroups and advocates the need for more extensive
and diverse datasets. The models train on nationwide Youth Risk Behavior
Surveillance System (YRBSS) survey data, in which the most relevant factors of
depression are found with statistical analysis. The survey data is a structured
dataset with 15000 entries including three race subsets each consisting of 900
entries. For classification, the research problem is modeled as a supervised
learning binary classification problem. Factors relevant to depression for
different racial subgroups are also identified. The ML and ANN models are
trained on the entire dataset followed by different race subsets to classify
whether an individual has depression. The ANN model achieves the highest F1
score of 82.90% while the best-performing machine learning model, support
vector machines (SVM), achieves a score of 81.90%. This study reveals that
different parameters are more valuable for modeling depression across diverse
racial groups and furthers research regarding American youth depression.
Related papers
- Large-scale digital phenotyping: identifying depression and anxiety indicators in a general UK population with over 10,000 participants [2.2909783327197393]
We conducted a cross-sectional analysis of data from 10,129 participants recruited from a UK-based general population.
Participants shared wearable (Fitbit) data and self-reported questionnaires on depression (PHQ-8), anxiety (GAD-7), and mood via a study app.
We observed significant associations between the severity of depression and anxiety with several factors, including mood, age, gender, BMI, sleep patterns, physical activity, and heart rate.
arXiv Detail & Related papers (2024-09-24T16:05:17Z) - A Demographic-Conditioned Variational Autoencoder for fMRI Distribution Sampling and Removal of Confounds [49.34500499203579]
We create a variational autoencoder (VAE)-based model, DemoVAE, to decorrelate fMRI features from demographics.
We generate high-quality synthetic fMRI data based on user-supplied demographics.
arXiv Detail & Related papers (2024-05-13T17:49:20Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Fairness Evolution in Continual Learning for Medical Imaging [47.52603262576663]
We study the behavior of Continual Learning (CL) strategies in medical imaging regarding classification performance.
We evaluate the Replay, Learning without Forgetting (LwF), LwF, and Pseudo-Label strategies.
LwF and Pseudo-Label exhibit optimal classification performance, but when including fairness metrics in the evaluation, it is clear that Pseudo-Label is less biased.
arXiv Detail & Related papers (2024-04-10T09:48:52Z) - Adapting Machine Learning Diagnostic Models to New Populations Using a Small Amount of Data: Results from Clinical Neuroscience [21.420302408947194]
We develop a weighted empirical risk minimization approach that optimally combines data from a source group to make predictions on a target group.
We apply this method to multi-source data of 15,363 individuals from 20 neuroimaging studies to build ML models for diagnosis of Alzheimer's disease and estimation of brain age.
arXiv Detail & Related papers (2023-08-06T18:05:39Z) - Change is Hard: A Closer Look at Subpopulation Shift [48.0369745740936]
We propose a unified framework that dissects and explains common shifts in subgroups.
We then establish a benchmark of 20 state-of-the-art algorithms evaluated on 12 real-world datasets in vision, language, and healthcare domains.
arXiv Detail & Related papers (2023-02-23T18:59:56Z) - Detecting Reddit Users with Depression Using a Hybrid Neural Network
SBERT-CNN [18.32536789799511]
Depression is a widespread mental health issue, affecting an estimated 3.8% of the global population.
We propose a hybrid deep learning model which combines a pretrained sentence BERT (SBERT) and convolutional neural network (CNN) to detect individuals with depression with their Reddit posts.
The model achieved an accuracy of 0.86 and an F1 score of 0.86 and outperformed the state-of-the-art documented result (F1 score of 0.79) by other machine learning models in the literature.
arXiv Detail & Related papers (2023-02-03T06:22:18Z) - Picking on the Same Person: Does Algorithmic Monoculture lead to Outcome
Homogenization? [90.35044668396591]
A recurring theme in machine learning is algorithmic monoculture: the same systems, or systems that share components, are deployed by multiple decision-makers.
We propose the component-sharing hypothesis: if decision-makers share components like training data or specific models, then they will produce more homogeneous outcomes.
We test this hypothesis on algorithmic fairness benchmarks, demonstrating that sharing training data reliably exacerbates homogenization.
We conclude with philosophical analyses of and societal challenges for outcome homogenization, with an eye towards implications for deployed machine learning systems.
arXiv Detail & Related papers (2022-11-25T09:33:11Z) - Machine Learning Models Are Not Necessarily Biased When Constructed
Properly: Evidence from Neuroimaging Studies [19.288217559980545]
We provide experimental data which support that when properly trained, machine learning models can generalize well across diverse conditions.
Specifically, by using multi-study magnetic resonance imaging consortia for diagnosing Alzheimer's disease, schizophrenia, and autism spectrum disorder, we find that, the accuracy of well-trained models is consistent across different subgroups.
arXiv Detail & Related papers (2022-05-26T15:24:39Z) - Gender and Racial Fairness in Depression Research using Social Media [13.512136878021854]
Social media data has spurred interest in mental health research from a computational lens.
Previous research has raised concerns about possible biases in models produced from this data.
Our study concludes with recommendations on how to avoid these biases in future research.
arXiv Detail & Related papers (2021-03-18T22:34:41Z) - Deep Multi-task Learning for Depression Detection and Prediction in
Longitudinal Data [50.02223091927777]
Depression is among the most prevalent mental disorders, affecting millions of people of all ages globally.
Machine learning techniques have shown effective in enabling automated detection and prediction of depression for early intervention and treatment.
We introduce a novel deep multi-task recurrent neural network to tackle this challenge, in which depression classification is jointly optimized with two auxiliary tasks.
arXiv Detail & Related papers (2020-12-05T05:14:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.