Predicting VBAC Outcomes from U.S. Natality Data using Deep and Classical Machine Learning Models
- URL: http://arxiv.org/abs/2507.21330v1
- Date: Mon, 28 Jul 2025 20:54:55 GMT
- Title: Predicting VBAC Outcomes from U.S. Natality Data using Deep and Classical Machine Learning Models
- Authors: Ananya Anand,
- Abstract summary: This study presents supervised machine learning models for predicting vaginal birth after cesarean (VBAC) from the CDC WONDER Natality dataset.<n>Three classifiers were trained: logistic regression, XGBoost, and a multilayer perceptron (MLP)<n>The models achieved the highest performance with an AUC of 0.7287, followed closely by XGBoost (Boost = 0.727), both surpassing the logistic regression baseline.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurately predicting the outcome of a trial of labor after cesarean (TOLAC) is essential for guiding prenatal counseling and minimizing delivery-related risks. This study presents supervised machine learning models for predicting vaginal birth after cesarean (VBAC) using 643,029 TOLAC cases from the CDC WONDER Natality dataset (2017-2023). After filtering for singleton births with one or two prior cesareans and complete data across 47 prenatal-period features, three classifiers were trained: logistic regression, XGBoost, and a multilayer perceptron (MLP). The MLP achieved the highest performance with an AUC of 0.7287, followed closely by XGBoost (AUC = 0.727), both surpassing the logistic regression baseline (AUC = 0.709). To address class imbalance, class weighting was applied to the MLP, and a custom loss function was implemented in XGBoost. Evaluation metrics included ROC curves, confusion matrices, and precision-recall analysis. Logistic regression coefficients highlighted maternal BMI, education, parity, comorbidities, and prenatal care indicators as key predictors. Overall, the results demonstrate that routinely collected, early-pregnancy variables can support scalable and moderately high-performing VBAC prediction models. These models offer potential utility in clinical decision support, particularly in settings lacking access to specialized intrapartum data.
Related papers
- M-TabNet: A Multi-Encoder Transformer Model for Predicting Neonatal Birth Weight from Multimodal Data [3.452389713639621]
Birth weight (BW) is a key indicator of neonatal health, with low birth weight (LBW) linked to increased mortality and morbidity.<n>Existing models often neglect nutritional and genetic influences, focusing mainly on physiological and lifestyle factors.<n>This study presents an attention-based transformer model with a multi-encoder architecture for early (less than 12 weeks of gestation) BW prediction.
arXiv Detail & Related papers (2025-04-20T00:03:47Z) - Machine Learning-Based Model for Postoperative Stroke Prediction in Coronary Artery Disease [0.0]
This study aims to develop and evaluate a sophisticated machine learning prediction model to assess postoperative stroke risk.<n>The dataset has 70% training and 30% test. Numerical values were normalized, whereas categorical variables were one-hot encoded.<n> Logistic Regression, XGBoost, SVM, and CatBoost were employed for predictive modeling, and SHAP analysis assessed stroke risk for each variable.
arXiv Detail & Related papers (2025-03-15T02:50:32Z) - Predicting Fetal Birthweight from High Dimensional Data using Advanced Machine Learning [1.489994236178479]
Birth weight serves as a fundamental indicator of neonatal health, closely linked to medical interventions and long-term developmental risks.<n>Traditional predictive models, often constrained by limited feature selection and incomplete datasets, struggle to achieve overlooking complex maternal and fetal interactions.<n>This research explores machine learning to address these limitations, utilizing a structured methodology that integrates advanced imputation strategies.
arXiv Detail & Related papers (2025-02-20T05:17:39Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Evaluating the Fairness of the MIMIC-IV Dataset and a Baseline
Algorithm: Application to the ICU Length of Stay Prediction [65.268245109828]
This paper uses the MIMIC-IV dataset to examine the fairness and bias in an XGBoost binary classification model predicting the ICU length of stay.
The research reveals class imbalances in the dataset across demographic attributes and employs data preprocessing and feature extraction.
The paper concludes with recommendations for fairness-aware machine learning techniques for mitigating biases and the need for collaborative efforts among healthcare professionals and data scientists.
arXiv Detail & Related papers (2023-12-31T16:01:48Z) - Multimodal Pretraining of Medical Time Series and Notes [45.89025874396911]
Deep learning models show promise in extracting meaningful patterns, but they require extensive labeled data.
We propose a novel approach employing self-supervised pretraining, focusing on the alignment of clinical measurements and notes.
In downstream tasks, including in-hospital mortality prediction and phenotyping, our model outperforms baselines in settings where only a fraction of the data is labeled.
arXiv Detail & Related papers (2023-12-11T21:53:40Z) - Self-Supervised Pretraining Improves Performance and Inference
Efficiency in Multiple Lung Ultrasound Interpretation Tasks [65.23740556896654]
We investigated whether self-supervised pretraining could produce a neural network feature extractor applicable to multiple classification tasks in lung ultrasound analysis.
When fine-tuning on three lung ultrasound tasks, pretrained models resulted in an improvement of the average across-task area under the receiver operating curve (AUC) by 0.032 and 0.061 on local and external test sets respectively.
arXiv Detail & Related papers (2023-09-05T21:36:42Z) - Closing the Gap in High-Risk Pregnancy Care Using Machine Learning and Human-AI Collaboration [8.36613277875556]
High-risk pregnancy is a pregnancy complicated by factors that can adversely affect the outcomes of the mother or the infant.
This work presents the implementation of a real-world ML-based system to assist care managers in identifying pregnant patients at risk of complications.
arXiv Detail & Related papers (2023-05-26T21:08:49Z) - Predicting Adverse Neonatal Outcomes for Preterm Neonates with
Multi-Task Learning [51.487856868285995]
We first analyze the correlations between three adverse neonatal outcomes and then formulate the diagnosis of multiple neonatal outcomes as a multi-task learning (MTL) problem.
In particular, the MTL framework contains shared hidden layers and multiple task-specific branches.
arXiv Detail & Related papers (2023-03-28T00:44:06Z) - Clinical Deterioration Prediction in Brazilian Hospitals Based on
Artificial Neural Networks and Tree Decision Models [56.93322937189087]
An extremely boosted neural network (XBNet) is used to predict clinical deterioration (CD)
The XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
arXiv Detail & Related papers (2022-12-17T23:29:14Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - A Knowledge Distillation Ensemble Framework for Predicting Short and
Long-term Hospitalisation Outcomes from Electronic Health Records Data [5.844828229178025]
Existing outcome prediction models suffer from a low recall of infrequent positive outcomes.
We present a highly-scalable and robust machine learning framework to automatically predict adversity represented by mortality and ICU admission.
arXiv Detail & Related papers (2020-11-18T15:56:28Z) - Gradient Boosting on Decision Trees for Mortality Prediction in
Transcatheter Aortic Valve Implantation [5.050648346154715]
Current prognostic risk scores in cardiac surgery are based on statistics and do not yet benefit from machine learning.
This research aims to create a machine learning model to predict one-year mortality of a patient after TAVI.
arXiv Detail & Related papers (2020-01-08T10:04:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.