Supervised Machine Learning for Breast Cancer Risk Factors Analysis and
Survival Prediction
- URL: http://arxiv.org/abs/2304.07299v1
- Date: Thu, 13 Apr 2023 12:32:14 GMT
- Title: Supervised Machine Learning for Breast Cancer Risk Factors Analysis and
Survival Prediction
- Authors: Khaoula Chtouki, Maryem Rhanoui, Mounia Mikram, Kamelia Amazian, Siham
Yousfi
- Abstract summary: The choice of the most effective treatment may eventually be influenced by breast cancer survival prediction.
In this study, 1904 patient records were used to predict 5-year breast cancer survival using a machine learning approach.
- Score: 0.5249805590164902
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The choice of the most effective treatment may eventually be influenced by
breast cancer survival prediction. To predict the chances of a patient
surviving, a variety of techniques were employed, such as statistical, machine
learning, and deep learning models. In the current study, 1904 patient records
from the METABRIC dataset were used to predict 5-year breast cancer
survival using a machine learning approach. In this study, we compare the
outcomes of seven classification models to evaluate how well they perform using
the following metrics: recall, AUC, confusion matrix, accuracy, precision,
false positive rate, and true positive rate. The findings demonstrate that the
classifiers for Logistic Regression (LR), Support Vector Machines (SVM),
Decision Tree (DT), Random Forest (RF), Extremely Randomized Trees (ET),
K-Nearest Neighbor (KNN), and Adaptive Boosting (AdaBoost) can accurately
predict the survival rate of the tested samples, which is 75.4%, 74.7%,
71.5%, 75.5%, 70.3%, and 78%.
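The metrics named in the abstract (accuracy, precision, recall, true positive rate, false positive rate, and AUC) can all be derived from a classifier's predictions on a labeled test set. The following is a minimal sketch of those definitions in plain Python, not the authors' code. The label convention (1 = positive class, 0 = negative class), the function names, and the toy data are assumptions made for illustration and do not come from the paper or from METABRIC.

```python
from typing import Dict, Sequence


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, int]:
    """Count TP, FP, TN, FN for binary labels (assumed: 1 = positive class)."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for truth, pred in zip(y_true, y_pred):
        if pred == 1:
            counts["tp" if truth == 1 else "fp"] += 1
        else:
            counts["fn" if truth == 1 else "tn"] += 1
    return counts


def _ratio(numerator: int, denominator: int) -> float:
    """Safe division: return 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Derive the confusion-matrix metrics listed in the abstract."""
    c = confusion_counts(y_true, y_pred)
    tp, fp, tn, fn = c["tp"], c["fp"], c["tn"], c["fn"]
    recall = _ratio(tp, tp + fn)
    return {
        "accuracy": _ratio(tp + tn, tp + fp + tn + fn),
        "precision": _ratio(tp, tp + fp),
        "recall": recall,
        "true_positive_rate": recall,  # TPR is another name for recall
        "false_positive_rate": _ratio(fp, fp + tn),
    }


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs in which the
    positive sample receives the higher score; ties count as half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Hypothetical toy labels, predictions, and scores (not METABRIC data).
    y_true = [1, 1, 1, 0, 0, 0, 1, 0]
    y_pred = [1, 1, 0, 0, 1, 0, 1, 0]
    scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.7, 0.1]
    print(binary_metrics(y_true, y_pred))
    print("AUC:", auc(y_true, scores))
```

The pairwise AUC shown here is quadratic in the number of samples, which is acceptable for a test split drawn from a dataset of this size; a rank-based formulation would be the usual choice for larger data.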
Related papers
- Leveraging Machine Learning and Deep Learning Techniques for Improved Pathological Staging of Prostate Cancer [0.4660328753262075]
This study leverages machine learning and deep learning approaches, along with feature selection and extraction methods, to enhance PCa pathological staging predictions.
Gene expression profiles from 486 tumors were analyzed using advanced algorithms, including Random Forest (RF), Logistic Regression (LR), Extreme Gradient Boosting (XGB), and Support Vector Machine (SVM).
The results reveal that the highest test F1-score, approximately 83%, was achieved by the Random Forest algorithm.
arXiv Detail & Related papers (2025-02-13T14:53:09Z)
- Evaluating Spoken Language as a Biomarker for Automated Screening of Cognitive Impairment [37.40606157690235]
Alterations in speech and language can be early predictors of Alzheimer's disease and related dementias.
We evaluated machine learning techniques for ADRD screening and severity prediction from spoken language.
Risk stratification and linguistic feature importance analysis enhanced the interpretability and clinical utility of predictions.
arXiv Detail & Related papers (2025-01-30T20:17:17Z)
- Tackling Small Sample Survival Analysis via Transfer Learning: A Study of Colorectal Cancer Prognosis [12.786824482430662]
This study deals with small sample survival analysis by leveraging transfer learning.
We propose various transfer learning methods designed for common survival models.
All models trained with data as small as 50 demonstrated even more significant improvement.
arXiv Detail & Related papers (2025-01-21T08:52:57Z)
- Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging [71.91773485443125]
Grading plays a vital role in breast cancer treatment planning.
The current tumor grading method involves extracting tissue from patients, leading to stress, discomfort, and high medical costs.
This paper examines using optimized CDI$s$ to improve breast cancer grade prediction.
arXiv Detail & Related papers (2024-05-13T15:48:26Z)
- Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z)
- Predictive Modeling for Breast Cancer Classification in the Context of Bangladeshi Patients: A Supervised Machine Learning Approach with Explainable AI [0.0]
We evaluate and compare the classification accuracy, precision, recall, and F-1 scores of five different machine learning methods.
XGBoost achieved the best model accuracy, which is 97%.
arXiv Detail & Related papers (2024-04-06T17:23:21Z)
- Artificial Intelligence (AI) Based Prediction of Mortality, for COVID-19 Patients [0.0]
For severely affected COVID-19 patients, it is crucial to identify high-risk patients and predict survival and need for intensive care (ICU).
This study investigated the performances of nine machine and deep learning algorithms in combination with two widely used feature selection methods.
LSTM performed the best in predicting last status and ICU requirement with 90%, 92%, 86% and 95% accuracy, sensitivity, specificity, and AUC respectively.
arXiv Detail & Related papers (2024-03-28T12:11:29Z)
- Machine Learning-Assisted Recurrence Prediction for Early-Stage Non-Small-Cell Lung Cancer Patients [10.127130900852405]
Stratifying cancer patients according to risk of relapse can personalize their care.
In this work, we provide an answer to how to utilize machine learning to estimate probability of relapse in early-stage non-small-cell lung cancer patients.
arXiv Detail & Related papers (2022-11-17T19:34:16Z)
- Bootstrapping Your Own Positive Sample: Contrastive Learning With Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z)
- COVID-19 Prognosis via Self-Supervised Representation Learning and Multi-Image Prediction [32.91440827855392]
We consider the task of predicting two types of patient deterioration based on chest X-rays.
Due to the relative scarcity of COVID-19 patient data, existing solutions leverage supervised pretraining on related non-COVID images.
In this paper, we use self-supervised learning based on the momentum contrast (MoCo) method in the pretraining phase to learn more general image representations to use for downstream tasks.
arXiv Detail & Related papers (2021-01-13T07:03:17Z)
- Joint Prediction and Time Estimation of COVID-19 Developing Severe Symptoms using Chest CT Scan [49.209225484926634]
We propose a joint classification and regression method to determine whether the patient would develop severe symptoms in the later time.
To do this, the proposed method takes into account 1) the weight for each sample to reduce the outliers' influence and explore the problem of imbalance classification.
Our proposed method yields 76.97% of accuracy for predicting the severe cases, 0.524 of the correlation coefficient, and 0.55 days difference for the converted time.
arXiv Detail & Related papers (2020-05-07T12:16:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.