Machine Learning-Assisted Recurrence Prediction for Early-Stage
Non-Small-Cell Lung Cancer Patients
- URL: http://arxiv.org/abs/2211.09856v1
- Date: Thu, 17 Nov 2022 19:34:16 GMT
- Title: Machine Learning-Assisted Recurrence Prediction for Early-Stage
Non-Small-Cell Lung Cancer Patients
- Authors: Adrianna Janik, Maria Torrente, Luca Costabello, Virginia Calvo, Brian
Walsh, Carlos Camps, Sameh K. Mohamed, Ana L. Ortega, V\'it Nov\'a\v{c}ek,
Bartomeu Massut\'i, Pasquale Minervini, M.Rosario Garcia Campelo, Edel del
Barco, Joaquim Bosch-Barrera, Ernestina Menasalvas, Mohan Timilsina, Mariano
Provencio
- Abstract summary: Stratifying cancer patients according to risk of relapse can personalize their care.
In this work, we provide an answer to how to utilize machine learning to estimate probability of relapse in early-stage non-small-cell lung cancer patients.
- Score: 10.127130900852405
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Background: Stratifying cancer patients according to risk of relapse can
personalize their care. In this work, we provide an answer to the following
research question: How to utilize machine learning to estimate probability of
relapse in early-stage non-small-cell lung cancer patients?
Methods: For predicting relapse in 1,387 early-stage (I-II), non-small-cell
lung cancer (NSCLC) patients from the Spanish Lung Cancer Group data (65.7
average age, 24.8% females, 75.2% males) we train tabular and graph machine
learning models. We generate automatic explanations for the predictions of such
models. For models trained on tabular data, we adopt SHAP local explanations to
gauge how each patient feature contributes to the predicted outcome. We explain
graph machine learning predictions with an example-based method that highlights
influential past patients. Results: Machine learning models trained on tabular
data exhibit a 76% accuracy for the Random Forest model at predicting relapse
evaluated with a 10-fold cross-validation (model was trained 10 times with
different independent sets of patients in test, train and validation sets, the
reported metrics are averaged over these 10 test sets). Graph machine learning
reaches 68% accuracy over a 200-patient, held-out test set, calibrated on a
held-out set of 100 patients. Conclusions: Our results show that machine
learning models trained on tabular and graph data can enable objective,
personalised and reproducible prediction of relapse and therefore, disease
outcome in patients with early-stage NSCLC. With further prospective and
multisite validation, and additional radiological and molecular data, this
prognostic model could potentially serve as a predictive decision support tool
for deciding the use of adjuvant treatments in early-stage lung cancer.
Keywords: Non-Small-Cell Lung Cancer, Tumor Recurrence Prediction, Machine
Learning
Related papers
- Predicting Lung Cancer Patient Prognosis with Large Language Models [20.97970447748789]
Large language models (LLMs) have gained attention for their ability to process and generate text based on extensive learned knowledge.
We evaluate the potential of GPT-4o mini and GPT-3.5 in predicting the prognosis of lung cancer patients.
arXiv Detail & Related papers (2024-08-15T06:36:27Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Predictive Modeling for Breast Cancer Classification in the Context of Bangladeshi Patients: A Supervised Machine Learning Approach with Explainable AI [0.0]
We evaluate and compare the classification accuracy, precision, recall, and F-1 scores of five different machine learning methods.
XGBoost achieved the best model accuracy, which is 97%.
arXiv Detail & Related papers (2024-04-06T17:23:21Z) - Supervised Machine Learning for Breast Cancer Risk Factors Analysis and
Survival Prediction [0.5249805590164902]
The choice of the most effective treatment may eventually be influenced by breast cancer survival prediction.
In this study, 1904 patient records were utilized to predict a 5-year breast cancer survival using a machine learning approach.
arXiv Detail & Related papers (2023-04-13T12:32:14Z) - Machine Learning Techniques for Predicting the Short-Term Outcome of
Resective Surgery in Lesional-Drug Resistance Epilepsy [1.759008116536278]
Seven dif-ferent categorization algorithms were used to analyze the data.
The support vector machine (SVM) with the linear kernel yielded 76.1% in terms of accuracy.
arXiv Detail & Related papers (2023-02-10T13:04:47Z) - Deep learning methods for drug response prediction in cancer:
predominant and emerging trends [50.281853616905416]
Exploiting computational predictive models to study and treat cancer holds great promise in improving drug development and personalized design of treatment plans.
A wave of recent papers demonstrates promising results in predicting cancer response to drug treatments while utilizing deep learning methods.
This review allows to better understand the current state of the field and identify major challenges and promising solution paths.
arXiv Detail & Related papers (2022-11-18T03:26:31Z) - Breast Cancer Induced Bone Osteolysis Prediction Using Temporal
Variational Auto-Encoders [65.95959936242993]
We develop a deep learning framework that can accurately predict and visualize the progression of osteolytic bone lesions.
It will assist in planning and evaluating treatment strategies to prevent skeletal related events (SREs) in breast cancer patients.
arXiv Detail & Related papers (2022-03-20T21:00:10Z) - A Knowledge Distillation Ensemble Framework for Predicting Short and
Long-term Hospitalisation Outcomes from Electronic Health Records Data [5.844828229178025]
Existing outcome prediction models suffer from a low recall of infrequent positive outcomes.
We present a highly-scalable and robust machine learning framework to automatically predict adversity represented by mortality and ICU admission.
arXiv Detail & Related papers (2020-11-18T15:56:28Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z) - Joint Prediction and Time Estimation of COVID-19 Developing Severe
Symptoms using Chest CT Scan [49.209225484926634]
We propose a joint classification and regression method to determine whether the patient would develop severe symptoms in the later time.
To do this, the proposed method takes into account 1) the weight for each sample to reduce the outliers' influence and explore the problem of imbalance classification.
Our proposed method yields 76.97% of accuracy for predicting the severe cases, 0.524 of the correlation coefficient, and 0.55 days difference for the converted time.
arXiv Detail & Related papers (2020-05-07T12:16:37Z) - Self-Training with Improved Regularization for Sample-Efficient Chest
X-Ray Classification [80.00316465793702]
We present a deep learning framework that enables robust modeling in challenging scenarios.
Our results show that using 85% lesser labeled data, we can build predictive models that match the performance of classifiers trained in a large-scale data setting.
arXiv Detail & Related papers (2020-05-03T02:36:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.