A Machine Learning Challenge for Prognostic Modelling in Head and Neck
Cancer Using Multi-modal Data
- URL: http://arxiv.org/abs/2101.11935v1
- Date: Thu, 28 Jan 2021 11:20:34 GMT
- Title: A Machine Learning Challenge for Prognostic Modelling in Head and Neck
Cancer Using Multi-modal Data
- Authors: Michal Kazmierski, Mattea Welch, Sejin Kim, Chris McIntosh, Princess
Margaret Head and Neck Cancer Group, Katrina Rey-McIntyre, Shao Hui Huang,
Tirth Patel, Tony Tadic, Michael Milosevic, Fei-Fei Liu, Andrew Hope, Scott
Bratman and Benjamin Haibe-Kains
- Abstract summary: We have conducted an institutional machine learning challenge to develop an accurate model for overall survival prediction in head and neck cancer.
We compared 12 different submissions using imaging and clinical data, separately or in combination.
The winning approach used non-linear, multitask learning on clinical data and tumour volume, achieving high prognostic accuracy for 2-year and lifetime survival prediction.
- Score: 0.10651507097431492
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate prognosis for an individual patient is a key component of precision
oncology. Recent advances in machine learning have enabled the development of
models using a wider range of data, including imaging. Radiomics aims to
extract quantitative predictive and prognostic biomarkers from routine medical
imaging, but evidence for computed tomography radiomics for prognosis remains
inconclusive. We have conducted an institutional machine learning challenge to
develop an accurate model for overall survival prediction in head and neck
cancer using clinical data etxracted from electronic medical records and
pre-treatment radiological images, as well as to evaluate the true added
benefit of radiomics for head and neck cancer prognosis. Using a large,
retrospective dataset of 2,552 patients and a rigorous evaluation framework, we
compared 12 different submissions using imaging and clinical data, separately
or in combination. The winning approach used non-linear, multitask learning on
clinical data and tumour volume, achieving high prognostic accuracy for 2-year
and lifetime survival prediction and outperforming models relying on clinical
data only, engineered radiomics and deep learning. Combining all submissions in
an ensemble model resulted in improved accuracy, with the highest gain from a
image-based deep learning model. Our results show the potential of machine
learning and simple, informative prognostic factors in combination with large
datasets as a tool to guide personalized cancer care.
Related papers
- Continually Evolved Multimodal Foundation Models for Cancer Prognosis [50.43145292874533]
Cancer prognosis is a critical task that involves predicting patient outcomes and survival rates.
Previous studies have integrated diverse data modalities, such as clinical notes, medical images, and genomic data, leveraging their complementary information.
Existing approaches face two major limitations. First, they struggle to incorporate newly arrived data with varying distributions into training, such as patient records from different hospitals.
Second, most multimodal integration methods rely on simplistic concatenation or task-specific pipelines, which fail to capture the complex interdependencies across modalities.
arXiv Detail & Related papers (2025-01-30T06:49:57Z) - Prediction of Lung Metastasis from Hepatocellular Carcinoma using the SEER Database [0.9055332067000195]
Hepatocellular carcinoma (HCC) is a leading cause of cancer-related mortality.
predictive models for lung metastasis inHCC remain limited in scope and clinical applicability.
We develop and validate an end-to-end machine learning pipeline using data from the Surveillance, Epidemiology, and End Results (SEER) database.
arXiv Detail & Related papers (2025-01-20T20:06:31Z) - Multi-modal Medical Image Fusion For Non-Small Cell Lung Cancer Classification [7.002657345547741]
Non-small cell lung cancer (NSCLC) is a predominant cause of cancer mortality worldwide.
In this paper, we introduce an innovative integration of multi-modal data, synthesizing fused medical imaging (CT and PET scans) with clinical health records and genomic data.
Our research surpasses existing approaches, as evidenced by a substantial enhancement in NSCLC detection and classification precision.
arXiv Detail & Related papers (2024-09-27T12:59:29Z) - MGH Radiology Llama: A Llama 3 70B Model for Radiology [50.42811030970618]
This paper presents an advanced radiology-focused large language model: MGH Radiology Llama.
It is developed using the Llama 3 70B model, building upon previous domain-specific models like Radiology-GPT and Radiology-Llama2.
Our evaluation, incorporating both traditional metrics and a GPT-4-based assessment, highlights the enhanced performance of this work over general-purpose LLMs.
arXiv Detail & Related papers (2024-08-13T01:30:03Z) - Predictive Modeling for Breast Cancer Classification in the Context of Bangladeshi Patients: A Supervised Machine Learning Approach with Explainable AI [0.0]
We evaluate and compare the classification accuracy, precision, recall, and F-1 scores of five different machine learning methods.
XGBoost achieved the best model accuracy, which is 97%.
arXiv Detail & Related papers (2024-04-06T17:23:21Z) - ChatRadio-Valuer: A Chat Large Language Model for Generalizable
Radiology Report Generation Based on Multi-institution and Multi-system Data [115.0747462486285]
ChatRadio-Valuer is a tailored model for automatic radiology report generation that learns generalizable representations.
The clinical dataset utilized in this study encompasses a remarkable total of textbf332,673 observations.
ChatRadio-Valuer consistently outperforms state-of-the-art models, especially ChatGPT (GPT-3.5-Turbo) and GPT-4 et al.
arXiv Detail & Related papers (2023-10-08T17:23:17Z) - Diagnosis and Prognosis of Head and Neck Cancer Patients using
Artificial Intelligence [0.0]
Cancer is one of the most life-threatening diseases worldwide, and head and neck (H&N) cancer is a prevalent type with hundreds of thousands of new cases recorded each year.
Clinicians use medical imaging modalities such as computed tomography and positron emission tomography to detect the presence of a tumor, and they combine that information with clinical data for patient prognosis.
Machine learning and deep learning can automate these tasks to help clinicians with highly promising results.
arXiv Detail & Related papers (2023-05-31T08:22:41Z) - An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using
Multimodal Data [0.0]
We propose a multimodal network that ensembles deep multi-task logistic regression (MTLR), Cox proportional hazard (CoxPH) and CNN models to predict prognostic outcomes for patients with head and neck tumors.
Our proposed ensemble solution achieves a C-index of 0.72 on The HECKTOR test set that saved us the first place in prognosis task of the HECKTOR challenge.
arXiv Detail & Related papers (2022-02-25T07:50:59Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Self-Training with Improved Regularization for Sample-Efficient Chest
X-Ray Classification [80.00316465793702]
We present a deep learning framework that enables robust modeling in challenging scenarios.
Our results show that using 85% lesser labeled data, we can build predictive models that match the performance of classifiers trained in a large-scale data setting.
arXiv Detail & Related papers (2020-05-03T02:36:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.