Fully transformer-based biomarker prediction from colorectal cancer
histology: a large-scale multicentric study
- URL: http://arxiv.org/abs/2301.09617v1
- Date: Mon, 23 Jan 2023 18:33:38 GMT
- Title: Fully transformer-based biomarker prediction from colorectal cancer
histology: a large-scale multicentric study
- Authors: Sophia J. Wagner, Daniel Reisenb\"uchler, Nicholas P. West, Jan Moritz
Niehues, Gregory Patrick Veldhuizen, Philip Quirke, Heike I. Grabsch, Piet A.
van den Brandt, Gordon G. A. Hutchins, Susan D. Richman, Tanwei Yuan, Rupert
Langer, Josien Christina Anna Jenniskens, Kelly Offermans, Wolfram Mueller,
Richard Gray, Stephen B. Gruber, Joel K. Greenson, Gad Rennert, Joseph D.
Bonner, Daniel Schmolze, Jacqueline A. James, Maurice B. Loughrey, Manuel
Salto-Tellez, Hermann Brenner, Michael Hoffmeister, Daniel Truhn, Julia A.
Schnabel, Melanie Boxberg, Tingying Peng, Jakob Nikolas Kather
- Abstract summary: Deep learning can extract predictive and prognostic biomarkers from routine pathology slides in colorectal cancer.
Transformer networks are outperforming CNNs and are replacing them in many applications, but have not been used for biomarker prediction in cancer at a large scale.
In this study, we developed a new fully transformer-based pipeline for end-to-end biomarker prediction from pathology slides.
- Score: 1.0424274317527076
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Background: Deep learning (DL) can extract predictive and prognostic
biomarkers from routine pathology slides in colorectal cancer. For example, a
DL test for the diagnosis of microsatellite instability (MSI) in CRC has been
approved in 2022. Current approaches rely on convolutional neural networks
(CNNs). Transformer networks are outperforming CNNs and are replacing them in
many applications, but have not been used for biomarker prediction in cancer at
a large scale. In addition, most DL approaches have been trained on small
patient cohorts, which limits their clinical utility. Methods: In this study,
we developed a new fully transformer-based pipeline for end-to-end biomarker
prediction from pathology slides. We combine a pre-trained transformer encoder
and a transformer network for patch aggregation, capable of yielding single and
multi-target prediction at patient level. We train our pipeline on over 9,000
patients from 10 colorectal cancer cohorts. Results: A fully transformer-based
approach massively improves the performance, generalizability, data efficiency,
and interpretability as compared with current state-of-the-art algorithms.
After training on a large multicenter cohort, we achieve a sensitivity of 0.97
with a negative predictive value of 0.99 for MSI prediction on surgical
resection specimens. We demonstrate for the first time that resection
specimen-only training reaches clinical-grade performance on endoscopic biopsy
tissue, solving a long-standing diagnostic problem. Interpretation: A fully
transformer-based end-to-end pipeline trained on thousands of pathology slides
yields clinical-grade performance for biomarker prediction on surgical
resections and biopsies. Our new methods are freely available under an open
source license.
Related papers
- Biomarker based Cancer Classification using an Ensemble with Pre-trained Models [2.2436844508175224]
We propose a novel ensemble model combining pre-trained Hyperfast model, XGBoost, and LightGBM for multi-class classification tasks.
We leverage a meta-trained Hyperfast model for classifying cancer, accomplishing the highest AUC of 0.9929.
We also propose a novel ensemble model combining pre-trained Hyperfast model, XGBoost, and LightGBM for multi-class classification tasks, achieving an incremental increase in accuracy (0.9464)
arXiv Detail & Related papers (2024-06-14T14:43:59Z) - Cancer-Net PCa-Gen: Synthesis of Realistic Prostate Diffusion Weighted
Imaging Data via Anatomic-Conditional Controlled Latent Diffusion [68.45407109385306]
In Canada, prostate cancer is the most common form of cancer in men and accounted for 20% of new cancer cases for this demographic in 2022.
There has been significant interest in the development of deep neural networks for prostate cancer diagnosis, prognosis, and treatment planning using diffusion weighted imaging (DWI) data.
In this study, we explore the efficacy of latent diffusion for generating realistic prostate DWI data through the introduction of an anatomic-conditional controlled latent diffusion strategy.
arXiv Detail & Related papers (2023-11-30T15:11:03Z) - Semi-supervised ViT knowledge distillation network with style transfer
normalization for colorectal liver metastases survival prediction [1.283897253352624]
We propose an end-to-end approach for automated prognosis prediction using histology slides stained with H&E and HPS.
We first employ a Generative Adversarial Network (GAN) for slide normalization to reduce staining variations and improve the overall quality of the images that are used as input to our prediction pipeline.
We exploit the extracted features for the metastatic nodules and surrounding tissue to train a prognosis model. In parallel, we train a vision Transformer (ViT) in a knowledge distillation framework to replicate and enhance the performance of the prognosis prediction.
arXiv Detail & Related papers (2023-11-17T03:32:11Z) - Cross-Shaped Windows Transformer with Self-supervised Pretraining for Clinically Significant Prostate Cancer Detection in Bi-parametric MRI [6.930082824262643]
We introduce a novel end-to-end Cross-Shaped windows (CSwin) transformer UNet model, CSwin UNet, to detect clinically significant prostate cancer (csPCa) in prostate bi-parametric MR imaging (bpMRI)
Using a large prostate bpMRI dataset with 1500 patients, we first pretrain CSwin transformer using multi-task self-supervised learning to improve data-efficiency and network generalizability.
Five-fold cross validation shows that self-supervised CSwin UNet achieves 0.888 AUC and 0.545 Average Precision (AP), significantly outperforming four comparable models (Swin U
arXiv Detail & Related papers (2023-04-30T04:40:32Z) - TMSS: An End-to-End Transformer-based Multimodal Network for
Segmentation and Survival Prediction [0.0]
oncologists do not do this in their analysis but rather fuse the information in their brain from multiple sources such as medical images and patient history.
This work proposes a deep learning method that mimics oncologists' analytical behavior when quantifying cancer and estimating patient survival.
arXiv Detail & Related papers (2022-09-12T06:22:05Z) - EGFR Mutation Prediction of Lung Biopsy Images using Deep Learning [1.793983482813105]
In this work, we used customized deep learning pipelines with weak supervision to identify the morphological correlates of EGFR mutation.
With our pipeline, we achieved an average area under the curve (AUC) of 0.964 for tumor detection, and 0.942 for histological subtyping between adenocarcinoma and squamous cell carcinoma.
For EGFR detection, we achieved an average AUC of 0.864 on the TCGA dataset and 0.783 on the dataset from India.
arXiv Detail & Related papers (2022-08-26T08:56:33Z) - Multi-Scale Hybrid Vision Transformer for Learning Gastric Histology:
AI-Based Decision Support System for Gastric Cancer Treatment [50.89811515036067]
Gastric endoscopic screening is an effective way to decide appropriate gastric cancer (GC) treatment at an early stage, reducing GC-associated mortality rate.
We propose a practical AI system that enables five subclassifications of GC pathology, which can be directly matched to general GC treatment guidance.
arXiv Detail & Related papers (2022-02-17T08:33:52Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Medulloblastoma Tumor Classification using Deep Transfer Learning with
Multi-Scale EfficientNets [63.62764375279861]
We propose an end-to-end MB tumor classification and explore transfer learning with various input sizes and matching network dimensions.
Using a data set with 161 cases, we demonstrate that pre-trained EfficientNets with larger input resolutions lead to significant performance improvements.
arXiv Detail & Related papers (2021-09-10T13:07:11Z) - Controlling False Positive/Negative Rates for Deep-Learning-Based
Prostate Cancer Detection on Multiparametric MR images [58.85481248101611]
We propose a novel PCa detection network that incorporates a lesion-level cost-sensitive loss and an additional slice-level loss based on a lesion-to-slice mapping function.
Our experiments based on 290 clinical patients concludes that 1) The lesion-level FNR was effectively reduced from 0.19 to 0.10 and the lesion-level FPR was reduced from 1.03 to 0.66 by changing the lesion-level cost.
arXiv Detail & Related papers (2021-06-04T09:51:27Z) - Wide & Deep neural network model for patch aggregation in CNN-based
prostate cancer detection systems [51.19354417900591]
Prostate cancer (PCa) is one of the leading causes of death among men, with almost 1.41 million new cases and around 375,000 deaths in 2020.
To perform an automatic diagnosis, prostate tissue samples are first digitized into gigapixel-resolution whole-slide images.
Small subimages called patches are extracted and predicted, obtaining a patch-level classification.
arXiv Detail & Related papers (2021-05-20T18:13:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.