Predicting microsatellite instability and key biomarkers in colorectal
cancer from H&E-stained images: Achieving SOTA with Less Data using Swin
Transformer
- URL: http://arxiv.org/abs/2208.10495v1
- Date: Mon, 22 Aug 2022 02:32:30 GMT
- Title: Predicting microsatellite instability and key biomarkers in colorectal
cancer from H&E-stained images: Achieving SOTA with Less Data using Swin
Transformer
- Authors: Bangwei Guo, Jitendra Jonnagaddala, Hong Zhang, Xu Steven Xu
- Abstract summary: We developed an efficient workflow for biomarkers in colorectal cancers using Shifted Windows (Swin-T)
Swin-T was extremely efficient using small training datasets and exhibits robust predictive performance with only 200-500 training samples.
These data indicate that Swin-T may be 5-10 times more efficient than the current state-of-the-art algorithms for MSI.
- Score: 3.6695403836792493
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Artificial intelligence (AI) models have been developed for predicting
clinically relevant biomarkers, including microsatellite instability (MSI), for
colorectal cancers (CRC). However, the current deep-learning networks are
data-hungry and require large training datasets, which are often lacking in the
medical domain. In this study, based on the latest Hierarchical Vision
Transformer using Shifted Windows (Swin-T), we developed an efficient workflow
for biomarkers in CRC (MSI, hypermutation, chromosomal instability, CpG island
methylator phenotype, BRAF, and TP53 mutation) that only required relatively
small datasets, but achieved the state-of-the-art (SOTA) predictive
performance. Our Swin-T workflow not only substantially outperformed published
models in an intra-study cross-validation experiment using TCGA-CRC-DX dataset
(N = 462), but also showed excellent generalizability in cross-study external
validation and delivered a SOTA AUROC of 0.90 for MSI using the MCO dataset for
training (N = 1065) and the same TCGA-CRC-DX for testing. Similar performance
(AUROC=0.91) was achieved by Echle and colleagues using 8000 training samples
(ResNet18) on the same testing dataset. Swin-T was extremely efficient using
small training datasets and exhibits robust predictive performance with only
200-500 training samples. These data indicate that Swin-T may be 5-10 times
more efficient than the current state-of-the-art algorithms for MSI based on
ResNet18 and ShuffleNet. Furthermore, the Swin-T models showed promise as
pre-screening tests for MSI status and BRAF mutation status, which could
exclude and reduce the samples before the subsequent standard testing in a
cascading diagnostic workflow to allow turnaround time reduction and cost
saving.
Related papers
- STAL: Spike Threshold Adaptive Learning Encoder for Classification of Pain-Related Biosignal Data [2.0738462952016232]
This paper presents the first application of spiking neural networks (SNNs) for the classification of chronic lower back pain (CLBP) using the EmoPain dataset.
We introduce Spike Threshold Adaptive Learning (STAL), a trainable encoder that effectively converts continuous biosignals into spike trains.
We also propose an ensemble of Spiking Recurrent Neural Network (SRNN) classifiers for the multi-stream processing of sEMG and IMU data.
arXiv Detail & Related papers (2024-07-11T10:15:52Z) - MCDFN: Supply Chain Demand Forecasting via an Explainable Multi-Channel Data Fusion Network Model [0.0]
We introduce the Multi-Channel Data Fusion Network (MCDFN), a hybrid architecture that integrates CNN, Long Short-Term Memory networks (LSTM), and Gated Recurrent Units (GRU)
Our comparative benchmarking demonstrates that MCDFN outperforms seven other deep-learning models.
This research advances demand forecasting methodologies and offers practical guidelines for integrating MCDFN into supply chain systems.
arXiv Detail & Related papers (2024-05-24T14:30:00Z) - Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion [56.38386580040991]
Consistency Trajectory Model (CTM) is a generalization of Consistency Models (CM)
CTM enables the efficient combination of adversarial training and denoising score matching loss to enhance performance.
Unlike CM, CTM's access to the score function can streamline the adoption of established controllable/conditional generation methods.
arXiv Detail & Related papers (2023-10-01T05:07:17Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Convolutional Monge Mapping Normalization for learning on sleep data [63.22081662149488]
We propose a new method called Convolutional Monge Mapping Normalization (CMMN)
CMMN consists in filtering the signals in order to adapt their power spectrum density (PSD) to a Wasserstein barycenter estimated on training data.
Numerical experiments on sleep EEG data show that CMMN leads to significant and consistent performance gains independent from the neural network architecture.
arXiv Detail & Related papers (2023-05-30T08:24:01Z) - DPSeq: A Novel and Efficient Digital Pathology Classifier for Predicting
Cancer Biomarkers using Sequencer Architecture [4.876281217951695]
In digital pathology tasks, transformers have achieved state-of-the-art results, surpassing convolutional neural networks (CNNs)
We developed a novel and efficient digital pathology classifier called DPSeq, to predict cancer biomarkers.
arXiv Detail & Related papers (2023-05-03T08:31:44Z) - Cross-Shaped Windows Transformer with Self-supervised Pretraining for Clinically Significant Prostate Cancer Detection in Bi-parametric MRI [6.930082824262643]
We introduce a novel end-to-end Cross-Shaped windows (CSwin) transformer UNet model, CSwin UNet, to detect clinically significant prostate cancer (csPCa) in prostate bi-parametric MR imaging (bpMRI)
Using a large prostate bpMRI dataset with 1500 patients, we first pretrain CSwin transformer using multi-task self-supervised learning to improve data-efficiency and network generalizability.
Five-fold cross validation shows that self-supervised CSwin UNet achieves 0.888 AUC and 0.545 Average Precision (AP), significantly outperforming four comparable models (Swin U
arXiv Detail & Related papers (2023-04-30T04:40:32Z) - Clinical Deterioration Prediction in Brazilian Hospitals Based on
Artificial Neural Networks and Tree Decision Models [56.93322937189087]
An extremely boosted neural network (XBNet) is used to predict clinical deterioration (CD)
The XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
arXiv Detail & Related papers (2022-12-17T23:29:14Z) - Learning brain MRI quality control: a multi-factorial generalization
problem [0.0]
This work aimed at evaluating the performances of the MRIQC pipeline on various large-scale datasets.
We focused our analysis on the MRIQC preprocessing steps and tested the pipeline with and without them.
We concluded that a model trained with data from a heterogeneous population, such as the CATI dataset, provides the best scores on unseen data.
arXiv Detail & Related papers (2022-05-31T15:46:44Z) - Classification of COVID-19 in CT Scans using Multi-Source Transfer
Learning [91.3755431537592]
We propose the use of Multi-Source Transfer Learning to improve upon traditional Transfer Learning for the classification of COVID-19 from CT scans.
With our multi-source fine-tuning approach, our models outperformed baseline models fine-tuned with ImageNet.
Our best performing model was able to achieve an accuracy of 0.893 and a Recall score of 0.897, outperforming its baseline Recall score by 9.3%.
arXiv Detail & Related papers (2020-09-22T11:53:06Z) - CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.