A Study of Deep Learning Colon Cancer Detection in Limited Data Access
Scenarios
- URL: http://arxiv.org/abs/2005.10326v2
- Date: Fri, 22 May 2020 09:03:54 GMT
- Title: A Study of Deep Learning Colon Cancer Detection in Limited Data Access
Scenarios
- Authors: Apostolia Tsirikoglou, Karin Stacke, Gabriel Eilertsen, Martin
Lindvall, Jonas Unger
- Abstract summary: Deep learning methods for classification and detection have shown great potential, but often require large amounts of training data.
For many cancer types, the scarceness of data creates barriers for training DL models.
We show how it is possible to detect cancer metastasis with no or very little lymph node data, opening up for the possibility that existing, annotated histopathology data could generalize to other domains.
- Score: 6.338265282525758
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Digitization of histopathology slides has led to several advances, from easy
data sharing and collaborations to the development of digital diagnostic tools.
Deep learning (DL) methods for classification and detection have shown great
potential, but often require large amounts of training data that are hard to
collect, and annotate. For many cancer types, the scarceness of data creates
barriers for training DL models. One such scenario relates to detecting tumor
metastasis in lymph node tissue, where the low ratio of tumor to non-tumor
cells makes the diagnostic task hard and time-consuming. DL-based tools can
allow faster diagnosis, with potentially increased quality. Unfortunately, due
to the sparsity of tumor cells, annotating this type of data demands a high
level of effort from pathologists. Using weak annotations from slide-level
images have shown great potential, but demand access to a substantial amount of
data as well. In this study, we investigate mitigation strategies for limited
data access scenarios. Particularly, we address whether it is possible to
exploit mutual structure between tissues to develop general techniques, wherein
data from one type of cancer in a particular tissue could have diagnostic value
for other cancers in other tissues. Our case is exemplified by a DL model for
metastatic colon cancer detection in lymph nodes. Could such a model be trained
with little or even no lymph node data? As alternative data sources, we
investigate 1) tumor cells taken from the primary colon tumor tissue, and 2)
cancer data from a different organ (breast), either as is or transformed to the
target domain (colon) using Cycle-GANs. We show that the suggested approaches
make it possible to detect cancer metastasis with no or very little lymph node
data, opening up for the possibility that existing, annotated histopathology
data could generalize to other domains.
Related papers
- MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer [13.74067035373274]
We introduce a multi-modal heterogeneous graph-based conditional feature-guided diffusion model for lymph node metastasis diagnosis based on CT images.
We propose a masked relational representation learning strategy, aiming to uncover the latent prognostic correlations and priorities of primary tumor and lymph node image representations.
arXiv Detail & Related papers (2024-05-15T17:52:00Z) - RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists [14.79279940958727]
We have compiled the TCGA digital pathological dataset with independent labeling of tumor regions and adjacent areas (RCdpia)
This dataset is now publicly accessible at http://39.171.241.18:8888/RCdpia/.
arXiv Detail & Related papers (2024-03-17T13:23:25Z) - Deep Reinforcement Learning Framework for Thoracic Diseases
Classification via Prior Knowledge Guidance [49.87607548975686]
The scarcity of labeled data for related diseases poses a huge challenge to an accurate diagnosis.
We propose a novel deep reinforcement learning framework, which introduces prior knowledge to direct the learning of diagnostic agents.
Our approach's performance was demonstrated using the well-known NIHX-ray 14 and CheXpert datasets.
arXiv Detail & Related papers (2023-06-02T01:46:31Z) - A Pathologist-Informed Workflow for Classification of Prostate Glands in
Histopathology [62.997667081978825]
Pathologists diagnose and grade prostate cancer by examining tissue from needle biopsies on glass slides.
Cancer's severity and risk of metastasis are determined by the Gleason grade, a score based on the organization and morphology of prostate cancer glands.
This paper proposes an automated workflow that follows pathologists' textitmodus operandi, isolating and classifying multi-scale patches of individual glands.
arXiv Detail & Related papers (2022-09-27T14:08:19Z) - Domain adaptation strategies for cancer-independent detection of lymph
node metastases [8.00124399861179]
Large, high-quality public datasets have led to the development of convolutional neural networks that can detect lymph node metastases of breast cancer.
We show how to leverage existing high-quality datasets most efficiently in multi-task settings for closely related tasks.
arXiv Detail & Related papers (2022-07-13T13:41:20Z) - Producing Histopathology Phantom Images using Generative Adversarial
Networks to improve Tumor Detection [0.0]
In this paper, we ascertain that data augmentation using GANs can be a viable solution to reduce the unevenness in the distribution of different cancer types in our dataset.
Our demonstration showed that a dataset augmented to a 50% increase causes an increase in tumor detection from 80% to 87.5%.
arXiv Detail & Related papers (2022-05-21T23:04:20Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - Primary Tumor and Inter-Organ Augmentations for Supervised Lymph Node
Colon Adenocarcinoma Metastasis Detection [8.69535649683089]
scarcity of labeled data is a major bottleneck for developing deep learning-based models for histopathology applications.
This work explores alternatives on how to augment the training data for colon carcinoma metastasis detection when there is limited or no representation of the target domain.
arXiv Detail & Related papers (2021-09-17T17:31:25Z) - Triplet Contrastive Learning for Brain Tumor Classification [99.07846518148494]
We present a novel approach of directly learning deep embeddings for brain tumor types, which can be used for downstream tasks such as classification.
We evaluate our method on an extensive brain tumor dataset which consists of 27 different tumor classes, out of which 13 are defined as rare.
arXiv Detail & Related papers (2021-08-08T11:26:34Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z) - Segmentation for Classification of Screening Pancreatic Neuroendocrine
Tumors [72.65802386845002]
This work presents comprehensive results to detect in the early stage the pancreatic neuroendocrine tumors (PNETs) in abdominal CT scans.
To the best of our knowledge, this task has not been studied before as a computational task.
Our approach outperforms state-of-the-art segmentation networks and achieves a sensitivity of $89.47%$ at a specificity of $81.08%$.
arXiv Detail & Related papers (2020-04-04T21:21:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.