Pathologist-like explainable AI for interpretable Gleason grading in prostate cancer
- URL: http://arxiv.org/abs/2410.15012v1
- Date: Sat, 19 Oct 2024 06:58:26 GMT
- Title: Pathologist-like explainable AI for interpretable Gleason grading in prostate cancer
- Authors: Gesa Mittmann, Sara Laiouar-Pedari, Hendrik A. Mehrtens, Sarah Haggenmüller, Tabea-Clara Bucher, Tirtha Chanda, Nadine T. Gaisa, Mathias Wagner, Gilbert Georg Klamminger, Tilman T. Rau, Christina Neppl, Eva Maria Compérat, Andreas Gocht, Monika Hämmerle, Niels J. Rupp, Jula Westhoff, Irene Krücken, Maximillian Seidl, Christian M. Schürch, Marcus Bauer, Wiebke Solass, Yu Chun Tam, Florian Weber, Rainer Grobholz, Jaroslaw Augustyniak, Thomas Kalinski, Christian Hörner, Kirsten D. Mertz, Constanze Döring, Andreas Erbersdobler, Gabriele Deubler, Felix Bremmer, Ulrich Sommer, Michael Brodhun, Jon Griffin, Maria Sarah L. Lenon, Kiril Trpkov, Liang Cheng, Fei Chen, Angelique Levi, Guoping Cai, Tri Q. Nguyen, Ali Amin, Alessia Cimadamore, Ahmed Shabaik, Varsha Manucha, Nazeel Ahmad, Nidia Messias, Francesca Sanguedolce, Diana Taheri, Ezra Baraban, Liwei Jia, Rajal B. Shah, Farshid Siadat, Nicole Swarbrick, Kyung Park, Oudai Hassan, Siamak Sakhaie, Michelle R. Downes, Hiroshi Miyamoto, Sean R. Williamson, Tim Holland-Letz, Carolin V. Schneider, Jakob Nikolas Kather, Yuri Tolkach, Titus J. Brinker,
- Abstract summary: We introduce a novel dataset of 1,015 tissue microarray core images, annotated by an international group of 54 pathologists.
The annotations provide detailed localized pattern descriptions for Gleason grading in line with international guidelines.
We develop an inherently explainable AI system based on a U-Net architecture that provides predictions leveraging pathologists' terminology.
- Score: 3.7226270582597656
- License:
- Abstract: The aggressiveness of prostate cancer, the most common cancer in men worldwide, is primarily assessed based on histopathological data using the Gleason scoring system. While artificial intelligence (AI) has shown promise in accurately predicting Gleason scores, these predictions often lack inherent explainability, potentially leading to distrust in human-machine interactions. To address this issue, we introduce a novel dataset of 1,015 tissue microarray core images, annotated by an international group of 54 pathologists. The annotations provide detailed localized pattern descriptions for Gleason grading in line with international guidelines. Utilizing this dataset, we develop an inherently explainable AI system based on a U-Net architecture that provides predictions leveraging pathologists' terminology. This approach circumvents post-hoc explainability methods while maintaining or exceeding the performance of methods trained directly for Gleason pattern segmentation (Dice score: 0.713 $\pm$ 0.003 trained on explanations vs. 0.691 $\pm$ 0.010 trained on Gleason patterns). By employing soft labels during training, we capture the intrinsic uncertainty in the data, yielding strong results in Gleason pattern segmentation even in the context of high interobserver variability. With the release of this dataset, we aim to encourage further research into segmentation in medical tasks with high levels of subjectivity and to advance the understanding of pathologists' reasoning processes.
Related papers
- Poisson Ordinal Network for Gleason Group Estimation Using Bi-Parametric MRI [15.754944195515504]
Gleason groups serve as the primary histological grading system for prostate cancer.
In clinical practice, pathologists determine the Gleason groups based on specimens obtained from ultrasound-guided biopsies.
We investigate the feasibility of directly estimating the Gleason groups from MRI scans to reduce otherwise required biopsies.
arXiv Detail & Related papers (2024-07-08T09:56:30Z) - Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer [0.0]
This study explores the potential of 11 deep neural network architectures for automated Gleason grading in prostate carcinoma.
A standardized image classification pipeline, based on the AUCMEDI framework, facilitated robust evaluation.
Newer architectures achieved superior performance, even though with challenges in differentiating closely related Gleason grades.
arXiv Detail & Related papers (2024-03-25T12:26:32Z) - Semantic Latent Space Regression of Diffusion Autoencoders for Vertebral
Fracture Grading [72.45699658852304]
This paper proposes a novel approach to train a generative Diffusion Autoencoder model as an unsupervised feature extractor.
We model fracture grading as a continuous regression, which is more reflective of the smooth progression of fractures.
Importantly, the generative nature of our method allows us to visualize different grades of a given vertebra, providing interpretability and insight into the features that contribute to automated grading.
arXiv Detail & Related papers (2023-03-21T17:16:01Z) - Learning to diagnose cirrhosis from radiological and histological labels
with joint self and weakly-supervised pretraining strategies [62.840338941861134]
We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
arXiv Detail & Related papers (2023-02-16T17:06:23Z) - Informing clinical assessment by contextualizing post-hoc explanations
of risk prediction models in type-2 diabetes [50.8044927215346]
We consider a comorbidity risk prediction scenario and focus on contexts regarding the patients clinical state.
We employ several state-of-the-art LLMs to present contexts around risk prediction model inferences and evaluate their acceptability.
Our paper is one of the first end-to-end analyses identifying the feasibility and benefits of contextual explanations in a real-world clinical use case.
arXiv Detail & Related papers (2023-02-11T18:07:11Z) - WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic
Segmentation for Lung Adenocarcinoma [51.50991881342181]
This challenge includes 10,091 patch-level annotations and over 130 million labeled pixels.
First place team achieved mIoU of 0.8413 (tumor: 0.8389, stroma: 0.7931, normal: 0.8919)
arXiv Detail & Related papers (2022-04-13T15:27:05Z) - Assessing glaucoma in retinal fundus photographs using Deep Feature
Consistent Variational Autoencoders [63.391402501241195]
glaucoma is challenging to detect since it remains asymptomatic until the symptoms are severe.
Early identification of glaucoma is generally made based on functional, structural, and clinical assessments.
Deep learning methods have partially solved this dilemma by bypassing the marker identification stage and analyzing high-level information directly to classify the data.
arXiv Detail & Related papers (2021-10-04T16:06:49Z) - Going Deeper through the Gleason Scoring Scale: An Automatic end-to-end
System for Histology Prostate Grading and Cribriform Pattern Detection [7.929433631399375]
The objective of this work is to develop a deep-learning-based system able to support pathologists in the daily analysis of prostate biopsies.
The methodological core of this work is a patch-wise predictive model based on convolutional neural networks able to determine the presence of cancerous patterns.
arXiv Detail & Related papers (2021-05-21T17:51:53Z) - WeGleNet: A Weakly-Supervised Convolutional Neural Network for the
Semantic Segmentation of Gleason Grades in Prostate Histology Images [1.52819437883813]
We propose a deep-learning-based system able to detect local cancerous patterns in the prostate tissue using only the global-level Gleason score during training.
We obtained a Cohen's quadratic kappa (k) of 0.67 for the pixel-level prediction of cancerous patterns in the validation cohort.
We compared the model performance for semantic segmentation of Gleason grades with supervised state-of-the-art architectures in the test cohort.
arXiv Detail & Related papers (2021-05-21T16:27:16Z) - Self-learning for weakly supervised Gleason grading of local patterns [6.97280833203187]
We propose a weakly-supervised deep-learning model, based on self-learning CNNs, to accurately perform both, grading of patch-level patterns and biopsy-level scoring.
We empirically demonstrate that our approach outperforms its supervised counterpart on patch-level Gleason grading by a large margin.
arXiv Detail & Related papers (2021-05-21T15:39:50Z) - Gleason Grading of Histology Prostate Images through Semantic
Segmentation via Residual U-Net [60.145440290349796]
The final diagnosis of prostate cancer is based on the visual detection of Gleason patterns in prostate biopsy by pathologists.
Computer-aided-diagnosis systems allow to delineate and classify the cancerous patterns in the tissue.
The methodological core of this work is a U-Net convolutional neural network for image segmentation modified with residual blocks able to segment cancerous tissue.
arXiv Detail & Related papers (2020-05-22T19:49:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.