ScoreNet: Learning Non-Uniform Attention and Augmentation for
Transformer-Based Histopathological Image Classification
- URL: http://arxiv.org/abs/2202.07570v1
- Date: Tue, 15 Feb 2022 16:55:09 GMT
- Title: ScoreNet: Learning Non-Uniform Attention and Augmentation for
Transformer-Based Histopathological Image Classification
- Authors: Thomas Stegm\"uller, Antoine Spahr, Behzad Bozorgtabar, Jean-Philippe
Thiran
- Abstract summary: High-resolution images hinder progress in digital pathology.
patch-based processing often incorporates multiple instance learning (MIL) to aggregate local patch-level representations yielding image-level prediction.
This paper proposes a transformer-based architecture specifically tailored for histological image classification.
It combines fine-grained local attention with a coarse global attention mechanism to learn meaningful representations of high-resolution images at an efficient computational cost.
- Score: 11.680355561258427
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Progress in digital pathology is hindered by high-resolution images and the
prohibitive cost of exhaustive localized annotations. The commonly used
paradigm to categorize pathology images is patch-based processing, which often
incorporates multiple instance learning (MIL) to aggregate local patch-level
representations yielding image-level prediction. Nonetheless, diagnostically
relevant regions may only take a small fraction of the whole tissue, and
MIL-based aggregation operation assumes that all patch representations are
independent and thus mislays the contextual information from adjacent cell and
tissue microenvironments. Consequently, the computational resources dedicated
to a specific region are independent of its information contribution. This
paper proposes a transformer-based architecture specifically tailored for
histopathological image classification, which combines fine-grained local
attention with a coarse global attention mechanism to learn meaningful
representations of high-resolution images at an efficient computational cost.
More importantly, based on the observation above, we propose a novel
mixing-based data-augmentation strategy, namely ScoreMix, by leveraging the
distribution of the semantic regions of images during the training and
carefully guiding the data mixing via sampling the locations of discriminative
image content. Thorough experiments and ablation studies on three challenging
representative cohorts of Haematoxylin & Eosin (H&E) tumour regions-of-interest
(TRoIs) datasets have validated the superiority of our approach over existing
state-of-the-art methods and effectiveness of our proposed components, e.g.,
data augmentation in improving classification performance. We also demonstrate
our method's interpretability, robustness, and cross-domain generalization
capability.
Related papers
- Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View [49.03501451546763]
We identify the importance of implicit correspondences across biological contexts for exploiting domain-invariant pathological composition.
We propose self-adaptive dynamic distillation to secure instance-aware trade-offs across different model constituents.
arXiv Detail & Related papers (2024-07-14T04:41:16Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - A2V: A Semi-Supervised Domain Adaptation Framework for Brain Vessel Segmentation via Two-Phase Training Angiography-to-Venography Translation [4.452428104996953]
We present a semi-supervised domain adaptation framework for brain vessel segmentation from different image modalities.
By relying on annotated angiographies and a limited number of annotated venographies, our framework accomplishes image-to-image translation and semantic segmentation.
arXiv Detail & Related papers (2023-09-12T09:12:37Z) - Self-supervised Semantic Segmentation: Consistency over Transformation [3.485615723221064]
We propose a novel self-supervised algorithm, textbfS$3$-Net, which integrates a robust framework based on the proposed Inception Large Kernel Attention (I-LKA) modules.
We leverage deformable convolution as an integral component to effectively capture and delineate lesion deformations for superior object boundary definition.
Our experimental results on skin lesion and lung organ segmentation tasks show the superior performance of our method compared to the SOTA approaches.
arXiv Detail & Related papers (2023-08-31T21:28:46Z) - Improving Mitosis Detection Via UNet-based Adversarial Domain
Homogenizer [1.7298084639157258]
This paper proposes a domain homogenizer for mitosis detection that attempts to alleviate domain differences in histology images via adversarial reconstruction of input images.
We demonstrate our domain homogenizer's effectiveness by observing the reduction in domain differences between the preprocessed images.
Using this homogenizer, along with a subsequent retina-net object detector, we were able to outperform the baselines of the 2021 MIDOG challenge in terms of average precision of the detected mitotic figures.
arXiv Detail & Related papers (2022-09-15T11:15:57Z) - Cross-level Contrastive Learning and Consistency Constraint for
Semi-supervised Medical Image Segmentation [46.678279106837294]
We propose a cross-level constrastive learning scheme to enhance representation capacity for local features in semi-supervised medical image segmentation.
With the help of the cross-level contrastive learning and consistency constraint, the unlabelled data can be effectively explored to improve segmentation performance.
arXiv Detail & Related papers (2022-02-08T15:12:11Z) - Self-Attentive Spatial Adaptive Normalization for Cross-Modality Domain
Adaptation [9.659642285903418]
Cross-modality synthesis of medical images to reduce the costly annotation burden by radiologists.
We present a novel approach for image-to-image translation in medical images, capable of supervised or unsupervised (unpaired image data) setups.
arXiv Detail & Related papers (2021-03-05T16:22:31Z) - Few-shot Medical Image Segmentation using a Global Correlation Network
with Discriminative Embedding [60.89561661441736]
We propose a novel method for few-shot medical image segmentation.
We construct our few-shot image segmentor using a deep convolutional network trained episodically.
We enhance discriminability of deep embedding to encourage clustering of the feature domains of the same class.
arXiv Detail & Related papers (2020-12-10T04:01:07Z) - Unsupervised Instance Segmentation in Microscopy Images via Panoptic
Domain Adaptation and Task Re-weighting [86.33696045574692]
We propose a Cycle Consistency Panoptic Domain Adaptive Mask R-CNN (CyC-PDAM) architecture for unsupervised nuclei segmentation in histopathology images.
We first propose a nuclei inpainting mechanism to remove the auxiliary generated objects in the synthesized images.
Secondly, a semantic branch with a domain discriminator is designed to achieve panoptic-level domain adaptation.
arXiv Detail & Related papers (2020-05-05T11:08:26Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.