Self-distillation Augmented Masked Autoencoders for Histopathological
Image Classification
- URL: http://arxiv.org/abs/2203.16983v4
- Date: Mon, 29 May 2023 02:14:39 GMT
- Title: Self-distillation Augmented Masked Autoencoders for Histopathological
Image Classification
- Authors: Yang Luo, Zhineng Chen, Shengtian Zhou, Xieping Gao
- Abstract summary: Masked autoencoders (MAE) building self-supervised learning (SSL) from a generative paradigm is probably a more appropriate pre-training.
A novel SD-MAE model is proposed to enable a self-distillation augmented MAE.
Experiments demonstrate that SD-MAE shows highly competitive performance when compared with other SSL methods.
- Score: 11.573165017470867
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Self-supervised learning (SSL) has drawn increasing attention in
histopathological image analysis in recent years. Compared to contrastive
learning which is troubled with the false negative problem, i.e., semantically
similar images are selected as negative samples, masked autoencoders (MAE)
building SSL from a generative paradigm is probably a more appropriate
pre-training. In this paper, we introduce MAE and verify the effect of visible
patches for histopathological image understanding. Moreover, a novel SD-MAE
model is proposed to enable a self-distillation augmented MAE. Besides the
reconstruction loss on masked image patches, SD-MAE further imposes the
self-distillation loss on visible patches to enhance the representational
capacity of the encoder located shallow layer. We apply SD-MAE to
histopathological image classification, cell segmentation and object detection.
Experiments demonstrate that SD-MAE shows highly competitive performance when
compared with other SSL methods in these tasks.
Related papers
- PathSegDiff: Pathology Segmentation using Diffusion model representations [63.20694440934692]
We propose PathSegDiff, a novel approach for histopathology image segmentation that leverages Latent Diffusion Models (LDMs) as pre-trained featured extractors.
Our method utilizes a pathology-specific LDM, guided by a self-supervised encoder, to extract rich semantic information from H&E stained histopathology images.
Our experiments demonstrate significant improvements over traditional methods on the BCSS and GlaS datasets.
arXiv Detail & Related papers (2025-04-09T14:58:21Z) - MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction [2.0199924721373392]
Masked image modeling (MIM) has emerged as a more potent SSL technique.
This research paper introduces a scalable and practical SSL approach centered around more challenging pretext tasks.
Our hypothesis posits that reconstructing high-resolution images enables the model to attend to finer spatial details.
arXiv Detail & Related papers (2025-03-10T10:32:55Z) - Dynamic Entity-Masked Graph Diffusion Model for histopathological image Representation Learning [25.197342542821843]
We introduce H-MGDM, a novel self-supervised Histopathology image representation learning method through the Dynamic Entity-Masked Graph Diffusion Model.
Specifically, we propose to use complementary subgraphs as latent diffusion conditions and self-supervised targets respectively during pre-training.
arXiv Detail & Related papers (2024-12-13T10:18:36Z) - Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels [8.934328206473456]
We propose a novel distillation framework for histopathology image segmentation.
This framework introduces a iterative fusion-knowledge distillation strategy, enabling the student model to learn directly from the teacher's comprehensive outcomes.
arXiv Detail & Related papers (2024-07-14T17:15:47Z) - Features Fusion for Dual-View Mammography Mass Detection [1.5146068448101746]
We propose a new model called MAMM-Net, which allows the processing of both mammography views simultaneously.
Our experiments show superior performance on the publicM dataset compared to the previous state-of-the-art model.
arXiv Detail & Related papers (2024-04-25T16:30:30Z) - Retinal OCT Synthesis with Denoising Diffusion Probabilistic Models for
Layer Segmentation [2.4113205575263708]
We propose an image synthesis method that utilizes denoising diffusion probabilistic models (DDPMs) to automatically generate retinal optical coherence tomography ( OCT) images.
We observe a consistent improvement in layer segmentation accuracy, which is validated using various neural networks.
These findings demonstrate the promising potential of DDPMs in reducing the need for manual annotations of retinal OCT images.
arXiv Detail & Related papers (2023-11-09T16:09:24Z) - Understanding Masked Autoencoders From a Local Contrastive Perspective [80.57196495601826]
Masked AutoEncoder (MAE) has revolutionized the field of self-supervised learning with its simple yet effective masking and reconstruction strategies.
We introduce a new empirical framework, called Local Contrastive MAE, to analyze both reconstructive and contrastive aspects of MAE.
arXiv Detail & Related papers (2023-10-03T12:08:15Z) - Improving Masked Autoencoders by Learning Where to Mask [65.89510231743692]
Masked image modeling is a promising self-supervised learning method for visual data.
We present AutoMAE, a framework that uses Gumbel-Softmax to interlink an adversarially-trained mask generator and a mask-guided image modeling process.
In our experiments, AutoMAE is shown to provide effective pretraining models on standard self-supervised benchmarks and downstream tasks.
arXiv Detail & Related papers (2023-03-12T05:28:55Z) - Exploring The Role of Mean Teachers in Self-supervised Masked
Auto-Encoders [64.03000385267339]
Masked image modeling (MIM) has become a popular strategy for self-supervised learning(SSL) of visual representations with Vision Transformers.
We present a simple SSL method, the Reconstruction-Consistent Masked Auto-Encoder (RC-MAE) by adding an EMA teacher to MAE.
RC-MAE converges faster and requires less memory usage than state-of-the-art self-distillation methods during pre-training.
arXiv Detail & Related papers (2022-10-05T08:08:55Z) - Attentive Symmetric Autoencoder for Brain MRI Segmentation [56.02577247523737]
We propose a novel Attentive Symmetric Auto-encoder based on Vision Transformer (ViT) for 3D brain MRI segmentation tasks.
In the pre-training stage, the proposed auto-encoder pays more attention to reconstruct the informative patches according to the gradient metrics.
Experimental results show that our proposed attentive symmetric auto-encoder outperforms the state-of-the-art self-supervised learning methods and medical image segmentation models.
arXiv Detail & Related papers (2022-09-19T09:43:19Z) - Multi-modal Masked Autoencoders Learn Compositional Histopathological
Representations [3.2780506066663655]
Masked Autoencoders (MAE) is a recent SSL method suitable for digital pathology.
We introduce a multi-modal MAE (MMAE) that leverages the specific compositionality of Hematoxylin & Eosin stained WSIs.
Results show that the MMAE architecture outperforms supervised baselines and other state-of-the-art SSL techniques for an eight-class tissue phenotyping task.
arXiv Detail & Related papers (2022-09-04T05:25:31Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z) - Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and
Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights.
It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness.
In recent years, there has been a significant effort to automate the diagnosis using deep learning.
This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN)
Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.