Related papers: DerMAE: Improving skin lesion classification through conditioned latent diffusion and MAE distillation

DerMAE: Improving skin lesion classification through conditioned latent diffusion and MAE distillation

URL: http://arxiv.org/abs/2602.19848v1
Date: Mon, 23 Feb 2026 13:52:28 GMT
Title: DerMAE: Improving skin lesion classification through conditioned latent diffusion and MAE distillation
Authors: Francisco Filho, Kelvin Cunha, Fábio Papais, Emanoel dos Santos, Rodrigo Mota, Thales Bezerra, Erico Medeiros, Paulo Borba, Tsang Ing Ren,
Abstract summary: We use class-conditioned diffusion models to generate synthetic dermatological images, followed by self-supervised MAE pretraining to enable huge ViT models to learn robust, domain-relevant features.<n>We apply knowledge distillation to transfer these representations to a smaller ViT student suitable for mobile devices.<n>Our results show that MAE pretraining on synthetic data, combined with distillation, improves classification performance while enabling efficient on-device inference for practical clinical use.
Score: 1.485045763113618
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Skin lesion classification datasets often suffer from severe class imbalance, with malignant cases significantly underrepresented, leading to biased decision boundaries during deep learning training. We address this challenge using class-conditioned diffusion models to generate synthetic dermatological images, followed by self-supervised MAE pretraining to enable huge ViT models to learn robust, domain-relevant features. To support deployment in practical clinical settings, where lightweight models are required, we apply knowledge distillation to transfer these representations to a smaller ViT student suitable for mobile devices. Our results show that MAE pretraining on synthetic data, combined with distillation, improves classification performance while enabling efficient on-device inference for practical clinical use.

Related papers

Self-learned representation-guided latent diffusion model for breast cancer classification in deep ultraviolet whole surface images [4.203807616568477]
We propose an Self-Supervised Learning (SSL)-guided Latent Model (LDM) to generate high-quality synthetic training patches.<n>By guiding the LDM with embeddings from a fine-tuned DINO teacher, we inject rich semantic details of cellular structures into the synthetic data.<n> Experiments using 5-fold cross-validation demonstrate that our method achieves 96.47 % accuracy and reduces the FID score to 45.72, significantly outperforming class-conditioned baselines.
arXiv Detail & Related papers (2026-01-16T00:22:22Z)
Toward Accessible Dermatology: Skin Lesion Classification Using Deep Learning Models on Mobile-Acquired Images [0.0]
In this work, we curate a large dataset of over 50 skin disease categories captured with mobile devices.<n>We evaluate multiple convolutional neural networks and Transformer-based architectures.<n>Our results underscore the potential of Transformer-based approaches for mobile-acquired skin lesion classification.
arXiv Detail & Related papers (2025-09-05T04:31:16Z)
MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification [13.350688594462214]
We propose a novel approach explicitly modeling such metadata into a generative Diffusion model framework (MeDi)<n>MeDi allows for a targeted augmentation of underrepresented subpopulations with synthetic data.<n>We experimentally show that MeDi generates high-quality histopathology images for unseen subpopulations in TCGA.
arXiv Detail & Related papers (2025-06-20T16:41:25Z)
Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free [0.7624308578421438]
This work presents the first exploration of the potential of class conditional diffusion models for 2D medical image classification.<n>We develop a novel majority voting scheme shown to improve the performance of medical diffusion classifiers.<n>Experiments on the CheXpert and ISIC Melanoma skin cancer datasets demonstrate that foundation and trained-from-scratch diffusion models achieve competitive performance.
arXiv Detail & Related papers (2025-02-06T00:37:21Z)
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis [55.959002385347645]
Latent Drifting enables diffusion models to be conditioned for medical images fitted for the complex task of counterfactual image generation.<n>We evaluate our method on three public longitudinal benchmark datasets of brain MRI and chest X-rays for counterfactual image generation.
arXiv Detail & Related papers (2024-12-30T01:59:34Z)
LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models [42.922303491557244]
Patient data from real-world clinical practice often suffers from data scarcity and long-tail imbalances. This study addresses these challenges by generating lesion-containing image-segmentation pairs from lesion-free images. LeFusion-generated data significantly improves the performance of state-of-the-art segmentation models.
arXiv Detail & Related papers (2024-03-21T01:25:39Z)
Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN [1.0499611180329804]
We aim to incorporate enhanced data transformation techniques by extending the recent success of few-shot learning. We investigate the impact of incorporating newly generated synthetic data into the training pipeline of state-of-art machine learning models.
arXiv Detail & Related papers (2024-01-10T13:46:03Z)
Latent Alignment with Deep Set EEG Decoders [44.128689862889715]
We introduce the Latent Alignment method that won the Benchmarks for EEG Transfer Learning competition. We present its formulation as a deep set applied on the set of trials from a given subject. Our experimental results show that performing statistical distribution alignment at later stages in a deep learning model is beneficial to the classification accuracy.
arXiv Detail & Related papers (2023-11-29T12:40:45Z)
Uncovering the Hidden Cost of Model Compression [43.62624133952414]
Visual Prompting has emerged as a pivotal method for transfer learning in computer vision. Model compression detrimentally impacts the performance of visual prompting-based transfer. However, negative effects on calibration are not present when models are compressed via quantization.
arXiv Detail & Related papers (2023-08-29T01:47:49Z)
An Adversarial Active Sampling-based Data Augmentation Framework for Manufacturable Chip Design [55.62660894625669]
Lithography modeling is a crucial problem in chip design to ensure a chip design mask is manufacturable. Recent developments in machine learning have provided alternative solutions in replacing the time-consuming lithography simulations with deep neural networks. We propose a litho-aware data augmentation framework to resolve the dilemma of limited data and improve the machine learning model performance.
arXiv Detail & Related papers (2022-10-27T20:53:39Z)
SSD-KD: A Self-supervised Diverse Knowledge Distillation Method for Lightweight Skin Lesion Classification Using Dermoscopic Images [62.60956024215873]
Skin cancer is one of the most common types of malignancy, affecting a large population and causing a heavy economic burden worldwide. Most studies in skin cancer detection keep pursuing high prediction accuracies without considering the limitation of computing resources on portable devices. This study specifically proposes a novel method, termed SSD-KD, that unifies diverse knowledge into a generic KD framework for skin diseases classification.
arXiv Detail & Related papers (2022-03-22T06:54:29Z)
FairIF: Boosting Fairness in Deep Learning via Influence Functions with Validation Set Sensitive Attributes [51.02407217197623]
We propose a two-stage training algorithm named FAIRIF. It minimizes the loss over the reweighted data set where the sample weights are computed. We show that FAIRIF yields models with better fairness-utility trade-offs against various types of bias.
arXiv Detail & Related papers (2022-01-15T05:14:48Z)
Self-Training with Improved Regularization for Sample-Efficient Chest X-Ray Classification [80.00316465793702]
We present a deep learning framework that enables robust modeling in challenging scenarios. Our results show that using 85% lesser labeled data, we can build predictive models that match the performance of classifiers trained in a large-scale data setting.
arXiv Detail & Related papers (2020-05-03T02:36:00Z)

This list is automatically generated from the titles and abstracts of the papers in this site.