ConvNeXt with Histopathology-Specific Augmentations for Mitotic Figure Classification
- URL: http://arxiv.org/abs/2509.02595v1
- Date: Fri, 29 Aug 2025 13:18:32 GMT
- Authors: Hana Feki, Alice Blondel, Thomas Walter
- Abstract summary: We propose a solution based on the lightweight ConvNeXt architecture to maximize domain coverage. On the preliminary leaderboard, our model achieved a balanced accuracy of 0.8961, ranking among the top entries.
- Score: 1.398256265458105
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate mitotic figure classification is crucial in computational pathology, as mitotic activity informs cancer grading and patient prognosis. Distinguishing atypical mitotic figures (AMFs), which indicate higher tumor aggressiveness, from normal mitotic figures (NMFs) remains challenging due to subtle morphological differences and high intra-class variability. This task is further complicated by domain shifts, including variations in organ, tissue type, and scanner, as well as limited annotations and severe class imbalance. To address these challenges in Track 2 of the MIDOG 2025 Challenge, we propose a solution based on the lightweight ConvNeXt architecture, trained on all available datasets (AMi-Br, AtNorM-Br, AtNorM-MD, and OMG-Octo) to maximize domain coverage. Robustness is enhanced through a histopathology-specific augmentation pipeline, including elastic and stain-specific transformations, and balanced sampling to mitigate class imbalance. A grouped 5-fold cross-validation strategy ensures reliable evaluation. On the preliminary leaderboard, our model achieved a balanced accuracy of 0.8961, ranking among the top entries. These results highlight that broad domain exposure combined with targeted augmentation strategies is key to building accurate and generalizable mitotic figure classifiers.
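The abstract names three evaluation and training ingredients: balanced accuracy, balanced sampling, and grouped 5-fold cross-validation. The following is a minimal, dependency-free sketch of what each could look like, not the authors' implementation. The function names, the inverse-frequency weighting scheme, the greedy fold assignment, and the choice of grouping key (for example a slide or case ID) are all assumptions made for illustration; the abstract does not specify them.

```python
from collections import Counter


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall; for two classes this equals
    (sensitivity + specificity) / 2."""
    total = Counter()
    correct = Counter()
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


def balanced_sample_weights(labels):
    """Inverse-frequency weight per sample (assumed scheme), so that each
    class carries the same total sampling weight."""
    counts = Counter(labels)
    return [1.0 / counts[y] for y in labels]


def grouped_folds(groups, n_folds=5):
    """Assign each group (hypothetical key, e.g. a slide or case ID) to
    exactly one fold, so no group is split between train and validation.
    Greedy heuristic: place the largest groups first into the emptiest fold."""
    sizes = Counter(groups)
    fold_sizes = [0] * n_folds
    fold_of_group = {}
    for g, size in sorted(sizes.items(), key=lambda kv: (-kv[1], str(kv[0]))):
        k = fold_sizes.index(min(fold_sizes))
        fold_of_group[g] = k
        fold_sizes[k] += size
    return [fold_of_group[g] for g in groups]
```

With two atypical and eight normal samples, a classifier that finds only one of the two atypical figures scores 90% plain accuracy but a balanced accuracy of 0.75, which is why the metric suits the class imbalance described in the abstract.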
Related papers
- A Contrastive Variational AutoEncoder for NSCLC Survival Prediction with Missing Modalities [41.8469011437549]
Predicting survival outcomes for non-small cell lung cancer (NSCLC) patients is challenging due to the different individual prognostic features. State-of-the-art models rely on available data to create patient-level representations or use generative models to infer missing modalities. We propose a Multimodal Contrastive Variational AutoEncoder (MCVAE) to address this issue.
arXiv Detail & Related papers (2026-02-19T14:29:34Z)
- XtraLight-MedMamba for Classification of Neoplastic Tubular Adenomas [1.9322492427205764]
XtraLight-MedMamba is an ultra-lightweight state-space-based deep learning framework for classifying tubular adenomas. The model was evaluated on a curated dataset acquired from patients with low-grade tubular adenomas. XtraLight-MedMamba achieved an accuracy of 97.18% and an F1-score of 0.9767 using approximately 32,000 parameters.
arXiv Detail & Related papers (2026-02-04T18:07:51Z)
- A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis [82.01597026329158]
We introduce a Correlation-Regulated Alignment Framework for Tissue Synthesis (CRAFTS) for pathology-specific text-to-image synthesis. CRAFTS incorporates a novel alignment mechanism that suppresses semantic drift to ensure biological accuracy. This model generates diverse pathological images spanning 30 cancer types, with quality rigorously validated by objective metrics and pathologist evaluations.
arXiv Detail & Related papers (2025-12-15T10:22:43Z)
- Robust Atypical Mitosis Classification with DenseNet121: Stain-Aware Augmentation and Hybrid Loss for Domain Generalization [16.048949963804123]
We present a DenseNet-121-based framework tailored for atypical mitosis classification in the MIDOG 2025 (Track 2) setting. Our method integrates stain-aware augmentation (Macenko), geometric and intensity transformations, and imbalance-aware learning via weighted sampling with a hybrid objective combining class-weighted binary cross-entropy and focal loss. Trained end-to-end with AdamW and evaluated across multiple independent domains, the model demonstrates strong generalization under scanner and staining shifts, achieving balanced accuracy 85.0%, AUROC 0.927, sensitivity 89.2%, and specificity 80.9% on the official test set.
arXiv Detail & Related papers (2025-10-26T11:24:55Z)
- REN: Anatomically-Informed Mixture-of-Experts for Interstitial Lung Disease Diagnosis [32.83724094606554]
We introduce Regional Expert Networks (REN), the first anatomically-informed MoE framework tailored specifically for medical image classification. REN leverages anatomical priors to train seven specialized experts, each dedicated to distinct lung lobes and bilateral lung combinations. Through rigorous patient-level cross-validation, REN demonstrates strong generalizability and clinical interpretability.
arXiv Detail & Related papers (2025-10-06T15:35:08Z)
- Single Domain Generalization in Diabetic Retinopathy: A Neuro-Symbolic Learning Approach [0.5581472054346949]
KG-DG is a neuro-symbolic framework for diabetic retinopathy (DR) classification. It integrates vision transformers with expert-guided symbolic reasoning to enable generalization across unseen domains.
arXiv Detail & Related papers (2025-09-03T01:07:49Z)
- Adaptive Learning Strategies for Mitotic Figure Classification in MIDOG2025 Challenge [7.3323821474776]
We investigated three variants of adapting the pathology foundation model UNI2-h for the MIDOG2025 Track 2 challenge. We found that visual prompt tuning (VPT) substantially improved generalization, and that further integrating test-time augmentation (TTA) with Vahadane and Macenko stain normalization provided the best robustness.
arXiv Detail & Related papers (2025-09-01T22:42:53Z)
- MIDOG 2025 Track 2: A Deep Learning Model for Classification of Atypical and Normal Mitotic Figures under Class and Hardness Imbalances [0.0]
We propose a novel deep learning approach based on a ResNet backbone with specialized classification heads. Our architecture uniquely models both the mitotic figure phenotype and the instance difficulty simultaneously.
arXiv Detail & Related papers (2025-09-01T15:12:05Z)
- Ensemble of Pathology Foundation Models for MIDOG 2025 Track 2: Atypical Mitosis Classification [0.0]
We leveraged Pathology Foundation Models (PFMs) pre-trained on large histopathology datasets. We incorporated ConvNeXt V2, a state-of-the-art convolutional neural network architecture, to complement PFMs. We ensembled multiple PFMs to integrate complementary morphological insights, achieving balanced accuracy on the Preliminary Evaluation Phase dataset.
arXiv Detail & Related papers (2025-08-29T03:24:57Z)
- Automated Classification of Normal and Atypical Mitotic Figures Using ConvNeXt V2: MIDOG 2025 Track 2 [0.026042848991788176]
This paper presents our solution for the MIDOG 2025 Challenge Track 2, which focuses on binary classification of normal mitotic figures (NMFs) versus atypical mitotic figures (AMFs). Our approach leverages a ConvNeXt V2 base model with center-cropping preprocessing and a 5-fold cross-validation ensemble strategy. The solution demonstrates the effectiveness of modern convolutional architectures for mitotic figure subtyping while maintaining computational efficiency.
arXiv Detail & Related papers (2025-08-26T09:11:12Z)
- HistoViT: Vision Transformer for Accurate and Scalable Histopathological Cancer Diagnosis [1.5939351525664014]
We propose a transformer-based deep learning framework for multi-class tumor classification. Our method addresses key limitations of conventional convolutional neural networks. Our approach achieves classification accuracies of 99.32%, 96.92%, 95.28%, and 96.94% for breast, prostate, bone, and cervical cancers, respectively.
arXiv Detail & Related papers (2025-08-15T03:10:52Z)
- CSASN: A Multitask Attention-Based Framework for Heterogeneous Thyroid Carcinoma Classification in Ultrasound Images [4.577163442985675]
Heterogeneous morphological features and data imbalance pose significant challenges in rare thyroid carcinoma classification using ultrasound imaging. We propose a novel multitask learning framework, Channel-Spatial Attention Synergy Network (CSASN), which integrates a dual-branch feature extractor.
arXiv Detail & Related papers (2025-05-04T18:23:03Z)
- Interpretable Graph Kolmogorov-Arnold Networks for Multi-Cancer Classification and Biomarker Identification using Multi-Omics Data [36.92842246372894]
Multi-Omics Graph Kolmogorov-Arnold Network (MOGKAN) is a deep learning framework that utilizes messenger-RNA, micro-RNA sequences, and DNA methylation samples. By integrating multi-omics data with graph-based deep learning, our proposed approach demonstrates robust predictive performance and interpretability.
arXiv Detail & Related papers (2025-03-29T02:14:05Z)
- FundusGAN: A Hierarchical Feature-Aware Generative Framework for High-Fidelity Fundus Image Generation [35.46876389599076]
FundusGAN is a novel hierarchical feature-aware generative framework specifically designed for high-fidelity fundus image synthesis. We show that FundusGAN consistently outperforms state-of-the-art methods across multiple metrics.
arXiv Detail & Related papers (2025-03-22T18:08:07Z)
- Diffusion Models with Ensembled Structure-Based Anomaly Scoring for Unsupervised Anomaly Detection [35.46541584018842]
Unsupervised anomaly detection (UAD) emerges as a viable alternative for pathology segmentation.
Recent UAD anomaly scoring functions often focus on intensity only and neglect structural differences, which impedes the segmentation performance.
Structural Similarity (SSIM) captures both intensity and structural disparities and can be advantageous over the classical $\ell_1$ error.
arXiv Detail & Related papers (2024-03-21T09:50:39Z)
- Breast Ultrasound Tumor Classification Using a Hybrid Multitask CNN-Transformer Network [63.845552349914186]
Capturing global contextual information plays a critical role in breast ultrasound (BUS) image classification.
Vision Transformers have an improved capability of capturing global contextual information but may distort the local image patterns due to the tokenization operations.
In this study, we proposed a hybrid multitask deep neural network called Hybrid-MT-ESTAN, designed to perform BUS tumor classification and segmentation.
arXiv Detail & Related papers (2023-08-04T01:19:32Z)
- Multiple Instance Ensembling For Paranasal Anomaly Classification In The Maxillary Sinus [46.1292414445895]
Paranasal anomalies can present with a wide range of morphological features.
Current approaches to paranasal anomaly classification are constrained to identifying a single anomaly at a time.
We investigate the feasibility of using a 3D convolutional neural network (CNN) to classify healthy maxillary sinuses (MS) and MS with polyps or cysts.
arXiv Detail & Related papers (2023-03-31T09:23:27Z)
- HistoPerm: A Permutation-Based View Generation Approach for Improving Histopathologic Feature Representation Learning [33.1098457952173]
HistoPerm is a view generation method for representation learning using joint embedding architectures.
HistoPerm permutes augmented views of patches extracted from whole-slide histology images to improve classification performance.
Our results show that HistoPerm consistently improves patch- and slide-level classification performance in terms of accuracy, F1-score, and AUC.
arXiv Detail & Related papers (2022-09-13T17:35:08Z)