SCORPION: Addressing Scanner-Induced Variability in Histopathology
- URL: http://arxiv.org/abs/2507.20907v1
- Date: Mon, 28 Jul 2025 15:00:49 GMT
- Title: SCORPION: Addressing Scanner-Induced Variability in Histopathology
- Authors: Jeongun Ryu, Heon Song, Seungeun Lee, Soo Ick Cho, Jiwon Shin, Kyunghyun Paeng, Sérgio Pereira,
- Abstract summary: Ensuring reliable model performance across diverse domains is a critical challenge in computational pathology.<n>We release SCORPION, a new dataset explicitly designed to evaluate model reliability under scanner variability.<n>We propose SimCons, a flexible framework that combines augmentation-based domain generalization techniques with a consistency loss to explicitly address scanner generalization.
- Score: 4.296051492560909
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Ensuring reliable model performance across diverse domains is a critical challenge in computational pathology. A particular source of variability in Whole-Slide Images is introduced by differences in digital scanners, thus calling for better scanner generalization. This is critical for the real-world adoption of computational pathology, where the scanning devices may differ per institution or hospital, and the model should not be dependent on scanner-induced details, which can ultimately affect the patient's diagnosis and treatment planning. However, past efforts have primarily focused on standard domain generalization settings, evaluating on unseen scanners during training, without directly evaluating consistency across scanners for the same tissue. To overcome this limitation, we introduce SCORPION, a new dataset explicitly designed to evaluate model reliability under scanner variability. SCORPION includes 480 tissue samples, each scanned with 5 scanners, yielding 2,400 spatially aligned patches. This scanner-paired design allows for the isolation of scanner-induced variability, enabling a rigorous evaluation of model consistency while controlling for differences in tissue composition. Furthermore, we propose SimCons, a flexible framework that combines augmentation-based domain generalization techniques with a consistency loss to explicitly address scanner generalization. We empirically show that SimCons improves model consistency on varying scanners without compromising task-specific performance. By releasing the SCORPION dataset and proposing SimCons, we provide the research community with a crucial resource for evaluating and improving model consistency across diverse scanners, setting a new standard for reliability testing.
Related papers
- Pathology Foundation Models are Scanner Sensitive: Benchmark and Mitigation with Contrastive ScanGen Loss [6.310092608526967]
We show that Foundation Models (FMs) still suffer from scanner bias.<n>We propose ScanGen, a contrastive loss function applied during task-specific fine-tuning that mitigates scanner bias.<n>Our approach is applied to the Multiple Instance Learning task of Epidermal Growth Factor Receptor (EGFR) mutation prediction from H&E-stained WSIs in lung cancer.
arXiv Detail & Related papers (2025-07-29T12:35:08Z) - RadFabric: Agentic AI System with Reasoning Capability for Radiology [61.25593938175618]
RadFabric is a multi agent, multimodal reasoning framework that unifies visual and textual analysis for comprehensive CXR interpretation.<n>System employs specialized CXR agents for pathology detection, an Anatomical Interpretation Agent to map visual findings to precise anatomical structures, and a Reasoning Agent powered by large multimodal reasoning models to synthesize visual, anatomical, and clinical data into transparent and evidence based diagnoses.
arXiv Detail & Related papers (2025-06-17T03:10:33Z) - DISARM++: Beyond scanner-free harmonization [0.0]
Harmonization of T1-weighted MR images across different scanners is crucial for ensuring consistency in neuroimaging studies.<n>This study introduces a novel approach to direct image harmonization, moving beyond feature standardization to ensure that extracted features remain inherently reliable for downstream analysis.<n>Our method enables image transfer in two ways: (1) mapping images to a scanner-free space for uniform appearance across all scanners, and (2) transforming images into the domain of a specific scanner used in model training, embedding its unique characteristics.
arXiv Detail & Related papers (2025-05-06T17:36:49Z) - A Unified Model for Compressed Sensing MRI Across Undersampling Patterns [69.19631302047569]
We propose a unified MRI reconstruction model robust to various measurement undersampling patterns and image resolutions.<n>Our model improves SSIM by 11% and PSNR by 4 dB over a state-of-the-art CNN (End-to-End VarNet) with 600$times$ faster inference than diffusion methods.
arXiv Detail & Related papers (2024-10-05T20:03:57Z) - The Impact of Scanner Domain Shift on Deep Learning Performance in Medical Imaging: an Experimental Study [1.4628485867112924]
We evaluate the impact of scanner domain shift on convolutional neural network performance for different automated diagnostic tasks.
We find that network performance on data from a different scanner is almost always worse than on same-scanner data.
We attribute this drop to the standardized nature of CT acquisition systems which is not present in MRI or X-ray.
arXiv Detail & Related papers (2024-09-06T15:59:30Z) - On Sensitivity and Robustness of Normalization Schemes to Input
Distribution Shifts in Automatic MR Image Diagnosis [58.634791552376235]
Deep Learning (DL) models have achieved state-of-the-art performance in diagnosing multiple diseases using reconstructed images as input.
DL models are sensitive to varying artifacts as it leads to changes in the input data distribution between the training and testing phases.
We propose to use other normalization techniques, such as Group Normalization and Layer Normalization, to inject robustness into model performance against varying image artifacts.
arXiv Detail & Related papers (2023-06-23T03:09:03Z) - PrepNet: A Convolutional Auto-Encoder to Homogenize CT Scans for
Cross-Dataset Medical Image Analysis [0.22485007639406518]
COVID-19 diagnosis can now be done efficiently using PCR tests, but this use case exemplifies the need for a methodology to overcome data variability issues.
We propose a novel generative approach that aims at erasing the differences induced by e.g. the imaging technology while simultaneously introducing minimal changes to the CT scans.
arXiv Detail & Related papers (2022-08-19T15:49:47Z) - Improving Classification Model Performance on Chest X-Rays through Lung
Segmentation [63.45024974079371]
We propose a deep learning approach to enhance abnormal chest x-ray (CXR) identification performance through segmentations.
Our approach is designed in a cascaded manner and incorporates two modules: a deep neural network with criss-cross attention modules (XLSor) for localizing lung region in CXR images and a CXR classification model with a backbone of a self-supervised momentum contrast (MoCo) model pre-trained on large-scale CXR data sets.
arXiv Detail & Related papers (2022-02-22T15:24:06Z) - An augmentation strategy to mimic multi-scanner variability in MRI [0.2839215671218125]
Most publicly available brain MRI datasets are very homogeneous in terms of scanner and protocols.
It is difficult for models that learn from such data to generalize to multi-center and multi-scanner data.
We propose a novel data augmentation approach with the aim of approximating the variability in terms of intensities and contrasts present in real world clinical data.
arXiv Detail & Related papers (2021-03-23T14:49:38Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Fader Networks for domain adaptation on fMRI: ABIDE-II study [68.5481471934606]
We use 3D convolutional autoencoders to build the domain irrelevant latent space image representation and demonstrate this method to outperform existing approaches on ABIDE data.
arXiv Detail & Related papers (2020-10-14T16:50:50Z) - Improved inter-scanner MS lesion segmentation by adversarial training on
longitudinal data [0.0]
The evaluation of white matter lesion progression is an important biomarker in the follow-up of MS patients.
Current automated lesion segmentation algorithms are susceptible to variability in image characteristics related to MRI scanner or protocol differences.
We propose a model that improves the consistency of MS lesion segmentations in inter-scanner studies.
arXiv Detail & Related papers (2020-02-03T16:56:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.