Multiplex-detection Based Multiple Instance Learning Network for Whole
Slide Image Classification
- URL: http://arxiv.org/abs/2208.03526v1
- Date: Sat, 6 Aug 2022 14:36:48 GMT
- Title: Multiplex-detection Based Multiple Instance Learning Network for Whole
Slide Image Classification
- Authors: Zhikang Wang, Yue Bi, Tong Pan, Chris Bain, Richard Bassed, Seiya
Imoto, Jianhua Yao, Jiangning Song
- Abstract summary: Multiple instance learning (MIL) is a powerful approach to classify whole slide images (WSIs) for diagnostic pathology.
We propose a novel multiplex-detection-based multiple instance learning (MDMIL) to tackle the issues above.
Specifically, MDMIL is constructed by the internal query generation module (IQGM) and the multiplex detection module (MDM)
- Score: 2.61155594652503
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multiple instance learning (MIL) is a powerful approach to classify whole
slide images (WSIs) for diagnostic pathology. A fundamental challenge of MIL on
WSI classification is to discover the \textit{critical instances} that trigger
the bag label. However, previous methods are primarily designed under the
independent and identical distribution hypothesis (\textit{i.i.d}), ignoring
either the correlations between instances or heterogeneity of tumours. In this
paper, we propose a novel multiplex-detection-based multiple instance learning
(MDMIL) to tackle the issues above. Specifically, MDMIL is constructed by the
internal query generation module (IQGM) and the multiplex detection module
(MDM) and assisted by the memory-based contrastive loss during training.
Firstly, IQGM gives the probability of instances and generates the internal
query (IQ) for the subsequent MDM by aggregating highly reliable features after
the distribution analysis. Secondly, the multiplex-detection cross-attention
(MDCA) and multi-head self-attention (MHSA) in MDM cooperate to generate the
final representations for the WSI. In this process, the IQ and trainable
variational query (VQ) successfully build up the connections between instances
and significantly improve the model's robustness toward heterogeneous tumours.
At last, to further enforce constraints in the feature space and stabilize the
training process, we adopt a memory-based contrastive loss, which is
practicable for WSI classification even with a single sample as input in each
iteration. We conduct experiments on three computational pathology datasets,
e.g., CAMELYON16, TCGA-NSCLC, and TCGA-RCC datasets. The superior accuracy and
AUC demonstrate the superiority of our proposed MDMIL over other
state-of-the-art methods.
Related papers
- Binary Code Similarity Detection via Graph Contrastive Learning on Intermediate Representations [52.34030226129628]
Binary Code Similarity Detection (BCSD) plays a crucial role in numerous fields, including vulnerability detection, malware analysis, and code reuse identification.
In this paper, we propose IRBinDiff, which mitigates compilation differences by leveraging LLVM-IR with higher-level semantic abstraction.
Our extensive experiments, conducted under varied compilation settings, demonstrate that IRBinDiff outperforms other leading BCSD methods in both One-to-one comparison and One-to-many search scenarios.
arXiv Detail & Related papers (2024-10-24T09:09:20Z) - Multi-Source and Test-Time Domain Adaptation on Multivariate Signals using Spatio-Temporal Monge Alignment [59.75420353684495]
Machine learning applications on signals such as computer vision or biomedical data often face challenges due to the variability that exists across hardware devices or session recordings.
In this work, we propose Spatio-Temporal Monge Alignment (STMA) to mitigate these variabilities.
We show that STMA leads to significant and consistent performance gains between datasets acquired with very different settings.
arXiv Detail & Related papers (2024-07-19T13:33:38Z) - cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process [23.266122629592807]
Multiple instance learning (MIL) has been extensively applied to whole slide histoparametric image (WSI) analysis.
The existing aggregation strategy in MIL, which primarily relies on the first-order distance between instances, fails to accurately approximate the true feature distribution of each instance.
We propose a new Bayesian nonparametric framework for multiple instance learning, which adopts a cascade of Dirichlet processes (cDP) to incorporate the instance-to-bag characteristic of the WSIs.
arXiv Detail & Related papers (2024-07-16T07:28:39Z) - MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models [56.37780601189795]
We propose a framework named MamMIL for WSI analysis.
We represent each WSI as an undirected graph.
To address the problem that Mamba can only process 1D sequences, we propose a topology-aware scanning mechanism.
arXiv Detail & Related papers (2024-03-08T09:02:13Z) - Multi-Modality Multi-Scale Cardiovascular Disease Subtypes
Classification Using Raman Image and Medical History [2.9315342447802317]
We propose a multi-modality multi-scale model called M3S, which is a novel deep learning method with two core modules to address these issues.
First, we convert RS data to various resolution images by the Gramian angular field (GAF) to enlarge nuance, and a two-branch structure is leveraged to get embeddings for distinction.
Second, a probability matrix and a weight matrix are used to enhance the classification capacity by combining the RS and medical history data.
arXiv Detail & Related papers (2023-04-18T22:09:16Z) - Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction [138.04956118993934]
We propose a novel Transformer-based method, coarse-to-fine sparse Transformer (CST)
CST embedding HSI sparsity into deep learning for HSI reconstruction.
In particular, CST uses our proposed spectra-aware screening mechanism (SASM) for coarse patch selecting. Then the selected patches are fed into our customized spectra-aggregation hashing multi-head self-attention (SAH-MSA) for fine pixel clustering and self-similarity capturing.
arXiv Detail & Related papers (2022-03-09T16:17:47Z) - Modality Completion via Gaussian Process Prior Variational Autoencoders
for Multi-Modal Glioma Segmentation [75.58395328700821]
We propose a novel model, Multi-modal Gaussian Process Prior Variational Autoencoder (MGP-VAE), to impute one or more missing sub-modalities for a patient scan.
MGP-VAE can leverage the Gaussian Process (GP) prior on the Variational Autoencoder (VAE) to utilize the subjects/patients and sub-modalities correlations.
We show the applicability of MGP-VAE on brain tumor segmentation where either, two, or three of four sub-modalities may be missing.
arXiv Detail & Related papers (2021-07-07T19:06:34Z) - TransMIL: Transformer based Correlated Multiple Instance Learning for
Whole Slide Image Classication [38.58585442160062]
Multiple instance learning (MIL) is a powerful tool to solve the weakly supervised classification in whole slide image (WSI) based pathology diagnosis.
We proposed a new framework, called correlated MIL, and provided a proof for convergence.
We conducted various experiments for three different computational pathology problems and achieved better performance and faster convergence compared with state-of-the-art methods.
arXiv Detail & Related papers (2021-06-02T02:57:54Z) - Multiple-shooting adjoint method for whole-brain dynamic causal modeling [8.943170877509923]
Multiple-Shooting Adjoint (MSA) is suitable for large-scale systems such as whole-brain analysis in functional MRI (fMRI)
MSA achieves better accuracy in parameter value estimation than EM.
We show improved classification of autism spectrum disorder (ASD) vs. control compared to using the functional connectome.
arXiv Detail & Related papers (2021-02-14T05:00:12Z) - Learning Multi-Modal Volumetric Prostate Registration with Weak
Inter-Subject Spatial Correspondence [2.6894568533991543]
We introduce an auxiliary input to the neural network for the prior information about the prostate location in the MR sequence.
With weakly labelled MR-TRUS prostate data, we showed registration quality comparable to the state-of-the-art deep learning-based method.
arXiv Detail & Related papers (2021-02-09T16:48:59Z) - Deep Representational Similarity Learning for analyzing neural
signatures in task-based fMRI dataset [81.02949933048332]
This paper develops Deep Representational Similarity Learning (DRSL), a deep extension of Representational Similarity Analysis (RSA)
DRSL is appropriate for analyzing similarities between various cognitive tasks in fMRI datasets with a large number of subjects.
arXiv Detail & Related papers (2020-09-28T18:30:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.