Rethinking Mitosis Detection: Towards Diverse Data and Feature
  Representation
        - URL: http://arxiv.org/abs/2307.05889v1
- Date: Wed, 12 Jul 2023 03:33:11 GMT
- Title: Rethinking Mitosis Detection: Towards Diverse Data and Feature
  Representation
- Authors: Hao Wang, Jiatai Lin, Danyi Li, Jing Wang, Bingchao Zhao, Zhenwei Shi,
  Xipeng Pan, Huadeng Wang, Bingbing Li, Changhong Liang, Guoqiang Han, Li
  Liang, Chu Han, Zaiyi Liu
- Abstract summary: We propose a novel generalizable framework (MitDet) for mitosis detection.
Our proposed model outperforms all the SOTA approaches in several popular mitosis detection datasets.
- Score: 30.882319057927052
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Mitosis detection is one of the fundamental tasks in computational pathology,
which is extremely challenging due to the heterogeneity of mitotic cell. Most
of the current studies solve the heterogeneity in the technical aspect by
increasing the model complexity. However, lacking consideration of the
biological knowledge and the complex model design may lead to the overfitting
problem while limited the generalizability of the detection model. In this
paper, we systematically study the morphological appearances in different
mitotic phases as well as the ambiguous non-mitotic cells and identify that
balancing the data and feature diversity can achieve better generalizability.
Based on this observation, we propose a novel generalizable framework (MitDet)
for mitosis detection. The data diversity is considered by the proposed
diversity-guided sample balancing (DGSB). And the feature diversity is
preserved by inter- and intra- class feature diversity-preserved module
(InCDP). Stain enhancement (SE) module is introduced to enhance the
domain-relevant diversity of both data and features simultaneously. Extensive
experiments have demonstrated that our proposed model outperforms all the SOTA
approaches in several popular mitosis detection datasets in both internal and
external test sets using minimal annotation efforts with point annotations
only. Comprehensive ablation studies have also proven the effectiveness of the
rethinking of data and feature diversity balancing. By analyzing the results
quantitatively and qualitatively, we believe that our proposed model not only
achieves SOTA performance but also might inspire the future studies in new
perspectives. Source code is at https://github.com/Onehour0108/MitDet.
 
      
        Related papers
        - Robust Multimodal Survival Prediction with the Latent Differentiation   Conditional Variational AutoEncoder [18.519138120118125]
 We propose a Conditional Latent Differentiation Variational AutoEncoder (LD-CVAE) for robust multimodal survival prediction.
Specifically, a Variational Information Bottleneck Transformer (VIB-Trans) module is proposed to learn compressed pathological representations from the gigapixel WSIs.
We develop a novel Latent Differentiation Variational AutoEncoder (LD-VAE) to learn the common and specific posteriors for the genomic embeddings with diverse functions.
 arXiv  Detail & Related papers  (2025-03-12T15:58:37Z)
- Exploring the Efficacy of Meta-Learning: Unveiling Superior Data   Diversity Utilization of MAML Over Pre-training [1.3980986259786223]
 We show that dataset diversity can impact the performance of vision models.
Our study shows positive correlations between test set accuracy and data diversity.
These findings support our hypothesis and demonstrate a promising way for a deeper exploration of how formal data diversity influences model performance.
 arXiv  Detail & Related papers  (2025-01-15T00:56:59Z)
- Stabilizing Machine Learning for Reproducible and Explainable Results: A   Novel Validation Approach to Subject-Specific Insights [2.7516838144367735]
 We propose a novel validation approach that uses a general ML model to ensure reproducible performance and robust feature importance analysis.
We tested a single Random Forest (RF) model on nine datasets varying in domain, sample size, and demographics.
Our repeated trials approach consistently identified key features at the subject level and improved group-level feature importance analysis.
 arXiv  Detail & Related papers  (2024-12-16T23:14:26Z)
- UNICORN: A Deep Learning Model for Integrating Multi-Stain Data in   Histopathology [2.9389205138207277]
 UNICORN is a multi-modal transformer capable of processing multi-stain histopathology for atherosclerosis severity class prediction.
The architecture comprises a two-stage, end-to-end trainable model with specialized modules utilizing transformer self-attention blocks.
UNICORN achieved a classification accuracy of 0.67, outperforming other state-of-the-art models.
 arXiv  Detail & Related papers  (2024-09-26T12:13:52Z)
- GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic   Foundation Models [56.63218531256961]
 We introduce GenBench, a benchmarking suite specifically tailored for evaluating the efficacy of Genomic Foundation Models.
GenBench offers a modular and expandable framework that encapsulates a variety of state-of-the-art methodologies.
We provide a nuanced analysis of the interplay between model architecture and dataset characteristics on task-specific performance.
 arXiv  Detail & Related papers  (2024-06-01T08:01:05Z)
- Seeing Unseen: Discover Novel Biomedical Concepts via
  Geometry-Constrained Probabilistic Modeling [53.7117640028211]
 We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
 arXiv  Detail & Related papers  (2024-03-02T00:56:05Z)
- Beyond DAGs: A Latent Partial Causal Model for Multimodal Learning [80.44084021062105]
 We propose a novel latent partial causal model for multimodal data, featuring two latent coupled variables, connected by an undirected edge, to represent the transfer of knowledge across modalities.<n>Under specific statistical assumptions, we establish an identifiability result, demonstrating that representations learned by multimodal contrastive learning correspond to the latent coupled variables up to a trivial transformation.<n>Experiments on a pre-trained CLIP model embodies disentangled representations, enabling few-shot learning and improving domain generalization across diverse real-world datasets.
 arXiv  Detail & Related papers  (2024-02-09T07:18:06Z)
- Differentiable Agent-based Epidemiology [71.81552021144589]
 We introduce GradABM: a scalable, differentiable design for agent-based modeling that is amenable to gradient-based learning with automatic differentiation.
 GradABM can quickly simulate million-size populations in few seconds on commodity hardware, integrate with deep neural networks and ingest heterogeneous data sources.
 arXiv  Detail & Related papers  (2022-07-20T07:32:02Z)
- Equivariance Allows Handling Multiple Nuisance Variables When Analyzing
  Pooled Neuroimaging Datasets [53.34152466646884]
 In this paper, we show how bringing recent results on equivariant representation learning instantiated on structured spaces together with simple use of classical results on causal inference provides an effective practical solution.
We demonstrate how our model allows dealing with more than one nuisance variable under some assumptions and can enable analysis of pooled scientific datasets in scenarios that would otherwise entail removing a large portion of the samples.
 arXiv  Detail & Related papers  (2022-03-29T04:54:06Z)
- MoReL: Multi-omics Relational Learning [26.484803417186384]
 We propose a novel deep Bayesian generative model to efficiently infer a multi-partite graph encoding molecular interactions across heterogeneous views.
With such an optimal transport regularization in the deep Bayesian generative model, it not only allows incorporating view-specific side information, but also increases the model flexibility with the distribution-based regularization.
 arXiv  Detail & Related papers  (2022-03-15T02:50:07Z)
- Robust Finite Mixture Regression for Heterogeneous Targets [70.19798470463378]
 We propose an FMR model that finds sample clusters and jointly models multiple incomplete mixed-type targets simultaneously.
We provide non-asymptotic oracle performance bounds for our model under a high-dimensional learning framework.
The results show that our model can achieve state-of-the-art performance.
 arXiv  Detail & Related papers  (2020-10-12T03:27:07Z)
- Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
 Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
 arXiv  Detail & Related papers  (2020-06-11T17:29:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.