Rethinking Mitosis Detection: Towards Diverse Data and Feature
Representation
- URL: http://arxiv.org/abs/2307.05889v1
- Date: Wed, 12 Jul 2023 03:33:11 GMT
- Title: Rethinking Mitosis Detection: Towards Diverse Data and Feature
Representation
- Authors: Hao Wang, Jiatai Lin, Danyi Li, Jing Wang, Bingchao Zhao, Zhenwei Shi,
Xipeng Pan, Huadeng Wang, Bingbing Li, Changhong Liang, Guoqiang Han, Li
Liang, Chu Han, Zaiyi Liu
- Abstract summary: We propose a novel generalizable framework (MitDet) for mitosis detection.
Our proposed model outperforms all the SOTA approaches in several popular mitosis detection datasets.
- Score: 30.882319057927052
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Mitosis detection is one of the fundamental tasks in computational pathology,
which is extremely challenging due to the heterogeneity of mitotic cell. Most
of the current studies solve the heterogeneity in the technical aspect by
increasing the model complexity. However, lacking consideration of the
biological knowledge and the complex model design may lead to the overfitting
problem while limited the generalizability of the detection model. In this
paper, we systematically study the morphological appearances in different
mitotic phases as well as the ambiguous non-mitotic cells and identify that
balancing the data and feature diversity can achieve better generalizability.
Based on this observation, we propose a novel generalizable framework (MitDet)
for mitosis detection. The data diversity is considered by the proposed
diversity-guided sample balancing (DGSB). And the feature diversity is
preserved by inter- and intra- class feature diversity-preserved module
(InCDP). Stain enhancement (SE) module is introduced to enhance the
domain-relevant diversity of both data and features simultaneously. Extensive
experiments have demonstrated that our proposed model outperforms all the SOTA
approaches in several popular mitosis detection datasets in both internal and
external test sets using minimal annotation efforts with point annotations
only. Comprehensive ablation studies have also proven the effectiveness of the
rethinking of data and feature diversity balancing. By analyzing the results
quantitatively and qualitatively, we believe that our proposed model not only
achieves SOTA performance but also might inspire the future studies in new
perspectives. Source code is at https://github.com/Onehour0108/MitDet.
Related papers
- UNICORN: A Deep Learning Model for Integrating Multi-Stain Data in Histopathology [2.9389205138207277]
UNICORN is a multi-modal transformer capable of processing multi-stain histopathology for atherosclerosis severity class prediction.
The architecture comprises a two-stage, end-to-end trainable model with specialized modules utilizing transformer self-attention blocks.
UNICORN achieved a classification accuracy of 0.67, outperforming other state-of-the-art models.
arXiv Detail & Related papers (2024-09-26T12:13:52Z) - GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models [56.63218531256961]
We introduce GenBench, a benchmarking suite specifically tailored for evaluating the efficacy of Genomic Foundation Models.
GenBench offers a modular and expandable framework that encapsulates a variety of state-of-the-art methodologies.
We provide a nuanced analysis of the interplay between model architecture and dataset characteristics on task-specific performance.
arXiv Detail & Related papers (2024-06-01T08:01:05Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Differentiable Agent-based Epidemiology [71.81552021144589]
We introduce GradABM: a scalable, differentiable design for agent-based modeling that is amenable to gradient-based learning with automatic differentiation.
GradABM can quickly simulate million-size populations in few seconds on commodity hardware, integrate with deep neural networks and ingest heterogeneous data sources.
arXiv Detail & Related papers (2022-07-20T07:32:02Z) - Equivariance Allows Handling Multiple Nuisance Variables When Analyzing
Pooled Neuroimaging Datasets [53.34152466646884]
In this paper, we show how bringing recent results on equivariant representation learning instantiated on structured spaces together with simple use of classical results on causal inference provides an effective practical solution.
We demonstrate how our model allows dealing with more than one nuisance variable under some assumptions and can enable analysis of pooled scientific datasets in scenarios that would otherwise entail removing a large portion of the samples.
arXiv Detail & Related papers (2022-03-29T04:54:06Z) - MoReL: Multi-omics Relational Learning [26.484803417186384]
We propose a novel deep Bayesian generative model to efficiently infer a multi-partite graph encoding molecular interactions across heterogeneous views.
With such an optimal transport regularization in the deep Bayesian generative model, it not only allows incorporating view-specific side information, but also increases the model flexibility with the distribution-based regularization.
arXiv Detail & Related papers (2022-03-15T02:50:07Z) - Robust Finite Mixture Regression for Heterogeneous Targets [70.19798470463378]
We propose an FMR model that finds sample clusters and jointly models multiple incomplete mixed-type targets simultaneously.
We provide non-asymptotic oracle performance bounds for our model under a high-dimensional learning framework.
The results show that our model can achieve state-of-the-art performance.
arXiv Detail & Related papers (2020-10-12T03:27:07Z) - Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.