UNet-2022: Exploring Dynamics in Non-isomorphic Architecture
- URL: http://arxiv.org/abs/2210.15566v1
- Date: Thu, 27 Oct 2022 16:00:04 GMT
- Title: UNet-2022: Exploring Dynamics in Non-isomorphic Architecture
- Authors: Jiansen Guo, Hong-Yu Zhou, Liansheng Wang, Yizhou Yu
- Abstract summary: We propose a parallel non-isomorphic block that takes the advantages of self-attention and convolution with simple parallelization.
We name the resulting U-shape segmentation model as UNet-2022.
In experiments, UNet-2022 obviously outperforms its counterparts in a range segmentation tasks.
- Score: 52.04899592688968
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent medical image segmentation models are mostly hybrid, which integrate
self-attention and convolution layers into the non-isomorphic architecture.
However, one potential drawback of these approaches is that they failed to
provide an intuitive explanation of why this hybrid combination manner is
beneficial, making it difficult for subsequent work to make improvements on top
of them. To address this issue, we first analyze the differences between the
weight allocation mechanisms of the self-attention and convolution. Based on
this analysis, we propose to construct a parallel non-isomorphic block that
takes the advantages of self-attention and convolution with simple
parallelization. We name the resulting U-shape segmentation model as UNet-2022.
In experiments, UNet-2022 obviously outperforms its counterparts in a range
segmentation tasks, including abdominal multi-organ segmentation, automatic
cardiac diagnosis, neural structures segmentation, and skin lesion
segmentation, sometimes surpassing the best performing baseline by 4%.
Specifically, UNet-2022 surpasses nnUNet, the most recognized segmentation
model at present, by large margins. These phenomena indicate the potential of
UNet-2022 to become the model of choice for medical image segmentation.
Related papers
- MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment [20.358300924109162]
In clinical practice, certain modalities of MRI may be missing, which presents a more difficult scenario.
Knowledge Distillation, Domain Adaption, and Shared Latent Space have emerged as commonly promising strategies.
We propose a novel paradigm that aligns latent features of involved modalities to a well-defined distribution anchor as the substitution of the pre-trained model.
arXiv Detail & Related papers (2024-08-18T13:16:30Z) - HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis [19.04633470168871]
Panoramic image segmentation in computational pathology presents a remarkable challenge due to the morphologically complex and variably scaled anatomy.
In this paper, we propose a novel Hierarchical Adaptive Taxonomy (HATs) method, which is designed to thoroughly segment panoramic views of kidney structures by leveraging detailed anatomical insights.
Our approach entails (1) the innovative HATs technique which translates spatial relationships among 15 distinct object classes into a versatile "plug-and-play" loss function that spans across regions, functional units, and cells, (2) the incorporation of anatomical hierarchies and scale considerations into a unified simple matrix representation for all panoramic entities, and (3) the
arXiv Detail & Related papers (2024-06-30T05:35:26Z) - Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training [73.90260246781435]
We present Lory, the first approach that scales such architectures to autoregressive language model pre-training.
We show significant performance gains over parameter-matched dense models on both perplexity and a variety of downstream tasks.
Despite segment-level routing, Lory models achieve competitive performance compared to state-of-the-art MoE models with token-level routing.
arXiv Detail & Related papers (2024-05-06T03:06:33Z) - Optimizing contrastive learning for cortical folding pattern detection [0.0]
We build a self-supervised deep learning model to detect folding patterns in the cingulate region.
This is the first time that a self-supervised deep learning model has been applied to cortical skeletons on such a large dataset.
arXiv Detail & Related papers (2024-01-31T17:59:57Z) - SLCA: Slow Learner with Classifier Alignment for Continual Learning on a
Pre-trained Model [73.80068155830708]
We present an extensive analysis for continual learning on a pre-trained model (CLPM)
We propose a simple but extremely effective approach named Slow Learner with Alignment (SLCA)
Across a variety of scenarios, our proposal provides substantial improvements for CLPM.
arXiv Detail & Related papers (2023-03-09T08:57:01Z) - Orthogonal Ensemble Networks for Biomedical Image Segmentation [10.011414604407681]
We introduce Orthogonal Ensemble Networks (OEN), a novel framework to explicitly enforce model diversity.
We benchmark the proposed framework in two challenging brain lesion segmentation tasks.
The experimental results show that our approach produces more robust and well-calibrated ensemble models.
arXiv Detail & Related papers (2021-05-22T23:44:55Z) - A Twin Neural Model for Uplift [59.38563723706796]
Uplift is a particular case of conditional treatment effect modeling.
We propose a new loss function defined by leveraging a connection with the Bayesian interpretation of the relative risk.
We show our proposed method is competitive with the state-of-the-art in simulation setting and on real data from large scale randomized experiments.
arXiv Detail & Related papers (2021-05-11T16:02:39Z) - Generalized Organ Segmentation by Imitating One-shot Reasoning using
Anatomical Correlation [55.1248480381153]
We propose OrganNet which learns a generalized organ concept from a set of annotated organ classes and then transfer this concept to unseen classes.
We show that OrganNet can effectively resist the wide variations in organ morphology and produce state-of-the-art results in one-shot segmentation task.
arXiv Detail & Related papers (2021-03-30T13:41:12Z) - Learning Bijective Feature Maps for Linear ICA [73.85904548374575]
We show that existing probabilistic deep generative models (DGMs) which are tailor-made for image data, underperform on non-linear ICA tasks.
To address this, we propose a DGM which combines bijective feature maps with a linear ICA model to learn interpretable latent structures for high-dimensional data.
We create models that converge quickly, are easy to train, and achieve better unsupervised latent factor discovery than flow-based models, linear ICA, and Variational Autoencoders on images.
arXiv Detail & Related papers (2020-02-18T17:58:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.