3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers
- URL: http://arxiv.org/abs/2303.12073v1
- Date: Tue, 21 Mar 2023 17:58:49 GMT
- Title: 3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers
- Authors: Omkar Thawakar, Rao Muhammad Anwer, Jorma Laaksonen, Orly Reiner,
Mubarak Shah, Fahad Shahbaz Khan
- Abstract summary: We propose a hybrid encoder-decoder framework that efficiently computes spatial and temporal attentions in parallel.
We also introduce a semantic clutter-background adversarial loss during training that aids in the region of mitochondria instances from the background.
- Score: 101.44668514239959
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate 3D mitochondria instance segmentation in electron microscopy (EM) is
a challenging problem and serves as a prerequisite to empirically analyze their
distributions and morphology. Most existing approaches employ 3D convolutions
to obtain representative features. However, these convolution-based approaches
struggle to effectively capture long-range dependencies in the volume
mitochondria data, due to their limited local receptive field. To address this,
we propose a hybrid encoder-decoder framework based on a split spatio-temporal
attention module that efficiently computes spatial and temporal self-attentions
in parallel, which are later fused through a deformable convolution. Further,
we introduce a semantic foreground-background adversarial loss during training
that aids in delineating the region of mitochondria instances from the
background clutter. Our extensive experiments on three benchmarks, Lucchi,
MitoEM-R and MitoEM-H, reveal the benefits of the proposed contributions
achieving state-of-the-art results on all three datasets. Our code and models
are available at https://github.com/OmkarThawakar/STT-UNET.
Related papers
- EM-GANSim: Real-time and Accurate EM Simulation Using Conditional GANs for 3D Indoor Scenes [55.2480439325792]
We present a novel machine-learning (ML) approach (EM-GANSim) for real-time electromagnetic (EM) propagation.
In practice, it can compute the signal strength in a few milliseconds on any location in 3D indoor environments.
arXiv Detail & Related papers (2024-05-27T17:19:02Z) - Leveraging Frequency Domain Learning in 3D Vessel Segmentation [50.54833091336862]
In this study, we leverage Fourier domain learning as a substitute for multi-scale convolutional kernels in 3D hierarchical segmentation models.
We show that our novel network achieves remarkable dice performance (84.37% on ASACA500 and 80.32% on ImageCAS) in tubular vessel segmentation tasks.
arXiv Detail & Related papers (2024-01-11T19:07:58Z) - E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D
Medical Image Segmentation [36.367368163120794]
We propose a 3D medical image segmentation model, named Efficient to Efficient Network (E2ENet)
It incorporates two parametrically and computationally efficient designs.
It consistently achieves a superior trade-off between accuracy and efficiency across various resource constraints.
arXiv Detail & Related papers (2023-12-07T22:13:37Z) - UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation [93.88170217725805]
We propose a 3D medical image segmentation approach, named UNETR++, that offers both high-quality segmentation masks as well as efficiency in terms of parameters, compute cost, and inference speed.
The core of our design is the introduction of a novel efficient paired attention (EPA) block that efficiently learns spatial and channel-wise discriminative features.
Our evaluations on five benchmarks, Synapse, BTCV, ACDC, BRaTs, and Decathlon-Lung, reveal the effectiveness of our contributions in terms of both efficiency and accuracy.
arXiv Detail & Related papers (2022-12-08T18:59:57Z) - Semi-Supervised Segmentation of Mitochondria from Electron Microscopy
Images Using Spatial Continuity [3.631638087834872]
We propose a semi-supervised deep learning model that segments mitochondria by leveraging the spatial continuity of their structural, morphological, and contextual information.
Our model achieves performance similar to that of state-of-the-art fully supervised models but requires only 20% of their annotated training data.
arXiv Detail & Related papers (2022-06-06T06:52:19Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z) - A persistent homology-based topological loss for CNN-based multi-class
segmentation of CMR [5.898114915426535]
Multi-class segmentation of cardiac magnetic resonance (CMR) images seeks a separation of data into anatomical components with known structure and configuration.
Most popular CNN-based methods are optimised using pixel wise loss functions, ignorant of the spatially extended features that characterise anatomy.
We extend these approaches to the task of multi-class segmentation by building an enriched topological description of all class labels and class label pairs.
arXiv Detail & Related papers (2021-07-27T09:21:38Z) - Advanced Deep Networks for 3D Mitochondria Instance Segmentation [46.295601731565725]
We propose two advanced deep networks, named ResUNet-R and ResUNet-H, for 3D mitochondria instance segmentation from Rat and Human samples.
Specifically, we design a simple yet effective anisotropic convolution block and deploy a multi-scale training strategy, which together boost the segmentation performance.
arXiv Detail & Related papers (2021-04-16T08:27:44Z) - HIVE-Net: Centerline-Aware HIerarchical View-Ensemble Convolutional
Network for Mitochondria Segmentation in EM Images [3.1498833540989413]
We introduce a novel hierarchical view-ensemble convolution (HVEC) to learn 3D spatial contexts using more efficient 2D convolutions.
The proposed method performs favorably against the state-of-the-art methods in accuracy and visual quality but with a greatly reduced model size.
arXiv Detail & Related papers (2021-01-08T06:56:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.