OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D
Medical Data
- URL: http://arxiv.org/abs/2110.10640v1
- Date: Wed, 20 Oct 2021 16:14:26 GMT
- Title: OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D
Medical Data
- Authors: Christoph Reich, Tim Prangemeier, \"Ozdemir Cetin, Heinz Koeppl
- Abstract summary: Convolutional neural networks (CNNs) are the current state-of-the-art meta-algorithm for volumetric segmentation of medical data.
A key limitation of 3D CNNs on voxelised data is that the memory consumption grows cubically with the training data resolution.
We propose Occupancy Networks (OSS-Nets) to accurately and memory-efficiently segment 3D medical data.
- Score: 21.42609249273068
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Convolutional neural networks (CNNs) are the current state-of-the-art
meta-algorithm for volumetric segmentation of medical data, for example, to
localize COVID-19 infected tissue on computer tomography scans or the detection
of tumour volumes in magnetic resonance imaging. A key limitation of 3D CNNs on
voxelised data is that the memory consumption grows cubically with the training
data resolution. Occupancy networks (O-Nets) are an alternative for which the
data is represented continuously in a function space and 3D shapes are learned
as a continuous decision boundary. While O-Nets are significantly more memory
efficient than 3D CNNs, they are limited to simple shapes, are relatively slow
at inference, and have not yet been adapted for 3D semantic segmentation of
medical data. Here, we propose Occupancy Networks for Semantic Segmentation
(OSS-Nets) to accurately and memory-efficiently segment 3D medical data. We
build upon the original O-Net with modifications for increased expressiveness
leading to improved segmentation performance comparable to 3D CNNs, as well as
modifications for faster inference. We leverage local observations to represent
complex shapes and prior encoder predictions to expedite inference. We showcase
OSS-Net's performance on 3D brain tumour and liver segmentation against a
function space baseline (O-Net), a performance baseline (3D residual U-Net),
and an efficiency baseline (2D residual U-Net). OSS-Net yields segmentation
results similar to the performance baseline and superior to the function space
and efficiency baselines. In terms of memory efficiency, OSS-Net consumes
comparable amounts of memory as the function space baseline, somewhat more
memory than the efficiency baseline and significantly less than the performance
baseline. As such, OSS-Net enables memory-efficient and accurate 3D semantic
segmentation that can scale to high resolutions.
Related papers
- UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation [93.88170217725805]
We propose a 3D medical image segmentation approach, named UNETR++, that offers both high-quality segmentation masks as well as efficiency in terms of parameters, compute cost, and inference speed.
The core of our design is the introduction of a novel efficient paired attention (EPA) block that efficiently learns spatial and channel-wise discriminative features.
Our evaluations on five benchmarks, Synapse, BTCV, ACDC, BRaTs, and Decathlon-Lung, reveal the effectiveness of our contributions in terms of both efficiency and accuracy.
arXiv Detail & Related papers (2022-12-08T18:59:57Z) - GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction [50.248694764703714]
Unrolled neural networks have recently achieved state-of-the-art accelerated MRI reconstruction.
These networks unroll iterative optimization algorithms by alternating between physics-based consistency and neural-network based regularization.
We propose Greedy LEarning for Accelerated MRI reconstruction, an efficient training strategy for high-dimensional imaging settings.
arXiv Detail & Related papers (2022-07-18T06:01:29Z) - Memory-efficient Segmentation of High-resolution Volumetric MicroCT
Images [11.723370840090453]
We propose a memory-efficient network architecture for 3D high-resolution image segmentation.
The network incorporates both global and local features via a two-stage U-net-based cascaded framework.
Experiments show that it outperforms state-of-the-art 3D segmentation methods in terms of both segmentation accuracy and memory efficiency.
arXiv Detail & Related papers (2022-05-31T16:42:48Z) - MDA-Net: Multi-Dimensional Attention-Based Neural Network for 3D Image
Segmentation [4.221871357181261]
We propose a multi-dimensional attention network (MDA-Net) to efficiently integrate slice-wise, spatial, and channel-wise attention into a U-Net based network.
We evaluate our model on the MICCAI iSeg and IBSR datasets, and the experimental results demonstrate consistent improvements over existing methods.
arXiv Detail & Related papers (2021-05-10T16:58:34Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - Binary Graph Neural Networks [69.51765073772226]
Graph Neural Networks (GNNs) have emerged as a powerful and flexible framework for representation learning on irregular data.
In this paper, we present and evaluate different strategies for the binarization of graph neural networks.
We show that through careful design of the models, and control of the training process, binary graph neural networks can be trained at only a moderate cost in accuracy on challenging benchmarks.
arXiv Detail & Related papers (2020-12-31T18:48:58Z) - Efficient embedding network for 3D brain tumor segmentation [0.33727511459109777]
In this paper, we investigate a way to transfer the performance of a two-dimensional classiffication network for the purpose of three-dimensional semantic segmentation of brain tumors.
As the input data is in 3D, the first layers of the encoder are devoted to the reduction of the third dimension in order to fit the input of the EfficientNet network.
Experimental results on validation and test data from the BraTS 2020 challenge demonstrate that the proposed method achieve promising performance.
arXiv Detail & Related papers (2020-11-22T16:17:29Z) - DoDNet: Learning to segment multi-organ and tumors from multiple
partially labeled datasets [102.55303521877933]
We propose a dynamic on-demand network (DoDNet) that learns to segment multiple organs and tumors on partially labelled datasets.
DoDNet consists of a shared encoder-decoder architecture, a task encoding module, a controller for generating dynamic convolution filters, and a single but dynamic segmentation head.
arXiv Detail & Related papers (2020-11-20T04:56:39Z) - EDNet: Efficient Disparity Estimation with Cost Volume Combination and
Attention-based Spatial Residual [17.638034176859932]
Existing disparity estimation works mostly leverage the 4D concatenation volume and construct a very deep 3D convolution neural network (CNN) for disparity regression.
In this paper, we propose a network named EDNet for efficient disparity estimation.
Experiments on the Scene Flow and KITTI datasets show that EDNet outperforms the previous 3D CNN based works.
arXiv Detail & Related papers (2020-10-26T04:49:44Z) - Local Grid Rendering Networks for 3D Object Detection in Point Clouds [98.02655863113154]
CNNs are powerful but it would be computationally costly to directly apply convolutions on point data after voxelizing the entire point clouds to a dense regular 3D grid.
We propose a novel and principled Local Grid Rendering (LGR) operation to render the small neighborhood of a subset of input points into a low-resolution 3D grid independently.
We validate LGR-Net for 3D object detection on the challenging ScanNet and SUN RGB-D datasets.
arXiv Detail & Related papers (2020-07-04T13:57:43Z) - DDU-Nets: Distributed Dense Model for 3D MRI Brain Tumor Segmentation [27.547646527286886]
Three patterns of distributed dense connections (DDCs) are proposed to enhance feature reuse and propagation of CNNs.
For better detecting and segmenting brain tumors from 3D MR images, CNN-based models embedded with DDCs (DDU-Nets) are trained efficiently from pixel to pixel.
The proposed method is evaluated on the BraTS 2019 dataset with results demonstrating the effectiveness of the DDU-Nets.
arXiv Detail & Related papers (2020-03-03T05:08:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.