Leveraging Image Complexity in Macro-Level Neural Network Design for
Medical Image Segmentation
- URL: http://arxiv.org/abs/2112.11065v1
- Date: Tue, 21 Dec 2021 09:49:47 GMT
- Title: Leveraging Image Complexity in Macro-Level Neural Network Design for
Medical Image Segmentation
- Authors: Tariq M. Khan, Syed S. Naqvi, Erik Meijering
- Abstract summary: We show that image complexity can be used as a guideline in choosing what is best for a given dataset.
For high-complexity datasets, a shallow network running on the original images may yield better segmentation results than a deep network running on downsampled images.
- Score: 3.974175960216864
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent progress in encoder-decoder neural network architecture design has led
to significant performance improvements in a wide range of medical image
segmentation tasks. However, state-of-the-art networks for a given task may be
too computationally demanding to run on affordable hardware, and thus users
often resort to practical workarounds by modifying various macro-level design
aspects. Two common examples are downsampling of the input images and reducing
the network depth to meet computer memory constraints. In this paper we
investigate the effects of these changes on segmentation performance and show
that image complexity can be used as a guideline in choosing what is best for a
given dataset. We consider four statistical measures to quantify image
complexity and evaluate their suitability on ten different public datasets. For
the purpose of our experiments we also propose two new encoder-decoder
architectures representing shallow and deep networks that are more memory
efficient than currently popular networks. Our results suggest that median
frequency is the best complexity measure in deciding about an acceptable input
downsampling factor and network depth. For high-complexity datasets, a shallow
network running on the original images may yield better segmentation results
than a deep network running on downsampled images, whereas the opposite may be
the case for low-complexity images.
Related papers
- Parameter-Inverted Image Pyramid Networks [49.35689698870247]
We propose a novel network architecture known as the Inverted Image Pyramid Networks (PIIP)
Our core idea is to use models with different parameter sizes to process different resolution levels of the image pyramid.
PIIP achieves superior performance in tasks such as object detection, segmentation, and image classification.
arXiv Detail & Related papers (2024-06-06T17:59:10Z) - HistoSeg : Quick attention with multi-loss function for multi-structure
segmentation in digital histology images [0.696194614504832]
Medical image segmentation assists in computer-aided diagnosis, surgeries, and treatment.
We proposed an generalization-Decoder Network, Quick Attention Module and a Multi Loss Function.
We evaluate the capability of our proposed network on two publicly available datasets for medical image segmentation MoNuSeg and GlaS.
arXiv Detail & Related papers (2022-09-01T21:10:00Z) - Learning Enriched Features for Fast Image Restoration and Enhancement [166.17296369600774]
This paper presents a holistic goal of maintaining spatially-precise high-resolution representations through the entire network.
We learn an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
Our approach achieves state-of-the-art results for a variety of image processing tasks, including defocus deblurring, image denoising, super-resolution, and image enhancement.
arXiv Detail & Related papers (2022-04-19T17:59:45Z) - Restormer: Efficient Transformer for High-Resolution Image Restoration [118.9617735769827]
convolutional neural networks (CNNs) perform well at learning generalizable image priors from large-scale data.
Transformers have shown significant performance gains on natural language and high-level vision tasks.
Our model, named Restoration Transformer (Restormer), achieves state-of-the-art results on several image restoration tasks.
arXiv Detail & Related papers (2021-11-18T18:59:10Z) - SDWNet: A Straight Dilated Network with Wavelet Transformation for Image
Deblurring [23.86692375792203]
Image deblurring is a computer vision problem that aims to recover a sharp image from a blurred image.
Our model uses dilated convolution to enable the obtainment of the large receptive field with high spatial resolution.
We propose a novel module using the wavelet transform, which effectively helps the network to recover clear high-frequency texture details.
arXiv Detail & Related papers (2021-10-12T07:58:10Z) - Image Complexity Guided Network Compression for Biomedical Image
Segmentation [5.926887379656135]
We propose an image complexity-guided network compression technique for biomedical image segmentation.
We map the dataset complexity to the target network accuracy degradation caused by compression.
The mapping is used to determine the convolutional layer-wise multiplicative factor for generating a compressed network.
arXiv Detail & Related papers (2021-07-06T22:28:10Z) - Scalable Visual Transformers with Hierarchical Pooling [61.05787583247392]
We propose a Hierarchical Visual Transformer (HVT) which progressively pools visual tokens to shrink the sequence length.
It brings a great benefit by scaling dimensions of depth/width/resolution/patch size without introducing extra computational complexity.
Our HVT outperforms the competitive baselines on ImageNet and CIFAR-100 datasets.
arXiv Detail & Related papers (2021-03-19T03:55:58Z) - Learning Frequency-aware Dynamic Network for Efficient Super-Resolution [56.98668484450857]
This paper explores a novel frequency-aware dynamic network for dividing the input into multiple parts according to its coefficients in the discrete cosine transform (DCT) domain.
In practice, the high-frequency part will be processed using expensive operations and the lower-frequency part is assigned with cheap operations to relieve the computation burden.
Experiments conducted on benchmark SISR models and datasets show that the frequency-aware dynamic network can be employed for various SISR neural architectures.
arXiv Detail & Related papers (2021-03-15T12:54:26Z) - LSHR-Net: a hardware-friendly solution for high-resolution computational
imaging using a mixed-weights neural network [5.475867050068397]
We propose a novel hardware-friendly solution based on mixed-weights neural networks for computational imaging.
In particular, learned binary-weight sensing patterns are tailored to the sampling device.
Our method has been validated on benchmark datasets and achieved the state of the art reconstruction accuracy.
arXiv Detail & Related papers (2020-04-27T20:59:51Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.