An Empirical Study of Attention Networks for Semantic Segmentation
- URL: http://arxiv.org/abs/2309.10217v1
- Date: Tue, 19 Sep 2023 00:07:57 GMT
- Title: An Empirical Study of Attention Networks for Semantic Segmentation
- Authors: Hao Guo, Hongbiao Si, Guilin Jiang, Wei Zhang, Zhiyan Liu, Xuanyi Zhu,
Xulong Zhang, Yang Liu
- Abstract summary: Recently, the decoders based on attention achieve state-of-the-art (SOTA) performance on various datasets.
This paper first conducts experiments to analyze their computation complexity and compare their performance.
- Score: 11.000308726481236
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semantic segmentation is a vital problem in computer vision. Recently, a
common solution to semantic segmentation is the end-to-end convolution neural
network, which is much more accurate than traditional methods.Recently, the
decoders based on attention achieve state-of-the-art (SOTA) performance on
various datasets. But these networks always are compared with the mIoU of
previous SOTA networks to prove their superiority and ignore their
characteristics without considering the computation complexity and precision in
various categories, which is essential for engineering applications. Besides,
the methods to analyze the FLOPs and memory are not consistent between
different networks, which makes the comparison hard to be utilized. What's
more, various methods utilize attention in semantic segmentation, but the
conclusion of these methods is lacking. This paper first conducts experiments
to analyze their computation complexity and compare their performance. Then it
summarizes suitable scenes for these networks and concludes key points that
should be concerned when constructing an attention network. Last it points out
some future directions of the attention network.
Related papers
- The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by
Isolating Task-Specific Subnetworks in Feedforward Neural Networks [0.0]
We identify a methodology and network representational structure which allows a pruned network to employ previously unused weights to learn subsequent tasks.
We show that networks trained using our approaches are able to learn multiple tasks, which may be related or unrelated, in parallel or in sequence without sacrificing performance on any task or exhibiting catastrophic forgetting.
arXiv Detail & Related papers (2022-07-18T15:07:13Z) - Self-Ensembling GAN for Cross-Domain Semantic Segmentation [107.27377745720243]
This paper proposes a self-ensembling generative adversarial network (SE-GAN) exploiting cross-domain data for semantic segmentation.
In SE-GAN, a teacher network and a student network constitute a self-ensembling model for generating semantic segmentation maps, which together with a discriminator, forms a GAN.
Despite its simplicity, we find SE-GAN can significantly boost the performance of adversarial training and enhance the stability of the model.
arXiv Detail & Related papers (2021-12-15T09:50:25Z) - M-FasterSeg: An Efficient Semantic Segmentation Network Based on Neural
Architecture Search [0.0]
This paper proposes an improved structure of a semantic segmentation network based on a deep learning network.
First, a neural network search method NAS (Neural Architecture Search) is used to find a semantic segmentation network with multiple resolution branches.
In the search process, combine the self-attention network structure module to adjust the searched neural network structure, and then combine the semantic segmentation network searched by different branches to form a fast semantic segmentation network structure.
arXiv Detail & Related papers (2021-12-15T06:46:55Z) - CI-Net: Contextual Information for Joint Semantic Segmentation and Depth
Estimation [2.8785764686013837]
We propose a network injected with contextual information (CI-Net) to solve the problem.
With supervision from semantic labels, the network is embedded with contextual information so that it could understand the scene better.
We evaluate the proposed CI-Net on the NYU-Depth-v2 and SUN-RGBD datasets.
arXiv Detail & Related papers (2021-07-29T07:58:25Z) - Variational Structured Attention Networks for Deep Visual Representation
Learning [49.80498066480928]
We propose a unified deep framework to jointly learn both spatial attention maps and channel attention in a principled manner.
Specifically, we integrate the estimation and the interaction of the attentions within a probabilistic representation learning framework.
We implement the inference rules within the neural network, thus allowing for end-to-end learning of the probabilistic and the CNN front-end parameters.
arXiv Detail & Related papers (2021-03-05T07:37:24Z) - SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from
Monocular images [94.36401543589523]
We introduce the concept of semantic objectness to exploit the geometric relationship of these two tasks.
We then propose a Semantic Object and Depth Estimation Network (SOSD-Net) based on the objectness assumption.
To the best of our knowledge, SOSD-Net is the first network that exploits the geometry constraint for simultaneous monocular depth estimation and semantic segmentation.
arXiv Detail & Related papers (2021-01-19T02:41:03Z) - Training and Inference for Integer-Based Semantic Segmentation Network [18.457074855823315]
We propose a new quantization framework for training and inference of semantic segmentation networks.
Our framework is evaluated on mainstream semantic segmentation networks like FCN-VGG16 and DeepLabv3-ResNet50.
arXiv Detail & Related papers (2020-11-30T02:07:07Z) - Learning Connectivity of Neural Networks from a Topological Perspective [80.35103711638548]
We propose a topological perspective to represent a network into a complete graph for analysis.
By assigning learnable parameters to the edges which reflect the magnitude of connections, the learning process can be performed in a differentiable manner.
This learning process is compatible with existing networks and owns adaptability to larger search spaces and different tasks.
arXiv Detail & Related papers (2020-08-19T04:53:31Z) - Boosting Connectivity in Retinal Vessel Segmentation via a Recursive
Semantics-Guided Network [23.936946593048987]
A U-shape network is enhanced by introducing a semantics-guided module, which integrates the enriched semantics information to shallow layers for guiding the network to explore more powerful features.
The carefully designed semantics-guided network has been extensively evaluated on several public datasets.
arXiv Detail & Related papers (2020-04-24T09:18:04Z) - Binary Neural Networks: A Survey [126.67799882857656]
The binary neural network serves as a promising technique for deploying deep models on resource-limited devices.
The binarization inevitably causes severe information loss, and even worse, its discontinuity brings difficulty to the optimization of the deep network.
We present a survey of these algorithms, mainly categorized into the native solutions directly conducting binarization, and the optimized ones using techniques like minimizing the quantization error, improving the network loss function, and reducing the gradient error.
arXiv Detail & Related papers (2020-03-31T16:47:20Z) - Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning [86.45526827323954]
Weakly-supervised semantic segmentation is a challenging task as no pixel-wise label information is provided for training.
We propose an iterative algorithm to learn such pairwise relations.
We show that the proposed algorithm performs favorably against the state-of-the-art methods.
arXiv Detail & Related papers (2020-02-19T10:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.