Dense Dual-Path Network for Real-time Semantic Segmentation
- URL: http://arxiv.org/abs/2010.10778v1
- Date: Wed, 21 Oct 2020 06:11:41 GMT
- Title: Dense Dual-Path Network for Real-time Semantic Segmentation
- Authors: Xinneng Yang, Yan Wu, Junqiao Zhao, Feilin Liu
- Abstract summary: We introduce a novel Dual-Path Network (DDPNet) for real-time semantic segmentation under resource constraints.
DDPNet achieves 75.3% mIoU with 52.6 FPS for an input of 1024 X 2048 resolution on a single GTX 1080Ti card.
- Score: 7.8381744043673045
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semantic segmentation has achieved remarkable results with high computational
cost and a large number of parameters. However, real-world applications require
efficient inference speed on embedded devices. Most previous works address the
challenge by reducing depth, width and layer capacity of network, which leads
to poor performance. In this paper, we introduce a novel Dense Dual-Path
Network (DDPNet) for real-time semantic segmentation under resource
constraints. We design a light-weight and powerful backbone with dense
connectivity to facilitate feature reuse throughout the whole network and the
proposed Dual-Path module (DPM) to sufficiently aggregate multi-scale contexts.
Meanwhile, a simple and effective framework is built with a skip architecture
utilizing the high-resolution feature maps to refine the segmentation output
and an upsampling module leveraging context information from the feature maps
to refine the heatmaps. The proposed DDPNet shows an obvious advantage in
balancing accuracy and speed. Specifically, on Cityscapes test dataset, DDPNet
achieves 75.3% mIoU with 52.6 FPS for an input of 1024 X 2048 resolution on a
single GTX 1080Ti card. Compared with other state-of-the-art methods, DDPNet
achieves a significant better accuracy with a comparable speed and fewer
parameters.
Related papers
- Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation [15.83905822380148]
RDRNet is a Dual-Resolution Network dedicated to real-time semantic segmentation.
RDRNet employs a two-branch architecture, utilizing multi-path blocks during training and re parameterizing them into single-path blocks during inference.
Experimental results on the Cityscapes, CamVid, and Pascal VOC 2012 datasets demonstrate that RDRNet outperforms existing state-of-the-art models in terms of both performance and speed.
arXiv Detail & Related papers (2024-06-18T10:59:10Z) - DPNet: Dual-Path Network for Real-time Object Detection with Lightweight
Attention [15.360769793764526]
This paper presents a dual-path network, named DPNet, with a lightweight attention scheme for real-time object detection.
DPNet achieves state-of-the-art trade-off between detection accuracy and implementation efficiency.
arXiv Detail & Related papers (2022-09-28T09:11:01Z) - Lightweight and Progressively-Scalable Networks for Semantic
Segmentation [100.63114424262234]
Multi-scale learning frameworks have been regarded as a capable class of models to boost semantic segmentation.
In this paper, we thoroughly analyze the design of convolutional blocks and the ways of interactions across multiple scales.
We devise Lightweight and Progressively-Scalable Networks (LPS-Net) that novelly expands the network complexity in a greedy manner.
arXiv Detail & Related papers (2022-07-27T16:00:28Z) - SFNet: Faster and Accurate Semantic Segmentation via Semantic Flow [88.97790684009979]
A common practice to improve the performance is to attain high-resolution feature maps with strong semantic representation.
We propose a Flow Alignment Module (FAM) to learn textitSemantic Flow between feature maps of adjacent levels.
We also present a novel Gated Dual Flow Alignment Module to directly align high-resolution feature maps and low-resolution feature maps.
arXiv Detail & Related papers (2022-07-10T08:25:47Z) - DPNET: Dual-Path Network for Efficient Object Detectioj with Lightweight
Self-Attention [16.13989397708127]
DPNet is a dual path network for efficient object detection with lightweight self-attention.
It achieves 29.0% AP on COCO dataset, with only 1.14 GFLOPs and 2.27M model size for a 320x320 image.
arXiv Detail & Related papers (2021-10-31T13:38:16Z) - DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and
Transformers [105.74546828182834]
We show a hardware-efficient dynamic inference regime, named dynamic weight slicing, which adaptively slice a part of network parameters for inputs with diverse difficulty levels.
We present dynamic slimmable network (DS-Net) and dynamic slice-able network (DS-Net++) by input-dependently adjusting filter numbers of CNNs and multiple dimensions in both CNNs and transformers.
arXiv Detail & Related papers (2021-09-21T09:57:21Z) - Rethinking BiSeNet For Real-time Semantic Segmentation [6.622485130017622]
BiSeNet has been proved to be a popular two-stream network for real-time segmentation.
We propose a novel structure named Short-Term Dense Concatenate network (STDC) by removing structure redundancy.
arXiv Detail & Related papers (2021-04-27T13:49:47Z) - Binary DAD-Net: Binarized Driveable Area Detection Network for
Autonomous Driving [94.40107679615618]
This paper proposes a novel binarized driveable area detection network (binary DAD-Net)
It uses only binary weights and activations in the encoder, the bottleneck, and the decoder part.
It outperforms state-of-the-art semantic segmentation networks on public datasets.
arXiv Detail & Related papers (2020-06-15T07:09:01Z) - BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time
Semantic Segmentation [118.46210049742993]
We propose an efficient and effective architecture with a good trade-off between speed and accuracy, termed Bilateral spatial Network (BiSeNet V2)
For a 2,048x1, input, we achieve 72.6% Mean IoU on the Cityscapes test set with a speed of 156 FPS on one NVIDIA GeForce 1080 Ti card, which is significantly faster than existing methods, yet we achieve better segmentation accuracy.
arXiv Detail & Related papers (2020-04-05T10:26:38Z) - Real-Time High-Performance Semantic Image Segmentation of Urban Street
Scenes [98.65457534223539]
We propose a real-time high-performance DCNN-based method for robust semantic segmentation of urban street scenes.
The proposed method achieves the accuracy of 73.6% and 68.0% mean Intersection over Union (mIoU) with the inference speed of 51.0 fps and 39.3 fps.
arXiv Detail & Related papers (2020-03-11T08:45:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.