Selective Convolutional Network: An Efficient Object Detector with
Ignoring Background
- URL: http://arxiv.org/abs/2002.01205v1
- Date: Tue, 4 Feb 2020 10:07:01 GMT
- Title: Selective Convolutional Network: An Efficient Object Detector with
Ignoring Background
- Authors: Hefei Ling, Yangyang Qin, Li Zhang, Yuxuan Shi, Ping Li
- Abstract summary: We introduce an efficient object detector called Selective Convolutional Network (SCN), which selectively calculates only on the locations that contain meaningful and conducive information.
To solve it, we design an elaborate structure with negligible overheads to guide the network where to look next.
- Score: 28.591619763438054
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: It is well known that attention mechanisms can effectively improve the
performance of many CNNs including object detectors. Instead of refining
feature maps prevalently, we reduce the prohibitive computational complexity by
a novel attempt at attention. Therefore, we introduce an efficient object
detector called Selective Convolutional Network (SCN), which selectively
calculates only on the locations that contain meaningful and conducive
information. The basic idea is to exclude the insignificant background areas,
which effectively reduces the computational cost especially during the feature
extraction. To solve it, we design an elaborate structure with negligible
overheads to guide the network where to look next. It's end-to-end trainable
and easy-embedding. Without additional segmentation datasets, we explores two
different train strategies including direct supervision and indirect
supervision. Extensive experiments assess the performance on PASCAL VOC2007 and
MS COCO detection datasets. Results show that SSD and Pelee integrated with our
method averagely reduce the calculations in a range of 1/5 and 1/3 with slight
loss of accuracy, demonstrating the feasibility of SCN.
Related papers
- Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery [51.83786195178233]
We design a Knowledge Discovery Network (KDN) to implement the renormalization group theory in terms of efficient feature extraction.
Renormalized connection (RC) on the KDN enables synergistic focusing'' of multi-scale features.
RCs extend the multi-level feature's divide-and-conquer'' mechanism of the FPN-based detectors to a wide range of scale-preferred tasks.
arXiv Detail & Related papers (2024-09-09T13:56:22Z) - ESOD: Efficient Small Object Detection on High-Resolution Images [36.80623357577051]
Small objects are usually sparsely distributed and locally clustered.
Massive feature extraction computations are wasted on the non-target background area of images.
We propose to reuse the detector's backbone to conduct feature-level object-seeking and patch-slicing.
arXiv Detail & Related papers (2024-07-23T12:21:23Z) - Efficient Visual Fault Detection for Freight Train via Neural Architecture Search with Data Volume Robustness [6.5769476554745925]
We propose an efficient NAS-based framework for visual fault detection of freight trains.
First, we design a scale-aware search space for discovering an effective receptive field in the head.
Second, we explore the robustness of data volume to reduce search costs based on the specifically designed search space.
arXiv Detail & Related papers (2024-05-27T09:47:49Z) - ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection [65.59969454655996]
We propose an efficient change detection framework, ELGC-Net, which leverages rich contextual information to precisely estimate change regions.
Our proposed ELGC-Net sets a new state-of-the-art performance in remote sensing change detection benchmarks.
We also introduce ELGC-Net-LW, a lighter variant with significantly reduced computational complexity, suitable for resource-constrained settings.
arXiv Detail & Related papers (2024-03-26T17:46:25Z) - ELA: Efficient Local Attention for Deep Convolutional Neural Networks [15.976475674061287]
This paper introduces an Efficient Local Attention (ELA) method that achieves substantial performance improvements with a simple structure.
To overcome these challenges, we propose the incorporation of 1D convolution and Group Normalization feature enhancement techniques.
ELA can be seamlessly integrated into deep CNN networks such as ResNet, MobileNet, and DeepLab.
arXiv Detail & Related papers (2024-03-02T08:06:18Z) - LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images [11.41884406231953]
In this paper, we propose an effective lightweight oriented object detector (LO-Det)
Specifically, a channel separation-aggregation (CSA) structure is designed to simplify the complexity of stacked separable convolutions.
The proposed LO-Det can run very fast even on embedded devices with the competitive accuracy of detecting oriented objects.
arXiv Detail & Related papers (2022-09-16T04:28:01Z) - CloudAttention: Efficient Multi-Scale Attention Scheme For 3D Point
Cloud Learning [81.85951026033787]
We set transformers in this work and incorporate them into a hierarchical framework for shape classification and part and scene segmentation.
We also compute efficient and dynamic global cross attentions by leveraging sampling and grouping at each iteration.
The proposed hierarchical model achieves state-of-the-art shape classification in mean accuracy and yields results on par with the previous segmentation methods.
arXiv Detail & Related papers (2022-07-31T21:39:15Z) - SAR Despeckling Using Overcomplete Convolutional Networks [53.99620005035804]
despeckling is an important problem in remote sensing as speckle degrades SAR images.
Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods.
This study employs an overcomplete CNN architecture to focus on learning low-level features by restricting the receptive field.
We show that the proposed network improves despeckling performance compared to recent despeckling methods on synthetic and real SAR images.
arXiv Detail & Related papers (2022-05-31T15:55:37Z) - Resolution Adaptive Networks for Efficient Inference [53.04907454606711]
We propose a novel Resolution Adaptive Network (RANet), which is inspired by the intuition that low-resolution representations are sufficient for classifying "easy" inputs.
In RANet, the input images are first routed to a lightweight sub-network that efficiently extracts low-resolution representations.
High-resolution paths in the network maintain the capability to recognize the "hard" samples.
arXiv Detail & Related papers (2020-03-16T16:54:36Z) - BiDet: An Efficient Binarized Object Detector [96.19708396510894]
We propose a binarized neural network learning method called BiDet for efficient object detection.
Our BiDet fully utilizes the representational capacity of the binary neural networks for object detection by redundancy removal.
Our method outperforms the state-of-the-art binary neural networks by a sizable margin.
arXiv Detail & Related papers (2020-03-09T08:16:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.