Hybrid Channel Based Pedestrian Detection
- URL: http://arxiv.org/abs/1912.12431v2
- Date: Thu, 30 Jan 2020 04:20:34 GMT
- Title: Hybrid Channel Based Pedestrian Detection
- Authors: Fiseha B. Tesema, Hong Wu, Mingjian Chen, Junpeng Lin, William Zhu,
Kaizhu Huang
- Abstract summary: We propose a new pedestrian detection framework, which extends the successful RPN+BF framework to combine handcrafted features and CNN features.
Our experiments show that the developed handcrafted features can reach better detection accuracy than the CNN features extracted from the VGG-16 net.
- Score: 15.696919306737321
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Pedestrian detection has achieved great improvements with the help of
Convolutional Neural Networks (CNNs). CNN can learn high-level features from
input images, but the insufficient spatial resolution of CNN feature channels
(feature maps) may cause a loss of information, which is harmful especially to
small instances. In this paper, we propose a new pedestrian detection
framework, which extends the successful RPN+BF framework to combine handcrafted
features and CNN features. RoI-pooling is used to extract features from both
handcrafted channels (e.g. HOG+LUV, CheckerBoards or RotatedFilters) and CNN
channels. Since handcrafted channels always have higher spatial resolution than
CNN channels, we apply RoI-pooling with larger output resolution to handcrafted
channels to keep more detailed information. Our ablation experiments show that
the developed handcrafted features can reach better detection accuracy than the
CNN features extracted from the VGG-16 net, and a performance gain can be
achieved by combining them. Experimental results on Caltech pedestrian dataset
with the original annotations and the improved annotations demonstrate the
effectiveness of the proposed approach. When using a more advanced RPN in our
framework, our approach can be further improved and get competitive results on
both benchmarks.
Related papers
- Distilling Channels for Efficient Deep Tracking [68.13422829310835]
This paper presents a novel framework termed channel distillation to facilitate deep trackers.
We show that an integrated formulation can turn feature compression, response map generation, and model update into a unified energy minimization problem.
The resulting deep tracker is accurate, fast, and has low memory requirements.
arXiv Detail & Related papers (2024-09-18T08:09:20Z) - Research on fusing topological data analysis with convolutional neural network [1.0222202698528364]
This paper proposes a feature fusion method based on Topological Data Analysis (TDA) and CNN, named TDA-CNN.
This method combines numerical distribution features captured by CNN with topological structure features captured by TDA to improve the feature learning and representation ability of CNN.
Experimental validation on datasets such as Intel Image, Gender Images, and Chinese calligraphy Styles by Calligraphers demonstrates that TDA-CNN improves the performance of VGG16, DenseNet121, and GoogleNet networks by 17.5%, 7.11%, and 4.45%, respectively.
arXiv Detail & Related papers (2024-06-19T07:43:53Z) - Kronecker Product Feature Fusion for Convolutional Neural Network in
Remote Sensing Scene Classification [0.0]
CNN can extract hierarchical convolutional features from remote sensing imagery.
Two successful Feature Fusion methods, Add and Concat, are employed in certain state-of-the-art CNN algorithms.
We propose a novel Feature Fusion algorithm, which unifies the aforementioned methods using the Kronecker Product (KPFF)
arXiv Detail & Related papers (2024-01-08T19:01:01Z) - Testing the Channels of Convolutional Neural Networks [8.927538538637783]
We propose techniques for testing the channels of convolutional neural networks (CNNs)
We design FtGAN, an extension to GAN, that can generate test data with varying the intensity of a channel of a target CNN.
We also proposed a channel selection algorithm to find representative channels for testing.
arXiv Detail & Related papers (2023-03-06T09:58:39Z) - An Efficient Evolutionary Deep Learning Framework Based on Multi-source
Transfer Learning to Evolve Deep Convolutional Neural Networks [8.40112153818812]
Convolutional neural networks (CNNs) have constantly achieved better performance over years by introducing more complex topology, and enlarging the capacity towards deeper and wider CNNs.
The computational cost is still the bottleneck of automatically designing CNNs.
In this paper, inspired by transfer learning, a new evolutionary computation based framework is proposed to efficiently evolve CNNs.
arXiv Detail & Related papers (2022-12-07T20:22:58Z) - Attention-based Feature Compression for CNN Inference Offloading in Edge
Computing [93.67044879636093]
This paper studies the computational offloading of CNN inference in device-edge co-inference systems.
We propose a novel autoencoder-based CNN architecture (AECNN) for effective feature extraction at end-device.
Experiments show that AECNN can compress the intermediate data by more than 256x with only about 4% accuracy loss.
arXiv Detail & Related papers (2022-11-24T18:10:01Z) - SAR Despeckling Using Overcomplete Convolutional Networks [53.99620005035804]
despeckling is an important problem in remote sensing as speckle degrades SAR images.
Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods.
This study employs an overcomplete CNN architecture to focus on learning low-level features by restricting the receptive field.
We show that the proposed network improves despeckling performance compared to recent despeckling methods on synthetic and real SAR images.
arXiv Detail & Related papers (2022-05-31T15:55:37Z) - Image Super-resolution with An Enhanced Group Convolutional Neural
Network [102.2483249598621]
CNNs with strong learning abilities are widely chosen to resolve super-resolution problem.
We present an enhanced super-resolution group CNN (ESRGCNN) with a shallow architecture.
Experiments report that our ESRGCNN surpasses the state-of-the-arts in terms of SISR performance, complexity, execution speed, image quality evaluation and visual effect in SISR.
arXiv Detail & Related papers (2022-05-29T00:34:25Z) - DeepTx: Deep Learning Beamforming with Channel Prediction [8.739166282613118]
In this study, we focus on machine learning algorithms for the transmitter.
We consider beamforming and propose a CNN which, for a given uplink channel estimate as input, outputs downlink channel information to be used for beamforming.
The main task of the neural network is to predict the channel evolution between uplink and downlink slots, but it can also learn to handle inefficiencies and errors in the whole chain.
arXiv Detail & Related papers (2022-02-16T11:19:54Z) - The Mind's Eye: Visualizing Class-Agnostic Features of CNNs [92.39082696657874]
We propose an approach to visually interpret CNN features given a set of images by creating corresponding images that depict the most informative features of a specific layer.
Our method uses a dual-objective activation and distance loss, without requiring a generator network nor modifications to the original model.
arXiv Detail & Related papers (2021-01-29T07:46:39Z) - Approximation and Non-parametric Estimation of ResNet-type Convolutional
Neural Networks [52.972605601174955]
We show a ResNet-type CNN can attain the minimax optimal error rates in important function classes.
We derive approximation and estimation error rates of the aformentioned type of CNNs for the Barron and H"older classes.
arXiv Detail & Related papers (2019-03-24T19:42:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.