A Framework for Fast Scalable BNN Inference using Googlenet and Transfer
Learning
- URL: http://arxiv.org/abs/2101.00793v2
- Date: Tue, 5 Jan 2021 07:28:38 GMT
- Title: A Framework for Fast Scalable BNN Inference using Googlenet and Transfer
Learning
- Authors: Karthik E
- Abstract summary: This thesis aims to achieve high accuracy in object detection with good real-time performance.
The binarized neural network has shown high performance in various vision tasks such as image classification, object detection, and semantic segmentation.
Results show that the accuracy of objects detected by the transfer learning method is more when compared to the existing methods.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Efficient and accurate object detection in video and image analysis is one of
the major beneficiaries of the advancement in computer vision systems with the
help of deep learning. With the aid of deep learning, more powerful tools
evolved, which are capable to learn high-level and deeper features and thus can
overcome the existing problems in traditional architectures of object detection
algorithms. The work in this thesis aims to achieve high accuracy in object
detection with good real-time performance.
In the area of computer vision, a lot of research is going into the area of
detection and processing of visual information, by improving the existing
algorithms. The binarized neural network has shown high performance in various
vision tasks such as image classification, object detection, and semantic
segmentation. The Modified National Institute of Standards and Technology
database (MNIST), Canadian Institute for Advanced Research (CIFAR), and Street
View House Numbers (SVHN) datasets are used which is implemented using a
pre-trained convolutional neural network (CNN) that is 22 layers deep.
Supervised learning is used in the work, which classifies the particular
dataset with the proper structure of the model. In still images, to improve
accuracy, Googlenet is used. The final layer of the Googlenet is replaced with
the transfer learning to improve the accuracy of the Googlenet. At the same
time, the accuracy in moving images can be maintained by transfer learning
techniques. Hardware is the main backbone for any model to obtain faster
results with a large number of datasets. Here, Nvidia Jetson Nano is used which
is a graphics processing unit (GPU), that can handle a large number of
computations in the process of object detection. Results show that the accuracy
of objects detected by the transfer learning method is more when compared to
the existing methods.
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - Accelerating Object Detection with YOLOv4 for Real-Time Applications [0.276240219662896]
Convolutional Neural Network (CNN) have emerged as a powerful tool for recognizing image content and in computer vision approach for most problems.
This paper introduces the brief introduction of deep learning and object detection framework like Convolutional Neural Network(CNN)
arXiv Detail & Related papers (2024-10-17T17:44:57Z) - Deep Neural Networks in Video Human Action Recognition: A Review [21.00217656391331]
Video behavior recognition is one of the most foundational tasks of computer vision.
Deep neural networks are built for recognizing pixel-level information such as images with RGB, RGB-D, or optical flow formats.
In our article, the performance of deep neural networks surpassed most of the techniques in the feature learning and extraction tasks.
arXiv Detail & Related papers (2023-05-25T03:54:41Z) - Deep Learning Computer Vision Algorithms for Real-time UAVs On-board
Camera Image Processing [77.34726150561087]
This paper describes how advanced deep learning based computer vision algorithms are applied to enable real-time on-board sensor processing for small UAVs.
All algorithms have been developed using state-of-the-art image processing methods based on deep neural networks.
arXiv Detail & Related papers (2022-11-02T11:10:42Z) - Comparison Analysis of Traditional Machine Learning and Deep Learning
Techniques for Data and Image Classification [62.997667081978825]
The purpose of the study is to analyse and compare the most common machine learning and deep learning techniques used for computer vision 2D object classification tasks.
Firstly, we will present the theoretical background of the Bag of Visual words model and Deep Convolutional Neural Networks (DCNN)
Secondly, we will implement a Bag of Visual Words model, the VGG16 CNN Architecture.
arXiv Detail & Related papers (2022-04-11T11:34:43Z) - Hybrid Optimized Deep Convolution Neural Network based Learning Model
for Object Detection [0.0]
Object identification is one of the most fundamental and difficult issues in computer vision.
In recent years, deep learning-based object detection techniques have grabbed the public's interest.
In this study, a unique deep learning classification technique is used to create an autonomous object detecting system.
The suggested framework has a detection accuracy of 0.9864, which is greater than current techniques.
arXiv Detail & Related papers (2022-03-02T04:39:37Z) - NAS-FCOS: Efficient Search for Object Detection Architectures [113.47766862146389]
We propose an efficient method to obtain better object detectors by searching for the feature pyramid network (FPN) and the prediction head of a simple anchor-free object detector.
With carefully designed search space, search algorithms, and strategies for evaluating network quality, we are able to find top-performing detection architectures within 4 days using 8 V100 GPUs.
arXiv Detail & Related papers (2021-10-24T12:20:04Z) - PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective
Crop Layers [111.55817466296402]
We introduce Perspective Crop Layers (PCLs) - a form of perspective crop of the region of interest based on the camera geometry.
PCLs deterministically remove the location-dependent perspective effects while leaving end-to-end training and the number of parameters of the underlying neural network.
PCL offers an easy way to improve the accuracy of existing 3D reconstruction networks by making them geometry aware.
arXiv Detail & Related papers (2020-11-27T08:48:43Z) - Unsupervised Foveal Vision Neural Networks with Top-Down Attention [0.3058685580689604]
We propose the fusion of bottom-up saliency and top-down attention employing only unsupervised learning techniques.
We test the performance of the proposed Gamma saliency technique on the Toronto and CAT2000 databases.
We also develop a topdown attention mechanism based on the Gamma saliency applied to the top layer of CNNs to improve scene understanding in multi-object images or images with strong background clutter.
arXiv Detail & Related papers (2020-10-18T20:55:49Z) - Satellite Image Classification with Deep Learning [0.0]
We describe a deep learning system for classifying objects and facilities from the IARPA Functional Map of the World (fMoW) dataset into 63 different classes.
The system consists of an ensemble of convolutional neural networks and additional neural networks that integrate satellite metadata with image features.
At the time of writing the system is in 2nd place in the fMoW TopCoder competition.
arXiv Detail & Related papers (2020-10-13T15:56:58Z) - Curriculum By Smoothing [52.08553521577014]
Convolutional Neural Networks (CNNs) have shown impressive performance in computer vision tasks such as image classification, detection, and segmentation.
We propose an elegant curriculum based scheme that smoothes the feature embedding of a CNN using anti-aliasing or low-pass filters.
As the amount of information in the feature maps increases during training, the network is able to progressively learn better representations of the data.
arXiv Detail & Related papers (2020-03-03T07:27:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.