Related papers: On-Sensor Binarized Fully Convolutional Neural Network with A Pixel Processor Array

URL: http://arxiv.org/abs/2202.00836v1
Date: Wed, 2 Feb 2022 01:18:40 GMT
Title: On-Sensor Binarized Fully Convolutional Neural Network with A Pixel Processor Array
Authors: Yanan Liu, Laurie Bose, Yao Lu, Piotr Dudek, Walterio Mayol-Cuevas
Abstract summary: This work presents a method to implement fully convolutional neural networks (FCNs) on Pixel Processor Array (PPA) sensors. We design and train binarized FCN for both binary weights and activations using batchnorm, group convolution, and learnable threshold for binarization. We demonstrate the first implementation of an FCN on a PPA device, performing three convolution layers entirely in the pixel-level processors.
Score: 17.4097919720973
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This work presents a method to implement fully convolutional neural networks (FCNs) on Pixel Processor Array (PPA) sensors, and demonstrates coarse segmentation and object localisation tasks. We design and train binarized FCN for both binary weights and activations using batchnorm, group convolution, and learnable threshold for binarization, producing networks small enough to be embedded on the focal plane of the PPA, with limited local memory resources, and using parallel elementary add/subtract, shifting, and bit operations only. We demonstrate the first implementation of an FCN on a PPA device, performing three convolution layers entirely in the pixel-level processors. We use this architecture to demonstrate inference generating heat maps for object segmentation and localisation at over 280 FPS using the SCAMP-5 PPA vision chip.
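Illustrative sketch (not the authors' implementation): the abstract notes that the binarized network runs with elementary add/subtract, shifting, and bit operations only. The pure-Python example below shows, under assumed names and values, why a convolution with weights restricted to +1/-1 needs no multiplications, and how a threshold binarizes the result. The functions `binary_conv2d` and `binarize`, the 4x4 input, the kernel, and the threshold value 5 are all hypothetical; in the paper the threshold is learned and the computation runs on the SCAMP-5 pixel processors rather than in software loops.

```python
# Hypothetical sketch: binary-weight convolution using only add/subtract,
# followed by thresholding. Names and values are illustrative assumptions,
# not taken from the paper or from any SCAMP-5 API.

def binary_conv2d(image, weights):
    """Valid-mode 2D correlation where every weight is +1 or -1.

    Because weights are +/-1, each kernel tap is an add or a subtract
    of a shifted copy of the input; no multiplications are needed.
    """
    kh, kw = len(weights), len(weights[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = [[0] * out_w for _ in range(out_h)]
    for dy in range(kh):
        for dx in range(kw):
            w = weights[dy][dx]
            if w not in (1, -1):
                raise ValueError("weights must be +1 or -1")
            for y in range(out_h):
                for x in range(out_w):
                    if w == 1:
                        out[y][x] += image[y + dy][x + dx]
                    else:
                        out[y][x] -= image[y + dy][x + dx]
    return out


def binarize(feature_map, threshold):
    """Map each value to +1 if it exceeds the threshold, else -1."""
    return [[1 if v > threshold else -1 for v in row] for row in feature_map]


# Binary input: left half +1, right half -1.
image = [
    [1, 1, -1, -1],
    [1, 1, -1, -1],
    [1, 1, -1, -1],
    [1, 1, -1, -1],
]
# Binary 3x3 kernel (illustrative values).
weights = [
    [1, -1, -1],
    [1, -1, -1],
    [1, -1, -1],
]

features = binary_conv2d(image, weights)
activations = binarize(features, threshold=5)  # threshold chosen by hand here
print(features)
print(activations)
```

The loop order (one pass over the whole image per kernel tap) mirrors the idea of applying the same add or subtract to every pixel in parallel, which is the style of operation a pixel processor array provides.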

Related papers

Efficient Spatio-Temporal Signal Recognition on Edge Devices Using PointLCA-Net [0.45609532372046985]
This paper presents an approach that combines PointNet's feature extraction with the in-memory computing capabilities and energy efficiency of neuromorphic systems for temporal signal recognition. PointNet achieves high accuracy and significantly lower energy burden during both inference and training than comparable approaches.
arXiv Detail & Related papers (2024-11-21T20:48:40Z)
Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors [4.95475852994362]
We propose a new form of quantization to tile neural network layers with sequences of bits to achieve sub-bit compression of binary-weighted neural networks. We employ the approach to both fully-connected and convolutional layers, which make up the breadth of space in most neural architectures.
arXiv Detail & Related papers (2024-07-16T15:55:38Z)
BDC-Occ: Binarized Deep Convolution Unit For Binarized Occupancy Network [55.21288428359509]
Existing 3D occupancy networks demand significant hardware resources, hindering the deployment of edge devices. We propose a novel binarized deep convolution (BDC) unit that effectively enhances performance while increasing the number of binarized convolutional layers. Our BDC-Occ model is created by applying the proposed BDC unit to binarize the existing 3D occupancy networks.
arXiv Detail & Related papers (2024-05-27T10:44:05Z)
Signal Processing for Implicit Neural Representations [80.38097216996164]
Implicit Neural Representations (INRs) encode continuous multi-media data via multi-layer perceptrons. Existing works manipulate such continuous representations via processing on their discretized instance. We propose an implicit neural signal processing network, dubbed INSP-Net, via differential operators on INR.
arXiv Detail & Related papers (2022-10-17T06:29:07Z)
Bandwidth-efficient distributed neural network architectures with application to body sensor networks [73.02174868813475]
This paper describes a conceptual design methodology to design distributed neural network architectures. We show that the proposed framework enables up to a factor 20 in bandwidth reduction with minimal loss. While the application focus of this paper is on wearable brain-computer interfaces, the proposed methodology can be applied in other sensor network-like applications as well.
arXiv Detail & Related papers (2022-10-14T12:35:32Z)
Two-Stream Networks for Object Segmentation in Videos [83.1383102535413]
We present a Two-Stream Network (TSN) to segment the seen pixels based on their pixel-level memory retrieval. A holistic understanding of the instance is obtained with dynamic segmentation heads conditioned on the features of the target instance. The compact instance stream effectively improves the segmentation accuracy of the unseen pixels, while fusing two streams with the adaptive routing map leads to an overall performance boost.
arXiv Detail & Related papers (2022-08-08T10:22:42Z)
Trident Pyramid Networks: The importance of processing at the feature pyramid level for better object detection [50.008529403150206]
We present a new core architecture called Trident Pyramid Network (TPN). TPN allows for a deeper design and for a better balance between communication-based processing and self-processing. We show consistent improvements when using our TPN core on the object detection benchmark, outperforming the popular BiFPN baseline by 1.5 AP.
arXiv Detail & Related papers (2021-10-08T09:59:59Z)
Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration [83.84684675841167]
We propose a novel encoding scheme using -1, +1 to decompose quantized neural networks (QNNs) into multi-branch binary networks. We validate the effectiveness of our method on large-scale image classification, object detection, and semantic segmentation tasks.
arXiv Detail & Related papers (2021-06-18T03:11:15Z)
Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition [21.818744369503197]
We develop an efficient point cloud learning network (EPC-Net) to form a global descriptor for visual place recognition. Our proposed method can achieve state-of-the-art performance with lower parameters, FLOPs, and runtime per frame.
arXiv Detail & Related papers (2021-01-07T05:15:31Z)
Efficient Deep Learning of Non-local Features for Hyperspectral Image Classification [28.72648031677868]
A deep fully convolutional network (FCN) with an efficient non-local module, named ENL-FCN, is proposed for hyperspectral image (HSI) classification. The proposed framework, a deep FCN, considers an entire HSI as input and extracts spectral-spatial information in a local receptive field. By using a recurrent operation, each pixel's response is aggregated from all pixels of the HSI.
arXiv Detail & Related papers (2020-08-02T19:13:22Z)
Fully Embedding Fast Convolutional Networks on Pixel Processor Arrays [16.531637803429277]
We present a novel method of CNN inference for pixel processor array (PPA) vision sensors. Our approach can perform convolutional layers, max pooling, ReLU, and a final fully connected layer entirely upon the PPA sensor. This is the first work demonstrating CNN inference conducted entirely upon the processor array of a PPA vision sensor device, requiring no external processing.
arXiv Detail & Related papers (2020-04-27T01:00:35Z)

This list is automatically generated from the titles and abstracts of the papers on this site.