Hardware architecture for high throughput event visual data filtering
with matrix of IIR filters algorithm
- URL: http://arxiv.org/abs/2207.00860v1
- Date: Sat, 2 Jul 2022 15:18:53 GMT
- Title: Hardware architecture for high throughput event visual data filtering
with matrix of IIR filters algorithm
- Authors: Marcin Kowalczyk and Tomasz Kryjak
- Abstract summary: Neuromorphic vision is a rapidly growing field with numerous applications in the perception systems of autonomous vehicles.
There is a significant amount of noise in the event stream due to the sensors working principle.
We present a novel algorithm based on an IIR filter matrix for filtering this type of noise and a hardware architecture that allows its acceleration.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Neuromorphic vision is a rapidly growing field with numerous applications in
the perception systems of autonomous vehicles. Unfortunately, due to the
sensors working principle, there is a significant amount of noise in the event
stream. In this paper we present a novel algorithm based on an IIR filter
matrix for filtering this type of noise and a hardware architecture that allows
its acceleration using an SoC FPGA. Our method has a very good filtering
efficiency for uncorrelated noise - over 99% of noisy events are removed. It
has been tested for several event data sets with added random noise. We
designed the hardware architecture in such a way as to reduce the utilisation
of the FPGA's internal BRAM resources. This enabled a very low latency and a
throughput of up to 385.8 MEPS million events per second.The proposed hardware
architecture was verified in simulation and in hardware on the Xilinx Zynq
Ultrascale+ MPSoC chip on the Mercury+ XU9 module with the Mercury+ ST1 base
board.
Related papers
- Interpolation-Based Event Visual Data Filtering Algorithms [0.0]
We propose a method for event data that is capable of removing approximately 99% of noise while preserving the majority of the valid signal.<n>The proposed methods use about 30KB of memory for a sensor with a resolution of 1280 x 720 and is therefore well suited for implementation in embedded devices.
arXiv Detail & Related papers (2025-07-02T10:13:20Z) - High Throughput Event Filtering: The Interpolation-based DIF Algorithm Hardware Architecture [0.0]
We propose a hardware architecture of the Distance-based Interpolation with Frequency Weights filter and implement it on an FPGA chip.<n>Our architecture achieved a throughput of 403.39 million events per second for a sensor resolution of 1280 x 720 and 428.45 MEPS for a resolution of 640 x 480.<n>The average values of the Area Under the Receiver Operating Characteristic (AUROC) index ranged from 0.844 to 0.999 depending on the dataset.
arXiv Detail & Related papers (2025-06-06T07:49:18Z) - Noise Filtering Benchmark for Neuromorphic Satellites Observations [39.781091151259766]
Event cameras capture sparse, asynchronous brightness changes which offer high temporal resolution, high dynamic range, low power consumption, and sparse data output.
These advantages make them ideal for Space Situational Awareness, particularly in detecting resident space objects moving within a telescope's field of view.
However, the output from event cameras often includes substantial background activity noise, which is known to be more prevalent in low-light conditions.
This noise can overwhelm the sparse events generated by satellite signals, making detection and tracking more challenging.
arXiv Detail & Related papers (2024-11-18T02:02:24Z) - Fast Algorithms for Spiking Neural Network Simulation with FPGAs [0.0]
We create spiking neural network (SNN) simulators for the Potjans-Diesmann cortical microcircuit for a high-end Field-Programmable Gate Array (FPGA)
Our best simulators simulate the circuit 25% faster than real-time, require less than 21 nJ per synaptic event, and are bottle-necked by the device's on-chip memory.
This result is the first for simulating the circuit on a single hardware accelerator.
arXiv Detail & Related papers (2024-05-03T11:39:25Z) - Accelerating local laplacian filters on FPGAs [11.061707876645764]
Local Laplacian Filtering is an edge-aware image processing technique that involves the construction of simple Gaussian and Laplacian pyramids.
This paper proposes a hardware accelerator, which exploits fully the available parallelism in the Local Laplacian Filtering algorithm.
On Virtex-7 FPGA, we obtain a 7.5x speed-up to process a 1 MB image when compared to an optimized baseline CPU implementation.
arXiv Detail & Related papers (2024-02-18T10:49:23Z) - INR-Arch: A Dataflow Architecture and Compiler for Arbitrary-Order
Gradient Computations in Implicit Neural Representation Processing [66.00729477511219]
Given a function represented as a computation graph, traditional architectures face challenges in efficiently computing its nth-order gradient.
We introduce INR-Arch, a framework that transforms the computation graph of an nth-order gradient into a hardware-optimized dataflow architecture.
We present results that demonstrate 1.8-4.8x and 1.5-3.6x speedup compared to CPU and GPU baselines respectively.
arXiv Detail & Related papers (2023-08-11T04:24:39Z) - Batch-efficient EigenDecomposition for Small and Medium Matrices [65.67315418971688]
EigenDecomposition (ED) is at the heart of many computer vision algorithms and applications.
We propose a QR-based ED method dedicated to the application scenarios of computer vision.
arXiv Detail & Related papers (2022-07-09T09:14:12Z) - Real-Time GPU-Accelerated Machine Learning Based Multiuser Detection for
5G and Beyond [70.81551587109833]
nonlinear beamforming filters can significantly outperform linear approaches in stationary scenarios with massive connectivity.
One of the main challenges comes from the real-time implementation of these algorithms.
This paper explores the acceleration of APSM-based algorithms through massive parallelization.
arXiv Detail & Related papers (2022-01-13T15:20:45Z) - Accelerating Markov Random Field Inference with Uncertainty
Quantification [10.825800519362579]
probabilistic algorithms are computationally expensive on conventional processors.
Their statistical properties, namely interpretability and uncertainty quantification (UQ) make them an attractive alternative approach.
We propose a high- throughput accelerator for Markov Random Field (MRF) inference.
We also propose a novel hybrid on-chip/off-chip memory system and logging scheme to efficiently support UQ.
arXiv Detail & Related papers (2021-08-02T00:02:53Z) - FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation [81.76975488010213]
Dense optical flow estimation plays a key role in many robotic vision tasks.
Current networks often occupy large number of parameters and require heavy computation costs.
Our proposed FastFlowNet works in the well-known coarse-to-fine manner with following innovations.
arXiv Detail & Related papers (2021-03-08T03:09:37Z) - Learning based signal detection for MIMO systems with unknown noise
statistics [84.02122699723536]
This paper aims to devise a generalized maximum likelihood (ML) estimator to robustly detect signals with unknown noise statistics.
In practice, there is little or even no statistical knowledge on the system noise, which in many cases is non-Gaussian, impulsive and not analyzable.
Our framework is driven by an unsupervised learning approach, where only the noise samples are required.
arXiv Detail & Related papers (2021-01-21T04:48:15Z) - A fully pipelined FPGA accelerator for scale invariant feature transform
keypoint descriptor matching, [0.0]
We design a novel fully pipelined hardware accelerator architecture for SIFT keypoint descriptor matching.
The proposed hardware architecture is able to properly handle the memory bandwidth necessary for a fully-pipelined implementation.
Our hardware implementation is 15.7 times faster than the comparable software approach.
arXiv Detail & Related papers (2020-12-17T15:29:41Z) - Near-chip Dynamic Vision Filtering for Low-Bandwidth Pedestrian
Detection [99.94079901071163]
This paper presents a novel end-to-end system for pedestrian detection using Dynamic Vision Sensors (DVSs)
We target applications where multiple sensors transmit data to a local processing unit, which executes a detection algorithm.
Our detector is able to perform a detection every 450 ms, with an overall testing F1 score of 83%.
arXiv Detail & Related papers (2020-04-03T17:36:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.