Analysis of Optical Loss and Crosstalk Noise in MZI-based Coherent
Photonic Neural Networks
- URL: http://arxiv.org/abs/2308.03249v1
- Date: Mon, 7 Aug 2023 02:01:18 GMT
- Title: Analysis of Optical Loss and Crosstalk Noise in MZI-based Coherent
Photonic Neural Networks
- Authors: Amin Shafiee, Sanmitra Banerjee, Krishnendu Chakrabarty, Sudeep
Pasricha, Mahdi Nikdast
- Abstract summary: silicon-photonic-based neural network (SP-NN) accelerators have emerged as a promising alternative to electronic accelerators.
In this paper, we comprehensively model the optical loss and crosstalk noise using a bottom-up approach.
We show a high power penalty and a catastrophic inferencing accuracy drop of up to 84% for SP-NNs of different scales.
- Score: 8.930237478906266
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: With the continuous increase in the size and complexity of machine learning
models, the need for specialized hardware to efficiently run such models is
rapidly growing. To address such a need, silicon-photonic-based neural network
(SP-NN) accelerators have recently emerged as a promising alternative to
electronic accelerators due to their lower latency and higher energy
efficiency. Not only can SP-NNs alleviate the fan-in and fan-out problem with
linear algebra processors, their operational bandwidth can match that of the
photodetection rate (typically 100 GHz), which is at least over an order of
magnitude faster than electronic counterparts that are restricted to a clock
rate of a few GHz. Unfortunately, the underlying silicon photonic devices in
SP-NNs suffer from inherent optical losses and crosstalk noise originating from
fabrication imperfections and undesired optical couplings, the impact of which
accumulates as the network scales up. Consequently, the inferencing accuracy in
an SP-NN can be affected by such inefficiencies -- e.g., can drop to below 10%
-- the impact of which is yet to be fully studied. In this paper, we
comprehensively model the optical loss and crosstalk noise using a bottom-up
approach, from the device to the system level, in coherent SP-NNs built using
Mach-Zehnder interferometer (MZI) devices. The proposed models can be applied
to any SP-NN architecture with different configurations to analyze the effect
of loss and crosstalk. Such an analysis is important where there are
inferencing accuracy and scalability requirements to meet when designing an
SP-NN. Using the proposed analytical framework, we show a high power penalty
and a catastrophic inferencing accuracy drop of up to 84% for SP-NNs of
different scales with three known MZI mesh configurations (i.e., Reck,
Clements, and Diamond) due to accumulated optical loss and crosstalk noise.
Related papers
- Neuromorphic Wireless Split Computing with Multi-Level Spikes [69.73249913506042]
In neuromorphic computing, spiking neural networks (SNNs) perform inference tasks, offering significant efficiency gains for workloads involving sequential data.
Recent advances in hardware and software have demonstrated that embedding a few bits of payload in each spike exchanged between the spiking neurons can further enhance inference accuracy.
This paper investigates a wireless neuromorphic split computing architecture employing multi-level SNNs.
arXiv Detail & Related papers (2024-11-07T14:08:35Z) - LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization [48.41286573672824]
Spiking Neural Networks (SNNs) mimic the information-processing mechanisms of the human brain and are highly energy-efficient.
We propose a new approach named LitE-SNN that incorporates both spatial and temporal compression into the automated network design process.
arXiv Detail & Related papers (2024-01-26T05:23:11Z) - Free-Space Optical Spiking Neural Network [0.0]
We introduce the Free-space Optical deep Spiking Convolutional Neural Network (OSCNN)
This novel approach draws inspiration from computational models of the human eye.
Our results demonstrate promising performance with minimal latency and power consumption compared to their electronic ONN counterparts.
arXiv Detail & Related papers (2023-11-08T09:41:14Z) - The Hardware Impact of Quantization and Pruning for Weights in Spiking
Neural Networks [0.368986335765876]
quantization and pruning of parameters can both compress the model size, reduce memory footprints, and facilitate low-latency execution.
We study various combinations of pruning and quantization in isolation, cumulatively, and simultaneously to a state-of-the-art SNN targeting gesture recognition.
We show that this state-of-the-art model is amenable to aggressive parameter quantization, not suffering from any loss in accuracy down to ternary weights.
arXiv Detail & Related papers (2023-02-08T16:25:20Z) - Characterizing Coherent Integrated Photonic Neural Networks under
Imperfections [7.387054116520716]
Integrated photonic neural networks (IPNNs) are emerging as promising successors to conventional electronic AI accelerators.
In this paper, we systematically characterize the impact of uncertainties and imprecisions in IPNNs using a bottom-up approach.
arXiv Detail & Related papers (2022-07-22T01:33:19Z) - Single-Shot Optical Neural Network [55.41644538483948]
'Weight-stationary' analog optical and electronic hardware has been proposed to reduce the compute resources required by deep neural networks.
We present a scalable, single-shot-per-layer weight-stationary optical processor.
arXiv Detail & Related papers (2022-05-18T17:49:49Z) - Characterization and Optimization of Integrated Silicon-Photonic Neural
Networks under Fabrication-Process Variations [8.690877625458324]
Silicon-photonic neural networks (SPNNs) have emerged as promising successors to electronic artificial intelligence (AI) accelerators.
The underlying silicon photonic devices in SPNNs are sensitive to inevitable fabrication-process variations (FPVs) stemming from optical lithography imperfections.
We propose a novel variation-aware, design-time optimization solution to improve MZI tolerance to different FPVs in SPNNs.
arXiv Detail & Related papers (2022-04-19T23:03:36Z) - LoCI: An Analysis of the Impact of Optical Loss and Crosstalk Noise in
Integrated Silicon-Photonic Neural Networks [8.930237478906266]
Integrated silicon-photonic neural networks (SP-NNs) promise higher speed and energy efficiency for emerging artificial-intelligence applications.
This paper presents the first comprehensive and systematic optical loss and crosstalk modeling framework for SP-NNs.
arXiv Detail & Related papers (2022-04-08T04:22:39Z) - FPGA-optimized Hardware acceleration for Spiking Neural Networks [69.49429223251178]
This work presents the development of a hardware accelerator for an SNN, with off-line training, applied to an image recognition task.
The design targets a Xilinx Artix-7 FPGA, using in total around the 40% of the available hardware resources.
It reduces the classification time by three orders of magnitude, with a small 4.5% impact on the accuracy, if compared to its software, full precision counterpart.
arXiv Detail & Related papers (2022-01-18T13:59:22Z) - An Adaptive Device-Edge Co-Inference Framework Based on Soft
Actor-Critic [72.35307086274912]
High-dimension parameter model and large-scale mathematical calculation restrict execution efficiency, especially for Internet of Things (IoT) devices.
We propose a new Deep Reinforcement Learning (DRL)-Soft Actor Critic for discrete (SAC-d), which generates the emphexit point, emphexit point, and emphcompressing bits by soft policy iterations.
Based on the latency and accuracy aware reward design, such an computation can well adapt to the complex environment like dynamic wireless channel and arbitrary processing, and is capable of supporting the 5G URL
arXiv Detail & Related papers (2022-01-09T09:31:50Z) - Interleaving: Modular architectures for fault-tolerant photonic quantum
computing [50.591267188664666]
Photonic fusion-based quantum computing (FBQC) uses low-loss photonic delays.
We present a modular architecture for FBQC in which these components are combined to form "interleaving modules"
Exploiting the multiplicative power of delays, each module can add thousands of physical qubits to the computational Hilbert space.
arXiv Detail & Related papers (2021-03-15T18:00:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.