Characterization and Optimization of Integrated Silicon-Photonic Neural
Networks under Fabrication-Process Variations
- URL: http://arxiv.org/abs/2204.09153v1
- Date: Tue, 19 Apr 2022 23:03:36 GMT
- Title: Characterization and Optimization of Integrated Silicon-Photonic Neural
Networks under Fabrication-Process Variations
- Authors: Asif Mirza, Amin Shafiee, Sanmitra Banerjee, Krishnendu Chakrabarty,
Sudeep Pasricha, Mahdi Nikdast
- Abstract summary: Silicon-photonic neural networks (SPNNs) have emerged as promising successors to electronic artificial intelligence (AI) accelerators.
The underlying silicon photonic devices in SPNNs are sensitive to inevitable fabrication-process variations (FPVs) stemming from optical lithography imperfections.
We propose a novel variation-aware, design-time optimization solution to improve MZI tolerance to different FPVs in SPNNs.
- Score: 8.690877625458324
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Silicon-photonic neural networks (SPNNs) have emerged as promising successors
to electronic artificial intelligence (AI) accelerators by offering orders of
magnitude lower latency and higher energy efficiency. Nevertheless, the
underlying silicon photonic devices in SPNNs are sensitive to inevitable
fabrication-process variations (FPVs) stemming from optical lithography
imperfections. Consequently, the inferencing accuracy in an SPNN can be highly
impacted by FPVs -- e.g., can drop to below 10% -- the impact of which is yet
to be fully studied. In this paper, we, for the first time, model and explore
the impact of FPVs in the waveguide width and silicon-on-insulator (SOI)
thickness in coherent SPNNs that use Mach-Zehnder Interferometers (MZIs).
Leveraging such models, we propose a novel variation-aware, design-time
optimization solution to improve MZI tolerance to different FPVs in SPNNs.
Simulation results for two example SPNNs of different scales under realistic
and correlated FPVs indicate that the optimized MZIs can improve the
inferencing accuracy by up to 93.95% for the MNIST handwritten digit dataset --
considered as an example in this paper -- which corresponds to a <0.5% accuracy
loss compared to the variation-free case. The proposed one-time optimization
method imposes low area overhead, and hence is applicable even to
resource-constrained designs
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - Hardware-Software Co-optimised Fast and Accurate Deep Reconfigurable Spiking Inference Accelerator Architecture Design Methodology [2.968768532937366]
Spiking Neural Networks (SNNs) have emerged as a promising approach to improve the energy efficiency of machine learning models.
We develop a hardware-software co-optimisation strategy to port software-trained deep neural networks (DNN) to reduced-precision spiking models.
arXiv Detail & Related papers (2024-10-07T05:04:13Z) - LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization [48.41286573672824]
Spiking Neural Networks (SNNs) mimic the information-processing mechanisms of the human brain and are highly energy-efficient.
We propose a new approach named LitE-SNN that incorporates both spatial and temporal compression into the automated network design process.
arXiv Detail & Related papers (2024-01-26T05:23:11Z) - EPIM: Efficient Processing-In-Memory Accelerators based on Epitome [78.79382890789607]
We introduce the Epitome, a lightweight neural operator offering convolution-like functionality.
On the software side, we evaluate epitomes' latency and energy on PIM accelerators.
We introduce a PIM-aware layer-wise design method to enhance their hardware efficiency.
arXiv Detail & Related papers (2023-11-12T17:56:39Z) - Analysis of Optical Loss and Crosstalk Noise in MZI-based Coherent
Photonic Neural Networks [8.930237478906266]
silicon-photonic-based neural network (SP-NN) accelerators have emerged as a promising alternative to electronic accelerators.
In this paper, we comprehensively model the optical loss and crosstalk noise using a bottom-up approach.
We show a high power penalty and a catastrophic inferencing accuracy drop of up to 84% for SP-NNs of different scales.
arXiv Detail & Related papers (2023-08-07T02:01:18Z) - Improving Realistic Worst-Case Performance of NVCiM DNN Accelerators
through Training with Right-Censored Gaussian Noise [16.470952550714394]
We propose to use the k-th percentile performance (KPP) to capture the realistic worst-case performance of DNN models executing on CiM accelerators.
Our method achieves up to a 26% improvement in KPP compared to the state-of-the-art methods employed to enhance robustness under the impact of device variations.
arXiv Detail & Related papers (2023-07-29T01:06:37Z) - Characterizing Coherent Integrated Photonic Neural Networks under
Imperfections [7.387054116520716]
Integrated photonic neural networks (IPNNs) are emerging as promising successors to conventional electronic AI accelerators.
In this paper, we systematically characterize the impact of uncertainties and imprecisions in IPNNs using a bottom-up approach.
arXiv Detail & Related papers (2022-07-22T01:33:19Z) - Auto-PINN: Understanding and Optimizing Physics-Informed Neural
Architecture [77.59766598165551]
Physics-informed neural networks (PINNs) are revolutionizing science and engineering practice by bringing together the power of deep learning to bear on scientific computation.
Here, we propose Auto-PINN, which employs Neural Architecture Search (NAS) techniques to PINN design.
A comprehensive set of pre-experiments using standard PDE benchmarks allows us to probe the structure-performance relationship in PINNs.
arXiv Detail & Related papers (2022-05-27T03:24:31Z) - Enhanced physics-constrained deep neural networks for modeling vanadium
redox flow battery [62.997667081978825]
We propose an enhanced version of the physics-constrained deep neural network (PCDNN) approach to provide high-accuracy voltage predictions.
The ePCDNN can accurately capture the voltage response throughout the charge--discharge cycle, including the tail region of the voltage discharge curve.
arXiv Detail & Related papers (2022-03-03T19:56:24Z) - Modeling Silicon-Photonic Neural Networks under Uncertainties [4.205518884494758]
Silicon-photonic neural networks (SPNNs) offer substantial improvements in computing speed and energy efficiency compared to their digital electronic counterparts.
However, the energy efficiency and accuracy of SPNNs are highly impacted by uncertainties that arise from fabrication-process and thermal variations.
We present the first comprehensive and hierarchical study on the impact of random uncertainties on the classification accuracy of a Mach-Zehnder Interferometer (MZI)-based SPNN.
arXiv Detail & Related papers (2020-12-19T04:41:26Z) - Toward fast and accurate human pose estimation via soft-gated skip
connections [97.06882200076096]
This paper is on highly accurate and highly efficient human pose estimation.
We re-analyze this design choice in the context of improving both the accuracy and the efficiency over the state-of-the-art.
Our model achieves state-of-the-art results on the MPII and LSP datasets.
arXiv Detail & Related papers (2020-02-25T18:51:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.