Quantum-noise-limited optical neural networks operating at a few quanta
per activation
- URL: http://arxiv.org/abs/2307.15712v1
- Date: Fri, 28 Jul 2023 17:59:46 GMT
- Title: Quantum-noise-limited optical neural networks operating at a few quanta
per activation
- Authors: Shi-Yuan Ma, Tianyu Wang, J\'er\'emie Laydevant, Logan G. Wright and
Peter L. McMahon
- Abstract summary: We show that it is possible to train optical neural networks to perform deterministic image-classification tasks with high accuracy.
We experimentally demonstrated MNIST classification with a test accuracy of 98% using an optical neural network with a hidden layer operating in the single-photon regime.
- Score: 5.494796517705931
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Analog physical neural networks, which hold promise for improved energy
efficiency and speed compared to digital electronic neural networks, are
nevertheless typically operated in a relatively high-power regime so that the
signal-to-noise ratio (SNR) is large (>10). What happens if an analog system is
instead operated in an ultra-low-power regime, in which the behavior of the
system becomes highly stochastic and the noise is no longer a small
perturbation on the signal? In this paper, we study this question in the
setting of optical neural networks operated in the limit where some layers use
only a single photon to cause a neuron activation. Neuron activations in this
limit are dominated by quantum noise from the fundamentally probabilistic
nature of single-photon detection of weak optical signals. We show that it is
possible to train stochastic optical neural networks to perform deterministic
image-classification tasks with high accuracy in spite of the extremely high
noise (SNR ~ 1) by using a training procedure that directly models the
stochastic behavior of photodetection. We experimentally demonstrated MNIST
classification with a test accuracy of 98% using an optical neural network with
a hidden layer operating in the single-photon regime; the optical energy used
to perform the classification corresponds to 0.008 photons per
multiply-accumulate (MAC) operation, which is equivalent to 0.003 attojoules of
optical energy per MAC. Our experiment used >40x fewer photons per inference
than previous state-of-the-art low-optical-energy demonstrations, to achieve
the same accuracy of >90%. Our work shows that some extremely stochastic analog
systems, including those operating in the limit where quantum noise dominates,
can nevertheless be used as layers in neural networks that deterministically
perform classification tasks with high accuracy if they are appropriately
trained.
Related papers
- All-optical modulation with single-photons using electron avalanche [69.65384453064829]
We demonstrate all-optical modulation using a beam with single-photon intensity.
Our approach opens up the possibility of terahertz-speed optical switching at the single-photon level.
arXiv Detail & Related papers (2023-12-18T20:14:15Z) - Spatially Varying Nanophotonic Neural Networks [39.1303097259564]
Photonic processors that execute operations using photons instead of electrons promise to enable optical neural networks with ultra-low latency and power consumption.
Existing optical neural networks, limited by the underlying network designs, have achieved image recognition accuracy far below that of state-of-the-art electronic neural networks.
arXiv Detail & Related papers (2023-08-07T08:48:46Z) - Integrated multi-operand optical neurons for scalable and
hardware-efficient deep learning [10.157562103034]
This work proposes a scalable and efficient optical dot-product engine based on customized multi-operand photonic devices.
We experimentally demonstrate the utility of a MOON using a multi-operand-Mach-Zehnder-interferometer (MOMZI) in image recognition tasks.
arXiv Detail & Related papers (2023-05-31T06:25:39Z) - Globally Optimal Training of Neural Networks with Threshold Activation
Functions [63.03759813952481]
We study weight decay regularized training problems of deep neural networks with threshold activations.
We derive a simplified convex optimization formulation when the dataset can be shattered at a certain layer of the network.
arXiv Detail & Related papers (2023-03-06T18:59:13Z) - On-chip quantum information processing with distinguishable photons [55.41644538483948]
Multi-photon interference is at the heart of photonic quantum technologies.
Here, we experimentally demonstrate that detection can be implemented with a temporal resolution sufficient to interfere photons detuned on the scales necessary for cavity-based integrated photon sources.
We show how time-resolved detection of non-ideal photons can be used to improve the fidelity of an entangling operation and to mitigate the reduction of computational complexity in boson sampling experiments.
arXiv Detail & Related papers (2022-10-14T18:16:49Z) - Momentum Diminishes the Effect of Spectral Bias in Physics-Informed
Neural Networks [72.09574528342732]
Physics-informed neural network (PINN) algorithms have shown promising results in solving a wide range of problems involving partial differential equations (PDEs)
They often fail to converge to desirable solutions when the target function contains high-frequency features, due to a phenomenon known as spectral bias.
In the present work, we exploit neural tangent kernels (NTKs) to investigate the training dynamics of PINNs evolving under gradient descent with momentum (SGDM)
arXiv Detail & Related papers (2022-06-29T19:03:10Z) - Single-Shot Optical Neural Network [55.41644538483948]
'Weight-stationary' analog optical and electronic hardware has been proposed to reduce the compute resources required by deep neural networks.
We present a scalable, single-shot-per-layer weight-stationary optical processor.
arXiv Detail & Related papers (2022-05-18T17:49:49Z) - Resonant tunnelling diode nano-optoelectronic spiking nodes for
neuromorphic information processing [0.0]
We introduce an optoelectronic artificial neuron capable of operating at ultrafast rates and with low energy consumption.
The proposed system combines an excitable tunnelling diode (RTD) element with a nanoscale light source.
arXiv Detail & Related papers (2021-07-14T14:11:04Z) - Photonic neural field on a silicon chip: large-scale, high-speed
neuro-inspired computing and sensing [0.0]
Photonic neural networks have significant potential for high-speed neural processing with low latency and ultralow energy consumption.
We propose the concept of a photonic neural field and implement it experimentally on a silicon chip to realize highly scalable neuro-inspired computing.
In this study, we use the on-chip photonic neural field as a reservoir of information and demonstrate a high-speed chaotic time-series prediction with low errors.
arXiv Detail & Related papers (2021-05-22T09:28:51Z) - An optical neural network using less than 1 photon per multiplication [4.003843776219224]
We experimentally demonstrate an optical neural network achieving 99% accuracy on handwritten-digit classification.
This performance was achieved using a custom free-space optical processor.
Our results provide a proof-of-principle for low-optical-power operation.
arXiv Detail & Related papers (2021-04-27T20:43:23Z) - Rapid characterisation of linear-optical networks via PhaseLift [51.03305009278831]
Integrated photonics offers great phase-stability and can rely on the large scale manufacturability provided by the semiconductor industry.
New devices, based on such optical circuits, hold the promise of faster and energy-efficient computations in machine learning applications.
We present a novel technique to reconstruct the transfer matrix of linear optical networks.
arXiv Detail & Related papers (2020-10-01T16:04:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.