Controlling chaotic itinerancy in laser dynamics for reinforcement
learning
- URL: http://arxiv.org/abs/2205.05987v1
- Date: Thu, 12 May 2022 09:48:43 GMT
- Title: Controlling chaotic itinerancy in laser dynamics for reinforcement
learning
- Authors: Ryugo Iwami, Takatomo Mihana, Kazutaka Kanno, Satoshi Sunada, Makoto
Naruse, and Atsushi Uchida
- Abstract summary: Chaotic itinerancy can be employed to realize brain-like functionalities.
We propose a method for controlling the chaotic itinerancy in a multi-mode semiconductor laser to solve a machine learning task.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Photonic artificial intelligence has attracted considerable interest in
accelerating machine learning; however, the unique optical properties have not
been fully utilized for achieving higher-order functionalities. Chaotic
itinerancy, with its spontaneous transient dynamics among multiple
quasi-attractors, can be employed to realize brain-like functionalities. In
this paper, we propose a method for controlling the chaotic itinerancy in a
multi-mode semiconductor laser to solve a machine learning task, known as the
multi-armed bandit problem, which is fundamental to reinforcement learning. The
proposed method utilizes ultrafast chaotic itinerant motion in mode competition
dynamics controlled via optical injection. We found that the exploration
mechanism is completely different from a conventional searching algorithm and
is highly scalable, outperforming the conventional approaches for large-scale
bandit problems. This study paves the way to utilize chaotic itinerancy for
effectively solving complex machine learning tasks as photonic hardware
accelerators.
Related papers
- Optical training of large-scale Transformers and deep neural networks with direct feedback alignment [48.90869997343841]
We experimentally implement a versatile and scalable training algorithm, called direct feedback alignment, on a hybrid electronic-photonic platform.
An optical processing unit performs large-scale random matrix multiplications, which is the central operation of this algorithm, at speeds up to 1500 TeraOps.
We study the compute scaling of our hybrid optical approach, and demonstrate a potential advantage for ultra-deep and wide neural networks.
arXiv Detail & Related papers (2024-09-01T12:48:47Z) - Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network [1.124958340749622]
We propose a photonic-based decision-making algorithm to address the competitive multi-armed bandit problem.
Our numerical simulations demonstrate that chaotic oscillations and cluster synchronization of optically coupled lasers, along with our proposed decentralized coupling adjustment, efficiently balance exploration and exploitation.
arXiv Detail & Related papers (2024-07-12T09:38:47Z) - RObotic MAnipulation Network (ROMAN) $\unicode{x2013}$ Hybrid
Hierarchical Learning for Solving Complex Sequential Tasks [70.69063219750952]
We present a Hybrid Hierarchical Learning framework, the Robotic Manipulation Network (ROMAN)
ROMAN achieves task versatility and robust failure recovery by integrating behavioural cloning, imitation learning, and reinforcement learning.
Experimental results show that by orchestrating and activating these specialised manipulation experts, ROMAN generates correct sequential activations for accomplishing long sequences of sophisticated manipulation tasks.
arXiv Detail & Related papers (2023-06-30T20:35:22Z) - On Robust Numerical Solver for ODE via Self-Attention Mechanism [82.95493796476767]
We explore training efficient and robust AI-enhanced numerical solvers with a small data size by mitigating intrinsic noise disturbances.
We first analyze the ability of the self-attention mechanism to regulate noise in supervised learning and then propose a simple-yet-effective numerical solver, Attr, which introduces an additive self-attention mechanism to the numerical solution of differential equations.
arXiv Detail & Related papers (2023-02-05T01:39:21Z) - Variational Quantum Soft Actor-Critic for Robotic Arm Control [0.0]
This work aims at exploring and assessing the advantages of the application of Quantum Computing to one of the state-of-art Reinforcement Learning techniques for continuous control.
Specifically, the performance of a Variational Quantum Soft Actor-Critic on the movement of a virtual robotic arm has been investigated by means of digital simulations of quantum circuits.
arXiv Detail & Related papers (2022-12-20T19:02:24Z) - Accelerated Policy Learning with Parallel Differentiable Simulation [59.665651562534755]
We present a differentiable simulator and a new policy learning algorithm (SHAC)
Our algorithm alleviates problems with local minima through a smooth critic function.
We show substantial improvements in sample efficiency and wall-clock time over state-of-the-art RL and differentiable simulation-based algorithms.
arXiv Detail & Related papers (2022-04-14T17:46:26Z) - From Machine Learning to Robotics: Challenges and Opportunities for
Embodied Intelligence [113.06484656032978]
Article argues that embodied intelligence is a key driver for the advancement of machine learning technology.
We highlight challenges and opportunities specific to embodied intelligence.
We propose research directions which may significantly advance the state-of-the-art in robot learning.
arXiv Detail & Related papers (2021-10-28T16:04:01Z) - Trajectory Tracking of Underactuated Sea Vessels With Uncertain
Dynamics: An Integral Reinforcement Learning Approach [2.064612766965483]
An online machine learning mechanism based on integral reinforcement learning is proposed to find a solution for a class of nonlinear tracking problems.
The solution is implemented using an online value iteration process which is realized by employing means of the adaptive critics and gradient descent approaches.
arXiv Detail & Related papers (2021-04-01T01:41:49Z) - A Machine Learning Framework for Computing the Most Probable Paths of
Stochastic Dynamical Systems [5.028470487310566]
We develop a machine learning framework to compute the most probable paths in the sense of Onsager-Machlup action functional theory.
Specifically, we reformulate the boundary value problem of Hamiltonian system and design a prototypical neural network to remedy the shortcomings of shooting method.
arXiv Detail & Related papers (2020-10-01T20:01:37Z) - Robust trajectory generation for robotic control on the neuromorphic
research chip Loihi [0.0]
We exploit a recently developed spiking neural network model, the so-called anisotropic network.
We show that the anisotropic network on Loihi reliably encodes sequential patterns of neural activity.
Taken together, our study presents a new algorithm that allows the generation of complex robotic movements.
arXiv Detail & Related papers (2020-08-26T16:02:39Z) - Limited-angle tomographic reconstruction of dense layered objects by
dynamical machine learning [68.9515120904028]
Limited-angle tomography of strongly scattering quasi-transparent objects is a challenging, highly ill-posed problem.
Regularizing priors are necessary to reduce artifacts by improving the condition of such problems.
We devised a recurrent neural network (RNN) architecture with a novel split-convolutional gated recurrent unit (SC-GRU) as the building block.
arXiv Detail & Related papers (2020-07-21T11:48:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.