Intelligent and Reconfigurable Architecture for KL Divergence Based
Online Machine Learning Algorithm
- URL: http://arxiv.org/abs/2002.07713v1
- Date: Tue, 18 Feb 2020 16:39:57 GMT
- Title: Intelligent and Reconfigurable Architecture for KL Divergence Based
Online Machine Learning Algorithm
- Authors: S. V. Sai Santosh and Sumit J. Darak
- Abstract summary: Online machine learning (OML) algorithms do not need any training phase and can be deployed directly in an unknown environment.
Online machine learning (OML) algorithms do not need any training phase and can be deployed directly in an unknown environment.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online machine learning (OML) algorithms do not need any training phase and
can be deployed directly in an unknown environment. OML includes multi-armed
bandit (MAB) algorithms that can identify the best arm among several arms by
achieving a balance between exploration of all arms and exploitation of optimal
arm. The Kullback-Leibler divergence based upper confidence bound (KLUCB) is
the state-of-the-art MAB algorithm that optimizes exploration-exploitation
trade-off but it is complex due to underlining optimization routine. This
limits its usefulness for robotics and radio applications which demand
integration of KLUCB with the PHY on the system on chip (SoC). In this paper,
we efficiently map the KLUCB algorithm on SoC by realizing optimization routine
via alternative synthesizable computation without compromising on the
performance. The proposed architecture is dynamically reconfigurable such that
the number of arms, as well as type of algorithm, can be changed on-the-fly.
Specifically, after initial learning, on-the-fly switch to light-weight UCB
offers around 10-factor improvement in latency and throughput. Since learning
duration depends on the unknown arm statistics, we offer intelligence embedded
in architecture to decide the switching instant. We validate the functional
correctness and usefulness of the proposed architecture via a realistic
wireless application and detailed complexity analysis demonstrates its
feasibility in realizing intelligent radios.
Related papers
- Curriculum reinforcement learning for quantum architecture search under
hardware errors [1.583327010995414]
This work introduces a curriculum-based reinforcement learning QAS (CRLQAS) designed to tackle challenges in VQA deployment.
The algorithm incorporates (i) a 3D architecture encoding and restrictions on environment dynamics to explore the search space of possible circuits efficiently.
To facilitate studies, we developed an optimized simulator for our algorithm, significantly improving computational efficiency in noisy quantum circuits.
arXiv Detail & Related papers (2024-02-05T20:33:00Z) - Stabilizing Q-learning with Linear Architectures for Provably Efficient
Learning [53.17258888552998]
This work proposes an exploration variant of the basic $Q$-learning protocol with linear function approximation.
We show that the performance of the algorithm degrades very gracefully under a novel and more permissive notion of approximation error.
arXiv Detail & Related papers (2022-06-01T23:26:51Z) - Differentially Private Federated Learning via Reconfigurable Intelligent
Surface [27.004823731436765]
Federated learning (FL) enables the collaborative training of a global model over decentralized local datasets without sharing them.
We propose a reconfigurable intelligent surface (RIS) empowered over-the-air FL system to alleviate the dilemma between learning accuracy and privacy.
arXiv Detail & Related papers (2022-03-31T13:45:02Z) - Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or
Bayesian? [0.0]
Multi-armed Bandit (MAB) algorithms identify the best arm among multiple arms.
We propose a reconfigurable and intelligent MAB (RI-MAB) framework.
arXiv Detail & Related papers (2021-06-05T10:07:31Z) - Reconfigurable Intelligent Surface Assisted Mobile Edge Computing with
Heterogeneous Learning Tasks [53.1636151439562]
Mobile edge computing (MEC) provides a natural platform for AI applications.
We present an infrastructure to perform machine learning tasks at an MEC with the assistance of a reconfigurable intelligent surface (RIS)
Specifically, we minimize the learning error of all participating users by jointly optimizing transmit power of mobile users, beamforming vectors of the base station, and the phase-shift matrix of the RIS.
arXiv Detail & Related papers (2020-12-25T07:08:50Z) - Path Design and Resource Management for NOMA enhanced Indoor Intelligent
Robots [58.980293789967575]
A communication enabled indoor intelligent robots (IRs) service framework is proposed.
Lego modeling method is proposed, which can deterministically describe the indoor layout and channel state.
The investigated radio map is invoked as a virtual environment to train the reinforcement learning agent.
arXiv Detail & Related papers (2020-11-23T21:45:01Z) - Federated Learning via Intelligent Reflecting Surface [30.935389187215474]
Over-the-air computation algorithm (AirComp) based learning (FL) is capable of achieving fast model aggregation by exploiting the waveform superposition property of multiple access channels.
In this paper, we propose a two-step optimization framework to achieve fast yet reliable model aggregation for AirComp-based FL.
Simulation results will demonstrate that our proposed framework and the deployment of an IRS can achieve a lower training loss and higher FL prediction accuracy than the baseline algorithms.
arXiv Detail & Related papers (2020-11-10T11:29:57Z) - Deep Multi-Task Learning for Cooperative NOMA: System Design and
Principles [52.79089414630366]
We develop a novel deep cooperative NOMA scheme, drawing upon the recent advances in deep learning (DL)
We develop a novel hybrid-cascaded deep neural network (DNN) architecture such that the entire system can be optimized in a holistic manner.
arXiv Detail & Related papers (2020-07-27T12:38:37Z) - Predictive Coding Approximates Backprop along Arbitrary Computation
Graphs [68.8204255655161]
We develop a strategy to translate core machine learning architectures into their predictive coding equivalents.
Our models perform equivalently to backprop on challenging machine learning benchmarks.
Our method raises the potential that standard machine learning algorithms could in principle be directly implemented in neural circuitry.
arXiv Detail & Related papers (2020-06-07T15:35:47Z) - Optimization-driven Deep Reinforcement Learning for Robust Beamforming
in IRS-assisted Wireless Communications [54.610318402371185]
Intelligent reflecting surface (IRS) is a promising technology to assist downlink information transmissions from a multi-antenna access point (AP) to a receiver.
We minimize the AP's transmit power by a joint optimization of the AP's active beamforming and the IRS's passive beamforming.
We propose a deep reinforcement learning (DRL) approach that can adapt the beamforming strategies from past experiences.
arXiv Detail & Related papers (2020-05-25T01:42:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.