An FPGA Architecture for Online Learning using the Tsetlin Machine
- URL: http://arxiv.org/abs/2306.01027v1
- Date: Thu, 1 Jun 2023 13:33:26 GMT
- Title: An FPGA Architecture for Online Learning using the Tsetlin Machine
- Authors: Samuel Prescott and Adrian Wheeldon and Rishad Shafik and Tousif
Rahman and Alex Yakovlev and Ole-Christoffer Granmo
- Abstract summary: This paper proposes a novel field-programmable gate-array infrastructure for online learning.
It implements a low-complexity machine learning algorithm called the Tsetlin Machine.
We present use cases for online learning using the proposed infrastructure and demonstrate the energy/performance/accuracy trade-offs.
- Score: 5.140342614848069
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: There is a need for machine learning models to evolve in unsupervised
circumstances. New classifications may be introduced, unexpected faults may
occur, or the initial dataset may be small compared to the data-points
presented to the system during normal operation. Implementing such a system
using neural networks involves significant mathematical complexity, which is a
major issue in power-critical edge applications.
This paper proposes a novel field-programmable gate-array infrastructure for
online learning, implementing a low-complexity machine learning algorithm
called the Tsetlin Machine. This infrastructure features a custom-designed
architecture for run-time learning management, providing on-chip offline and
online learning. Using this architecture, training can be carried out on-demand
on the \ac{FPGA} with pre-classified data before inference takes place.
Additionally, our architecture provisions online learning, where training can
be interleaved with inference during operation. Tsetlin Machine (TM) training
naturally descends to an optimum, with training also linked to a threshold
hyper-parameter which is used to reduce the probability of issuing feedback as
the TM becomes trained further. The proposed architecture is modular, allowing
the data input source to be easily changed, whilst inbuilt cross-validation
infrastructure allows for reliable and representative results during system
testing. We present use cases for online learning using the proposed
infrastructure and demonstrate the energy/performance/accuracy trade-offs.
Related papers
- Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns [1.1965844936801797]
Convolutional neural networks (ConvNets) exert severe demands on local device resources.
This brief presents work toward utilizing static convolutional filters to design efficient ConvNet architectures.
arXiv Detail & Related papers (2024-07-20T10:18:42Z) - Deep Feature Learning for Wireless Spectrum Data [0.5809784853115825]
We propose an approach to learning feature representations for wireless transmission clustering in a completely unsupervised manner.
We show that the automatic representation learning is able to extract fine-grained clusters containing the shapes of the wireless transmission bursts.
arXiv Detail & Related papers (2023-08-07T12:27:19Z) - PDSketch: Integrated Planning Domain Programming and Learning [86.07442931141637]
We present a new domain definition language, named PDSketch.
It allows users to flexibly define high-level structures in the transition models.
Details of the transition model will be filled in by trainable neural networks.
arXiv Detail & Related papers (2023-03-09T18:54:12Z) - Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling
and Design [68.1682448368636]
We present a supervised pretraining approach to learn circuit representations that can be adapted to new unseen topologies or unseen prediction tasks.
To cope with the variable topological structure of different circuits we describe each circuit as a graph and use graph neural networks (GNNs) to learn node embeddings.
We show that pretraining GNNs on prediction of output node voltages can encourage learning representations that can be adapted to new unseen topologies or prediction of new circuit level properties.
arXiv Detail & Related papers (2022-03-29T21:18:47Z) - SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines.
This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z) - Inductive biases and Self Supervised Learning in modelling a physical
heating system [0.0]
In this paper I infer inductive biases about a physical system.
I use these biases to derive a new neural network architecture that can model this real system.
The proposed architecture family called Delay can be used in a real scenario to control systems with delayed responses.
arXiv Detail & Related papers (2021-04-23T08:50:41Z) - Edge-assisted Democratized Learning Towards Federated Analytics [67.44078999945722]
We show the hierarchical learning structure of the proposed edge-assisted democratized learning mechanism, namely Edge-DemLearn.
We also validate Edge-DemLearn as a flexible model training mechanism to build a distributed control and aggregation methodology in regions.
arXiv Detail & Related papers (2020-12-01T11:46:03Z) - One-step regression and classification with crosspoint resistive memory
arrays [62.997667081978825]
High speed, low energy computing machines are in demand to enable real-time artificial intelligence at the edge.
One-step learning is supported by simulations of the prediction of the cost of a house in Boston and the training of a 2-layer neural network for MNIST digit recognition.
Results are all obtained in one computational step, thanks to the physical, parallel, and analog computing within the crosspoint array.
arXiv Detail & Related papers (2020-05-05T08:00:07Z) - Deep Learning for Ultra-Reliable and Low-Latency Communications in 6G
Networks [84.2155885234293]
We first summarize how to apply data-driven supervised deep learning and deep reinforcement learning in URLLC.
To address these open problems, we develop a multi-level architecture that enables device intelligence, edge intelligence, and cloud intelligence for URLLC.
arXiv Detail & Related papers (2020-02-22T14:38:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.