Spintronics for image recognition: performance benchmarking via
ultrafast data-driven simulations
- URL: http://arxiv.org/abs/2308.05810v3
- Date: Wed, 7 Feb 2024 10:37:12 GMT
- Title: Spintronics for image recognition: performance benchmarking via
ultrafast data-driven simulations
- Authors: Anatole Moureaux and Chlo\'e Chopin and Simon de Wergifosse and
Laurent Jacques and Flavio Abreu Araujo
- Abstract summary: We present a demonstration of image classification using an echo-state network (ESN) relying on a single simulated spintronic nanostructure.
We employ an ultrafast data-driven simulation framework called the data-driven Thiele equation approach to simulate the STVO dynamics.
We showcase the versatility of our solution by successfully applying it to solve classification challenges with the MNIST, EMNIST-letters and Fashion MNIST datasets.
- Score: 4.2412715094420665
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a demonstration of image classification using an echo-state
network (ESN) relying on a single simulated spintronic nanostructure known as
the vortex-based spin-torque oscillator (STVO) delayed in time. We employ an
ultrafast data-driven simulation framework called the data-driven Thiele
equation approach (DD-TEA) to simulate the STVO dynamics. This allows us to
avoid the challenges associated with repeated experimental manipulation of such
a nanostructured system. We showcase the versatility of our solution by
successfully applying it to solve classification challenges with the MNIST,
EMNIST-letters and Fashion MNIST datasets. Through our simulations, we
determine that within an ESN with numerous learnable parameters the results
obtained using the STVO dynamics as an activation function are comparable to
the ones obtained with other conventional nonlinear activation functions like
the reLU and the sigmoid. While achieving state-of-the-art accuracy levels on
the MNIST dataset, our model's performance on EMNIST-letters and Fashion MNIST
is lower due to the relative simplicity of the system architecture and the
increased complexity of the tasks. We expect that the DD-TEA framework will
enable the exploration of deeper architectures, ultimately leading to improved
classification accuracy.
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - From Fourier to Neural ODEs: Flow Matching for Modeling Complex Systems [20.006163951844357]
We propose a simulation-free framework for training neural ordinary differential equations (NODEs)
We employ the Fourier analysis to estimate temporal and potential high-order spatial gradients from noisy observational data.
Our approach outperforms state-of-the-art methods in terms of training time, dynamics prediction, and robustness.
arXiv Detail & Related papers (2024-05-19T13:15:23Z) - A Generative Self-Supervised Framework using Functional Connectivity in
fMRI Data [15.211387244155725]
Deep neural networks trained on Functional Connectivity (FC) networks extracted from functional Magnetic Resonance Imaging (fMRI) data have gained popularity.
Recent research on the application of Graph Neural Network (GNN) to FC suggests that exploiting the time-varying properties of the FC could significantly improve the accuracy and interpretability of the model prediction.
High cost of acquiring high-quality fMRI data and corresponding labels poses a hurdle to their application in real-world settings.
We propose a generative SSL approach that is tailored to effectively harnesstemporal information within dynamic FC.
arXiv Detail & Related papers (2023-12-04T16:14:43Z) - Neuromorphic spintronics simulated using an unconventional data-driven
Thiele equation approach [0.0]
We develop a quantitative description of the dynamics of spin-torque vortex nano-oscillators (STVOs) through an unconventional model.
Our approach is promising for accelerating the design of STVO-based neuromorphic computing devices.
arXiv Detail & Related papers (2023-07-18T13:47:57Z) - Disentangling Structured Components: Towards Adaptive, Interpretable and
Scalable Time Series Forecasting [52.47493322446537]
We develop a adaptive, interpretable and scalable forecasting framework, which seeks to individually model each component of the spatial-temporal patterns.
SCNN works with a pre-defined generative process of MTS, which arithmetically characterizes the latent structure of the spatial-temporal patterns.
Extensive experiments are conducted to demonstrate that SCNN can achieve superior performance over state-of-the-art models on three real-world datasets.
arXiv Detail & Related papers (2023-05-22T13:39:44Z) - Batch-Ensemble Stochastic Neural Networks for Out-of-Distribution
Detection [55.028065567756066]
Out-of-distribution (OOD) detection has recently received much attention from the machine learning community due to its importance in deploying machine learning models in real-world applications.
In this paper we propose an uncertainty quantification approach by modelling the distribution of features.
We incorporate an efficient ensemble mechanism, namely batch-ensemble, to construct the batch-ensemble neural networks (BE-SNNs) and overcome the feature collapse problem.
We show that BE-SNNs yield superior performance on several OOD benchmarks, such as the Two-Moons dataset, the FashionMNIST vs MNIST dataset, FashionM
arXiv Detail & Related papers (2022-06-26T16:00:22Z) - Rank-R FNN: A Tensor-Based Learning Model for High-Order Data
Classification [69.26747803963907]
Rank-R Feedforward Neural Network (FNN) is a tensor-based nonlinear learning model that imposes Canonical/Polyadic decomposition on its parameters.
First, it handles inputs as multilinear arrays, bypassing the need for vectorization, and can thus fully exploit the structural information along every data dimension.
We establish the universal approximation and learnability properties of Rank-R FNN, and we validate its performance on real-world hyperspectral datasets.
arXiv Detail & Related papers (2021-04-11T16:37:32Z) - PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive
Learning [109.84770951839289]
We present PredRNN, a new recurrent network for learning visual dynamics from historical context.
We show that our approach obtains highly competitive results on three standard datasets.
arXiv Detail & Related papers (2021-03-17T08:28:30Z) - Coupled Oscillatory Recurrent Neural Network (coRNN): An accurate and
(gradient) stable architecture for learning long time dependencies [15.2292571922932]
We propose a novel architecture for recurrent neural networks.
Our proposed RNN is based on a time-discretization of a system of second-order ordinary differential equations.
Experiments show that the proposed RNN is comparable in performance to the state of the art on a variety of benchmarks.
arXiv Detail & Related papers (2020-10-02T12:35:04Z) - Convolutional Tensor-Train LSTM for Spatio-temporal Learning [116.24172387469994]
We propose a higher-order LSTM model that can efficiently learn long-term correlations in the video sequence.
This is accomplished through a novel tensor train module that performs prediction by combining convolutional features across time.
Our results achieve state-of-the-art performance-art in a wide range of applications and datasets.
arXiv Detail & Related papers (2020-02-21T05:00:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.