Multi-level, Forming Free, Bulk Switching Trilayer RRAM for Neuromorphic
Computing at the Edge
- URL: http://arxiv.org/abs/2310.13844v1
- Date: Fri, 20 Oct 2023 22:37:46 GMT
- Title: Multi-level, Forming Free, Bulk Switching Trilayer RRAM for Neuromorphic
Computing at the Edge
- Authors: Jaeseoung Park (1), Ashwani Kumar (1), Yucheng Zhou (1), Sangheon Oh
(1), Jeong-Hoon Kim (1), Yuhan Shi (1), Soumil Jain (2), Gopabandhu Hota (1),
Amelie L. Nagle (3), Catherine D. Schuman (4), Gert Cauwenberghs (2) and
Duygu Kuzum (1) ((1) Department of Electrical and Computer Engineering, (2)
Department of Bioengineering, University of California, San Diego, CA, USA.
(3) Department of Computer Science, Massachusetts Institute of Technology,
MA, USA. (4) Department of Electrical Engineering and Computer Science,
University of Tennessee, TN, USA.)
- Abstract summary: We develop a forming-free and bulk switching RRAM technology based on a trilayer metal-oxide stack.
We develop a neuromorphic compute-in-memory platform based on trilayer bulk RRAM crossbars.
Our work paves the way for neuromorphic computing at the edge under strict size, weight, and power constraints.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Resistive memory-based reconfigurable systems constructed by CMOS-RRAM
integration hold great promise for low energy and high throughput neuromorphic
computing. However, most RRAM technologies relying on filamentary switching
suffer from variations and noise leading to computational accuracy loss,
increased energy consumption, and overhead by expensive program and verify
schemes. Low ON-state resistance of filamentary RRAM devices further increases
the energy consumption due to high-current read and write operations, and
limits the array size and parallel multiply & accumulate operations.
High-forming voltages needed for filamentary RRAM are not compatible with
advanced CMOS technology nodes. To address all these challenges, we developed a
forming-free and bulk switching RRAM technology based on a trilayer metal-oxide
stack. We systematically engineered a trilayer metal-oxide RRAM stack and
investigated the switching characteristics of RRAM devices with varying
thicknesses and oxygen vacancy distributions across the trilayer to achieve
reliable bulk switching without any filament formation. We demonstrated bulk
switching operation at megaohm regime with high current nonlinearity and
programmed up to 100 levels without compliance current. We developed a
neuromorphic compute-in-memory platform based on trilayer bulk RRAM crossbars
by combining energy-efficient switched-capacitor voltage sensing circuits with
differential encoding of weights to experimentally demonstrate high-accuracy
matrix-vector multiplication. We showcased the computational capability of bulk
RRAM crossbars by implementing a spiking neural network model for an autonomous
navigation/racing task. Our work addresses challenges posed by existing RRAM
technologies and paves the way for neuromorphic computing at the edge under
strict size, weight, and power constraints.
Related papers
- Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution [49.902047563260496]
We develop the first attempt to integrate the Vision State Space Model (Mamba) for remote sensing image (RSI) super-resolution.
To achieve better SR reconstruction, building upon Mamba, we devise a Frequency-assisted Mamba framework, dubbed FMSR.
Our FMSR features a multi-level fusion architecture equipped with the Frequency Selection Module (FSM), Vision State Space Module (VSSM), and Hybrid Gate Module (HGM)
arXiv Detail & Related papers (2024-05-08T11:09:24Z) - Efficient and accurate neural field reconstruction using resistive memory [52.68088466453264]
Traditional signal reconstruction methods on digital computers face both software and hardware challenges.
We propose a systematic approach with software-hardware co-optimizations for signal reconstruction from sparse inputs.
This work advances the AI-driven signal restoration technology and paves the way for future efficient and robust medical AI and 3D vision applications.
arXiv Detail & Related papers (2024-04-15T09:33:09Z) - Random resistive memory-based deep extreme point learning machine for
unified visual processing [67.51600474104171]
We propose a novel hardware-software co-design, random resistive memory-based deep extreme point learning machine (DEPLM)
Our co-design system achieves huge energy efficiency improvements and training cost reduction when compared to conventional systems.
arXiv Detail & Related papers (2023-12-14T09:46:16Z) - Evaluation of STT-MRAM as a Scratchpad for Training in ML Accelerators [9.877596714655096]
Training deep neural networks (DNNs) is an extremely memory-intensive process.
Spin-Transfer-Torque MRAM (STT-MRAM) offers several desirable properties for training accelerators.
We show that MRAM provide up to 15-22x improvement in system level energy.
arXiv Detail & Related papers (2023-08-03T20:36:48Z) - Scaling Limits of Memristor-Based Routers for Asynchronous Neuromorphic
Systems [2.5264231114078353]
Multi-core neuromorphic systems typically use on-chip routers to transmit spikes among cores.
A promising alternative is to exploit the features of memristive crossbar arrays and use them as programmable switch-matrices that route spikes.
We study the challenges of memristive crossbar arrays, when used as routing channels to transmit spikes in asynchronous Spiking Neural Network (SNN) hardware.
arXiv Detail & Related papers (2023-07-16T17:50:24Z) - TL-nvSRAM-CIM: Ultra-High-Density Three-Level ReRAM-Assisted
Computing-in-nvSRAM with DC-Power Free Restore and Ternary MAC Operations [8.669532093397065]
This work proposes an ultra-high-density three-level ReRAMs-assisted computing scheme for large NN models.
The proposed TL-nvSRAM-CIM achieves 7.8x higher storage density, compared with the state-art works.
arXiv Detail & Related papers (2023-07-06T01:46:06Z) - The Combination of Metal Oxides as Oxide Layers for RRAM and Artificial
Intelligence [0.0]
Review paper provides an overview of the combination of metal oxides-based RRAM and AI.
We discuss the use of AI to improve the performance of RRAM devices and the use of RRAM to power AI.
We address key challenges in the field and provide insights into future research directions.
arXiv Detail & Related papers (2023-04-29T04:15:50Z) - GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction [50.248694764703714]
Unrolled neural networks have recently achieved state-of-the-art accelerated MRI reconstruction.
These networks unroll iterative optimization algorithms by alternating between physics-based consistency and neural-network based regularization.
We propose Greedy LEarning for Accelerated MRI reconstruction, an efficient training strategy for high-dimensional imaging settings.
arXiv Detail & Related papers (2022-07-18T06:01:29Z) - Improving Inference Lifetime of Neuromorphic Systems via Intelligent
Synapse Mapping [0.2578242050187029]
An RRAM cell can switch its state after reading its content a certain number of times.
We propose an architectural solution to extend the read endurance of RRAM-based neuromorphic systems.
arXiv Detail & Related papers (2021-06-16T20:12:47Z) - Efficient Micro-Structured Weight Unification and Pruning for Neural
Network Compression [56.83861738731913]
Deep Neural Network (DNN) models are essential for practical applications, especially for resource limited devices.
Previous unstructured or structured weight pruning methods can hardly truly accelerate inference.
We propose a generalized weight unification framework at a hardware compatible micro-structured level to achieve high amount of compression and acceleration.
arXiv Detail & Related papers (2021-06-15T17:22:59Z) - SmartDeal: Re-Modeling Deep Network Weights for Efficient Inference and
Training [82.35376405568975]
Deep neural networks (DNNs) come with heavy parameterization, leading to external dynamic random-access memory (DRAM) for storage.
We present SmartDeal (SD), an algorithm framework to trade higher-cost memory storage/access for lower-cost computation.
We show that SD leads to 10.56x and 4.48x reduction in the storage and training energy, with negligible accuracy loss compared to state-of-the-art training baselines.
arXiv Detail & Related papers (2021-01-04T18:54:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.