Fugu-MT 論文翻訳(概要): Pruning random resistive memory for optimizing analogue AI

論文の概要: Pruning random resistive memory for optimizing analogue AI

arxiv url: http://arxiv.org/abs/2311.07164v1
Date: Mon, 13 Nov 2023 08:59:01 GMT
ステータス: 翻訳完了
システム内更新日: 2023-11-14 14:59:32.858276
Title: Pruning random resistive memory for optimizing analogue AI
Title（参考訳）: アナログai最適化のためのプルーニングランダム抵抗メモリ
Authors: Yi Li, Songqi Wang, Yaping Zhao, Shaocong Wang, Woyu Zhang, Yangu He, Ning Lin, Binbin Cui, Xi Chen, Shiming Zhang, Hao Jiang, Peng Lin, Xumeng Zhang, Xiaojuan Qi, Zhongrui Wang, Xiaoxin Xu, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu
Abstract要約: AIモデルは、エネルギー消費と環境持続可能性に前例のない課題を提示する。有望な解決策の1つは、アナログコンピューティングを再考することである。ここでは、構造的塑性に着想を得たエッジプルーニングを用いたユニバーサルソリューション、ソフトウェア・ハードウエアの共設計について報告する。
参考スコア（独自算出の注目度）: 54.21621702814583
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: The rapid advancement of artificial intelligence (AI) has been marked by the large language models exhibiting human-like intelligence. However, these models also present unprecedented challenges to energy consumption and environmental sustainability. One promising solution is to revisit analogue computing, a technique that predates digital computing and exploits emerging analogue electronic devices, such as resistive memory, which features in-memory computing, high scalability, and nonvolatility. However, analogue computing still faces the same challenges as before: programming nonidealities and expensive programming due to the underlying devices physics. Here, we report a universal solution, software-hardware co-design using structural plasticity-inspired edge pruning to optimize the topology of a randomly weighted analogue resistive memory neural network. Software-wise, the topology of a randomly weighted neural network is optimized by pruning connections rather than precisely tuning resistive memory weights. Hardware-wise, we reveal the physical origin of the programming stochasticity using transmission electron microscopy, which is leveraged for large-scale and low-cost implementation of an overparameterized random neural network containing high-performance sub-networks. We implemented the co-design on a 40nm 256K resistive memory macro, observing 17.3% and 19.9% accuracy improvements in image and audio classification on FashionMNIST and Spoken digits datasets, as well as 9.8% (2%) improvement in PR (ROC) in image segmentation on DRIVE datasets, respectively. This is accompanied by 82.1%, 51.2%, and 99.8% improvement in energy efficiency thanks to analogue in-memory computing. By embracing the intrinsic stochasticity and in-memory computing, this work may solve the biggest obstacle of analogue computing systems and thus unleash their immense potential for next-generation AI hardware.
Abstract（参考訳）: 人工知能(AI)の急速な進歩は、人間のような知性を示す大きな言語モデルによって特徴づけられている。しかし、これらのモデルはまた、エネルギー消費と環境持続可能性に前例のない課題を呈している。 1つの有望な解決策はアナログコンピューティングを再検討することであり、これはデジタルコンピューティングよりも早く、抵抗性メモリのような、インメモリコンピューティング、高いスケーラビリティ、非揮発性を備えた新しいアナログ電子デバイスを利用する技術である。しかし、アナログコンピューティングは以前と同じ課題に直面している。プログラミングの非理想性と高価なプログラミングは、基礎となるデバイス物理学のためである。本稿では,ランダム重み付けされたアナログ抵抗型メモリニューラルネットワークのトポロジーを最適化するために,構造可塑性に触発されたエッジプルーニングを用いたソフトウェアハードウェア共同設計について報告する。ソフトウェア面では、ランダム重み付けされたニューラルネットワークのトポロジーは、抵抗メモリ重みを正確にチューニングするのではなく、接続をプルーニングすることで最適化される。ハードウェア面では、高性能サブネットワークを含む過パラメータランダムニューラルネットワークの大規模かつ低コスト実装に利用される透過型電子顕微鏡を用いて、プログラミング確率性の物理的起源を明らかにする。我々は,40nm 256K 抵抗型メモリマクロに共設計を実装し,FashionMNIST と Spoken を用いた画像分類では17.3%,音声分類では19.9%,DRIVE を用いた画像分割では9.8% (2%) の精度向上を実現した。これには82.1%、51.2%、99.8%のエネルギー効率向上が伴う。内在的な確率性とインメモリコンピューティングを取り入れることで、この研究はアナログコンピューティングシステムの最大の障害を解決し、次世代aiハードウェアの膨大な可能性を解き放ちます。

論文の概要: Pruning random resistive memory for optimizing analogue AI

関連論文リスト