Fugu-MT 論文翻訳(概要): Neural Dynamics Self-Attention for Spiking Transformers

論文の概要: Neural Dynamics Self-Attention for Spiking Transformers

arxiv url: http://arxiv.org/abs/2603.19290v1
Date: Mon, 09 Mar 2026 12:23:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 02:36:12.849355
Title: Neural Dynamics Self-Attention for Spiking Transformers
Title（参考訳）: スパイキング変圧器のためのニューラルダイナミクス自己注意
Authors: Dehao Zhang, Fukai Guo, Shuai Wang, Jingya Wang, Jieyuan Zhang, Yimeng Shan, Malu Zhang, Yang Yang, Haizhou Li,
Abstract要約: TransformerアーキテクチャによるSNN(Spike Neural Networks)は、エネルギー効率とパフォーマンスのバランスをとるための、有望な経路を提供する。既存のスパイキングトランスフォーマーは、(i)ニューラルネットワーク(ANN)と(ii)推論時の高メモリオーバーヘッドと比較して、大幅なパフォーマンスギャップがあるという2つの重要な課題に直面している。本稿では、局所受容野を持つスパイキングニューロンを用いて、メモリ要求を減らしながら注意を計算するLRF-Dynを提案する。
参考スコア（独自算出の注目度）: 59.57278823240336
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Integrating Spiking Neural Networks (SNNs) with Transformer architectures offers a promising pathway to balance energy efficiency and performance, particularly for edge vision applications. However, existing Spiking Transformers face two critical challenges: (i) a substantial performance gap compared to their Artificial Neural Networks (ANNs) counterparts and (ii) high memory overhead during inference. Through theoretical analysis, we attribute both limitations to the Spiking Self-Attention (SSA) mechanism: the lack of locality bias and the need to store large attention matrices. Inspired by the localized receptive fields (LRF) and membrane-potential dynamics of biological visual neurons, we propose LRF-Dyn, which uses spiking neurons with localized receptive fields to compute attention while reducing memory requirements. Specifically, we introduce a LRF method into SSA to assign higher weights to neighboring regions, strengthening local modeling and improving performance. Building on this, we approximate the resulting attention computation via charge-fire-reset dynamics, eliminating explicit attention-matrix storage and reducing inference-time memory. Extensive experiments on visual tasks confirm that our method reduces memory overhead while delivering significant performance improvements. These results establish it as a key unit for achieving energy-efficient Spiking Transformers.
Abstract（参考訳）: スパイキングニューラルネットワーク(SNN)とTransformerアーキテクチャを統合することで、特にエッジビジョンアプリケーションにおいて、エネルギー効率とパフォーマンスのバランスをとる上で有望な経路を提供する。しかし、既存のスパイキングトランスフォーマーは2つの重要な課題に直面している。 (i)Artificial Neural Networks(ANN)と同等の性能差 (ii)推論時のメモリオーバーヘッドが高いこと。理論的分析を通じて、局所性バイアスの欠如と大きな注意行列の保存の必要性という、スパイキング自己注意機構(SSA)の2つの限界を考察した。生体視覚ニューロンの局所受容野(LRF)と膜電位ダイナミクスに着想を得て,局所受容野を持つスパイキングニューロンを用いて,記憶要求を低減しつつ注意を計算できるLRF-Dynを提案する。具体的には, SSA に LRF 法を導入し, 近隣地域への重み付けを行い, 局所モデリングを強化し, 性能向上を図る。これに基づいて、チャージ・ファイア・リセット・ダイナミクスによるアテンション計算を近似し、明示的なアテンション・マトリクス記憶を排除し、推論時メモリを削減する。視覚的タスクに関する大規模な実験により,本手法はメモリオーバーヘッドを低減し,大幅な性能向上を実現する。これらの結果はエネルギー効率の高いスパイキング変換器を実現するための鍵単位として確立されている。

論文の概要: Neural Dynamics Self-Attention for Spiking Transformers

関連論文リスト