Fugu-MT 論文翻訳(概要): Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers

論文の概要: Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers

arxiv url: http://arxiv.org/abs/2605.20289v1
Date: Tue, 19 May 2026 06:59:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.273384
Title: Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers
Title（参考訳）: プラグアンドプレイスパイク演算子:スパイク変圧器の非線形ボタネックを破る
Authors: Xinzhe Yuan, Xiang Peng, Bin Gu, Huan Xiong,
Abstract要約: 本稿では,Transformer の非線形性に対するスパイクフレンドリーな近似を実装し,既存の ANN-to-SNN パイプラインに統合するプラグイン・アンド・プレイフレームワークを提案する。本手法は,これらの非線形計算を,分割,指数化,および$ell$ノルムの3つの繰り返しプリミティブに分解し,集団群を介して実現する。実験により、対象とする非線形演算子を選択的に置き換えることにより、評価された全てのタスクに対して11%の精度低下が生じることが示された。
参考スコア（独自算出の注目度）: 32.743518029177146
License: http://creativecommons.org/licenses/by/4.0/
Abstract: ANN-to-SNN conversion offers a practical, training-free route to spiking large language models. However, current pipelines primarily focus on spike-driven realizations for Transformer linear-algebra operations, while providing limited support for key nonlinear operators. This gap limits compatibility with neuromorphic-style execution constraints, where such nonlinearities typically require division, exponentiation, or norm computations that are not naturally supported by standard leaky integrate-and-fire dynamics. To solve this problem, we propose a plug-and-play framework that implements spike-friendly approximations for Transformer nonlinearities and integrates into existing ANN-to-SNN pipelines. Our method decomposes these nonlinear computations into three recurring primitives -- division, exponentiation, and $\ell_2$ norms -- and realizes them via population computation using LIF neuron groups, combined with lightweight bit-shift scaling to avoid floating-point arithmetic. By composing these primitives as modular operator blocks, our framework supports common Transformer nonlinearities (e.g., Softmax, SiLU, and normalization) without any fine-tuning. Experiments on a range of LLMs Transformers show that selectively replacing the targeted nonlinear operators incurs less than a $1\%$ accuracy drop across all evaluated tasks.
Abstract（参考訳）: ANN-to-SNN変換は、大規模な言語モデルをスパイクするための実践的でトレーニングのない経路を提供する。しかし、現在のパイプラインは主にTransformer線形代数演算のためのスパイク駆動実現に焦点を当て、重要な非線形演算子を限定的にサポートしている。このギャップはニューロモルフィックスタイルの実行制約との互換性に制限され、そのような非線形性は通常、分割、指数化、あるいは標準の漏れた積分と発火のダイナミクスによって自然にサポートされないノルム計算を必要とする。この問題を解決するために,トランスフォーマーの非線形性に対するスパイクフレンドリーな近似を実装し,既存のANN-to-SNNパイプラインに統合するプラグイン・アンド・プレイフレームワークを提案する。提案手法は,これらの非線形計算を分割,指数化,および$\ell_2$ノルムの3つの繰り返しプリミティブに分解し,LIFニューロン群を用いた集団計算と軽量ビットシフトスケーリングを組み合わせることで浮動小数点演算を回避する。これらのプリミティブをモジュラー演算子ブロックとして構成することにより、我々のフレームワークは微調整なしで共通のトランスフォーマー非線形性(例えば、ソフトマックス、SiLU、正規化)をサポートする。 LLM変換器の実験では、対象の非線形演算子を選択的に置き換えることによって、評価された全てのタスクに対して1\%の精度低下が生じる。

論文の概要: Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers

関連論文リスト