Fugu-MT 論文翻訳(概要): SpikePool: Event-driven Spiking Transformer with Pooling Attention

論文の概要: SpikePool: Event-driven Spiking Transformer with Pooling Attention

arxiv url: http://arxiv.org/abs/2510.12102v1
Date: Tue, 14 Oct 2025 03:08:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-15 19:02:32.169136
Title: SpikePool: Event-driven Spiking Transformer with Pooling Attention
Title（参考訳）: SpikePool: イベント駆動型スパイクトランス
Authors: Donghyun Lee, Alex Sima, Yuhang Li, Panos Stinis, Priyadarshini Panda,
Abstract要約: スパイキングニューラルネットワーク(SNN)は、トランスフォーマーアーキテクチャとますます統合されている。現在のアプローチは主に、基礎となる信号処理特性を解析せずにアーキテクチャの変更に焦点を当てている。我々は周波数スペクトル領域を通してスパイキングトランスを解析し、ハイパスフィルタとして振る舞うことを発見した。スパイクベースの自己アテンションを最大プールアテンションに置き換えたSpikePoolを提案する。
参考スコア（独自算出の注目度）: 17.15887489143204
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Building on the success of transformers, Spiking Neural Networks (SNNs) have increasingly been integrated with transformer architectures, leading to spiking transformers that demonstrate promising performance on event-based vision tasks. However, despite these empirical successes, there remains limited understanding of how spiking transformers fundamentally process event-based data. Current approaches primarily focus on architectural modifications without analyzing the underlying signal processing characteristics. In this work, we analyze spiking transformers through the frequency spectrum domain and discover that they behave as high-pass filters, contrasting with Vision Transformers (ViTs) that act as low-pass filters. This frequency domain analysis reveals why certain designs work well for event-based data, which contains valuable high-frequency information but is also sparse and noisy. Based on this observation, we propose SpikePool, which replaces spike-based self-attention with max pooling attention, a low-pass filtering operation, to create a selective band-pass filtering effect. This design preserves meaningful high-frequency content while capturing critical features and suppressing noise, achieving a better balance for event-based data processing. Our approach demonstrates competitive results on event-based datasets for both classification and object detection tasks while significantly reducing training and inference time by up to 42.5% and 32.8%, respectively.
Abstract（参考訳）: トランスフォーマーの成功に基づいて、スパイキングニューラルネットワーク(SNN)は、ますますトランスフォーマーアーキテクチャと統合され、イベントベースのビジョンタスクで有望なパフォーマンスを示すスパイキングトランスフォーマーにつながっている。しかしながら、これらの経験的な成功にもかかわらず、トランスフォーマーのスパイクがイベントベースのデータを根本的に処理する方法についての理解は限られている。現在のアプローチは主に、基礎となる信号処理特性を解析せずにアーキテクチャの変更に焦点を当てている。本研究では、周波数スペクトル領域を通してスパイキングトランスフォーマーを解析し、低域フィルタとして機能するビジョントランス(ViT)と対比して、ハイパスフィルタとして振る舞うことを発見する。この周波数領域分析は、ある設計が貴重な高周波情報を含むイベントベースデータに対してうまく機能する理由を明らかにしている。そこで本研究では,スパイクに基づく自己注意を最大プールアテンションに置き換えたSpikePoolを提案する。この設計は、重要な特徴を捉え、ノイズを抑えながら有意義な高周波コンテンツを保存し、イベントベースのデータ処理のバランスを改善する。提案手法では,分類タスクとオブジェクト検出タスクの両方を対象としたイベントベースデータセットの競合結果を示すとともに,トレーニング時間と推論時間をそれぞれ42.5%,32.8%まで大幅に短縮する。

論文の概要: SpikePool: Event-driven Spiking Transformer with Pooling Attention

関連論文リスト