Fugu-MT Paper Translation (Abstract): LaplacianFormer: Rethinking Linear Attention with Laplacian Kernel

Paper overview: LaplacianFormer: Rethinking Linear Attention with Laplacian Kernel

arXiv URL: http://arxiv.org/abs/2604.20368v1
Date: Wed, 22 Apr 2026 09:04:54 GMT
Status: Translation complete
Last updated in system: 2026-04-23 15:36:11.060343
Title: LaplacianFormer: Rethinking Linear Attention with Laplacian Kernel
Title (reference translation): LaplacianFormer: Rethinking Linear Attention with the Laplacian Kernel
Authors: Zhe Feng, Sen Lian, Changwei Wang, Muyang Zhang, Tianlong Tan, Rongtao Xu, Weiliang Meng, Xiaopeng Zhang
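Abstract summary: The quadratic complexity of softmax attention is a major obstacle to scaling Transformers to high-resolution vision tasks. We propose LaplacianFormer, a Transformer variant that uses a Laplacian kernel in place of softmax. Experiments on ImageNet show that LaplacianFormer achieves strong performance-efficiency trade-offs while improving attention expressiveness.
Reference score (independently computed interest level): 27.87296519831803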
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The quadratic complexity of softmax attention presents a major obstacle for scaling Transformers to high-resolution vision tasks. Existing linear attention variants often replace the softmax with Gaussian kernels to reduce complexity, but such approximations lack theoretical grounding and tend to oversuppress mid-range token interactions. We propose LaplacianFormer, a Transformer variant that employs a Laplacian kernel as a principled alternative to softmax, motivated by empirical observations and theoretical analysis. To address expressiveness degradation under low-rank approximations, we introduce a provably injective feature map that retains fine-grained token information. For efficient computation, we adopt a Nyström approximation of the kernel matrix and solve the resulting system using Newton-Schulz iteration, avoiding costly matrix inversion and SVD. We further develop custom CUDA implementations for both the kernel and solver, enabling high-throughput forward and backward passes suitable for edge deployment. Experiments on ImageNet show that LaplacianFormer achieves strong performance-efficiency trade-offs while improving attention expressiveness.
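Abstract (reference translation): The quadratic complexity of softmax attention is a major obstacle to scaling Transformers to high-resolution vision tasks. Existing linear attention variants often replace softmax with Gaussian kernels to reduce complexity, but such approximations lack theoretical grounding and tend to suppress mid-range token interactions. We propose LaplacianFormer, a Transformer variant that uses a Laplacian kernel as a principled alternative to softmax, motivated by empirical observations and theoretical analysis. To address the loss of expressiveness under low-rank approximation, we introduce a provably injective feature map that retains fine-grained token information. For efficient computation, we adopt a Nyström approximation of the kernel matrix and use Newton-Schulz iteration, avoiding costly matrix inversion and SVD. In addition, we develop custom CUDA implementations of both the kernel and the solver, achieving high-throughput forward and backward passes suitable for edge deployment. Experiments on ImageNet show that LaplacianFormer achieves a strong performance-efficiency trade-off while improving attention expressiveness.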
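
The Nyström and Newton-Schulz steps mentioned in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the paper describes custom CUDA kernels and an injective feature map, neither of which is reproduced here. The L1 form of the Laplacian kernel, the bandwidth sigma, the row normalization, the choice of landmarks, and all function names below are assumptions made for illustration only.

```python
import math


def laplacian_kernel(x, y, sigma=1.0):
    """k(x, y) = exp(-||x - y||_1 / sigma). The L1 form is an assumption."""
    return math.exp(-sum(abs(a - b) for a, b in zip(x, y)) / sigma)


def matmul(A, B):
    """Plain list-of-lists matrix product."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def newton_schulz_inverse(A, iters=30):
    """Approximate A^{-1} with X <- X (2I - A X), using only matrix products."""
    n = len(A)
    norm_1 = max(sum(abs(A[i][j]) for i in range(n)) for j in range(n))
    norm_inf = max(sum(abs(v) for v in row) for row in A)
    # Classical starting point X0 = A^T / (||A||_1 * ||A||_inf).
    X = [[A[j][i] / (norm_1 * norm_inf) for j in range(n)] for i in range(n)]
    for _ in range(iters):
        AX = matmul(A, X)
        R = [[(2.0 if i == j else 0.0) - AX[i][j] for j in range(n)]
             for i in range(n)]
        X = matmul(X, R)
    return X


def nystrom_laplacian_attention(Q, K, V, landmarks, sigma=1.0):
    """Kernel attention approximated as K_qm K_mm^{-1} K_mk V, row-normalized."""
    K_qm = [[laplacian_kernel(q, l, sigma) for l in landmarks] for q in Q]
    K_mm = [[laplacian_kernel(a, b, sigma) for b in landmarks] for a in landmarks]
    K_mk = [[laplacian_kernel(l, k, sigma) for k in K] for l in landmarks]
    K_mm_inv = newton_schulz_inverse(K_mm)
    # A trailing column of ones makes the same product yield the normalizer.
    V1 = [row + [1.0] for row in V]
    out = matmul(K_qm, matmul(K_mm_inv, matmul(K_mk, V1)))
    return [[v / row[-1] for v in row[:-1]] for row in out]


def exact_laplacian_attention(Q, K, V, sigma=1.0):
    """Reference: full (quadratic-cost) row-normalized kernel attention."""
    W = [[laplacian_kernel(q, k, sigma) for k in K] for q in Q]
    d = len(V[0])
    return [[sum(w * v[j] for w, v in zip(row, V)) / sum(row) for j in range(d)]
            for row in W]


if __name__ == "__main__":
    keys = [[0.0, 0.0], [1.0, 0.5], [-0.5, 2.0], [2.0, -1.0]]
    queries = [[0.2, 0.1], [1.5, -0.5]]
    values = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [-1.0, 3.0]]
    # Two landmarks for four keys: a low-rank approximation of the full kernel.
    print(nystrom_laplacian_attention(queries, keys, values, landmarks=keys[:2]))
    print(exact_laplacian_attention(queries, keys, values))
```

When the landmarks coincide with the keys, the Nyström factorization is exact and the result matches direct kernel attention; the cost saving would come from using far fewer landmarks than tokens, at the price of an approximation error that this sketch does not bound.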

