Fugu-MT 論文翻訳(概要): E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory

論文の概要: E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory

arxiv url: http://arxiv.org/abs/2601.16622v1
Date: Fri, 23 Jan 2026 10:20:08 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-26 14:27:27.63212
Title: E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory
Title（参考訳）: E2Former-V2:線形アクティベーションメモリによるオンザフライ同変アテンション
Authors: Lin Huang, Chengxiang Huang, Ziang Wang, Yiyue Du, Chu Wang, Haocheng Lu, Yunyang Li, Xiaoli Liu, Arthur Jiang, Jia Zhang,
Abstract要約: Equivariant Graph Neural Networks (EGNN) は3次元原子論システムのモデリングに広く利用されている。 textbfE2Former-V2は,代数的空間性とハードウェア対応の実行を統合するスケーラブルなアーキテクチャである。 E2Former-V2は、推論を加速しながら、同等の予測性能を維持している。
参考スコア（独自算出の注目度）: 13.451231889715542
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Equivariant Graph Neural Networks (EGNNs) have become a widely used approach for modeling 3D atomistic systems. However, mainstream architectures face critical scalability bottlenecks due to the explicit construction of geometric features or dense tensor products on \textit{every} edge. To overcome this, we introduce \textbf{E2Former-V2}, a scalable architecture that integrates algebraic sparsity with hardware-aware execution. We first propose \textbf{E}quivariant \textbf{A}xis-\textbf{A}ligned \textbf{S}parsification (EAAS). EAAS builds on Wigner-$6j$ convolution by exploiting an $\mathrm{SO}(3) \rightarrow \mathrm{SO}(2)$ change of basis to transform computationally expensive dense tensor contractions into efficient, sparse parity re-indexing operations. Building on this representation, we introduce \textbf{On-the-Fly Equivariant Attention}, a fully node-centric mechanism implemented via a custom fused Triton kernel. By eliminating materialized edge tensors and maximizing SRAM utilization, our kernel achieves a \textbf{20$\times$ improvement in TFLOPS} compared to standard implementations. Extensive experiments on the SPICE and OMol25 datasets demonstrate that E2Former-V2 maintains comparable predictive performance while notably accelerating inference. This work demonstrates that large equivariant transformers can be trained efficiently using widely accessible GPU platforms. The code is avalible at https://github.com/IQuestLab/UBio-MolFM/tree/e2formerv2.
Abstract（参考訳）: Equivariant Graph Neural Networks (EGNN) は3次元原子論システムのモデリングに広く利用されている。しかし、主流アーキテクチャは、幾何的特徴や高密度テンソル積が \textit{every} エッジ上に明確に構築されているため、重要なスケーラビリティのボトルネックに直面している。これを解決するために,代数的疎結合とハードウェア対応実行を統合するスケーラブルアーキテクチャである \textbf{E2Former-V2} を導入する。まず、まず、textbf{E}quivariant \textbf{A}xis-\textbf{A}ligned \textbf{S}parsification (EAAS)を提案する。 EAAS は Wigner-$6j$ の畳み込みを$\mathrm{SO}(3) \rightarrow \mathrm{SO}(2)$ の基底変更を利用して構築する。この表現に基づいて、カスタムの融合トリトンカーネルによって実装された完全にノード中心のメカニズムである「textbf{On-the-Fly Equivariant Attention}」を導入する。実効化されたエッジテンソルを排除し、SRAMの利用を最大化することにより、カーネルは標準実装と比較して、 TFLOPS の \textbf{20$\times$ Improvement を達成する。 SPICEとOMol25データセットに関する大規模な実験は、E2Former-V2が推論を加速しながら、同等の予測性能を維持していることを示している。この研究は、広くアクセス可能なGPUプラットフォームを使用して、大きな同変トランスフォーマーを効率的に訓練できることを実証している。コードはhttps://github.com/IQuestLab/UBio-MolFM/tree/e2formerv2で公開されている。

論文の概要: E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory

関連論文リスト