Fugu-MT 論文翻訳(概要): Transolver is a Linear Transformer: Revisiting Physics-Attention through the Lens of Linear Attention

論文の概要: Transolver is a Linear Transformer: Revisiting Physics-Attention through the Lens of Linear Attention

arxiv url: http://arxiv.org/abs/2511.06294v2
Date: Wed, 12 Nov 2025 01:38:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-12 16:10:52.870332
Title: Transolver is a Linear Transformer: Revisiting Physics-Attention through the Lens of Linear Attention
Title（参考訳）: Transolverはリニアトランスである:リニアアテンションのレンズを通して物理アテンションを再考する
Authors: Wenjie Hu, Sidun Liu, Peng Qiao, Zhenglun Sun, Yong Dou,
Abstract要約: そこで本研究では,物理注意を正準線形の注意に再設計するための2段階の変換を提案する。提案手法は,6つの標準PDEベンチマーク上での最先端性能を実現する。パラメータの数を平均40.0%削減し、計算コストを36.2%削減する。
参考スコア（独自算出の注目度）: 17.072389584390425
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advances in Transformer-based Neural Operators have enabled significant progress in data-driven solvers for Partial Differential Equations (PDEs). Most current research has focused on reducing the quadratic complexity of attention to address the resulting low training and inference efficiency. Among these works, Transolver stands out as a representative method that introduces Physics-Attention to reduce computational costs. Physics-Attention projects grid points into slices for slice attention, then maps them back through deslicing. However, we observe that Physics-Attention can be reformulated as a special case of linear attention, and that the slice attention may even hurt the model performance. Based on these observations, we argue that its effectiveness primarily arises from the slice and deslice operations rather than interactions between slices. Building on this insight, we propose a two-step transformation to redesign Physics-Attention into a canonical linear attention, which we call Linear Attention Neural Operator (LinearNO). Our method achieves state-of-the-art performance on six standard PDE benchmarks, while reducing the number of parameters by an average of 40.0% and computational cost by 36.2%. Additionally, it delivers superior performance on two challenging, industrial-level datasets: AirfRANS and Shape-Net Car.
Abstract（参考訳）: トランスフォーマーベースのニューラル演算子の最近の進歩は、部分微分方程式(PDE)のためのデータ駆動型解法(英語版)の大幅な進歩を可能にしている。最近の研究は、トレーニングの低さと推論効率に対処するために注意の二次的な複雑さを減らすことに重点を置いている。これらの研究の中で、Transolverは計算コストを削減するために物理認識を導入する代表的手法として際立っている。 Physics-Attentionプロジェクトは、スライス注意のためにグリッドをスライスに向け、デリシングを通じてそれらをマップする。しかし,物理注意を線形注意の特別な場合として再定義することは可能であり,スライス注意がモデル性能を損なう可能性もある。これらの観察結果から,スライス間の相互作用よりもスライス操作やスライス操作から有効性が導かれると論じる。そこで我々は,この知見に基づいて,物理注意を正準線形注意に再設計するための2段階の変換を提案し,これを線形注意ニューラルネットワーク (LinearNO) と呼ぶ。提案手法は6つの標準PDEベンチマーク上での最先端性能を実現し,パラメータの数を平均40.0%,計算コスト36.2%削減する。さらに、AirfRANSとShape-Net Carという、2つの困難な産業レベルのデータセットに対して、優れたパフォーマンスを提供する。

論文の概要: Transolver is a Linear Transformer: Revisiting Physics-Attention through the Lens of Linear Attention

関連論文リスト