Fugu-MT 論文翻訳(概要): bispectrum: Selective $G$-Bispectra Made Practical

論文の概要: bispectrum: Selective $G$-Bispectra Made Practical

arxiv url: http://arxiv.org/abs/2605.07270v1
Date: Fri, 08 May 2026 05:35:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.824389
Title: bispectrum: Selective $G$-Bispectra Made Practical
Title（参考訳）: Bispectrum:Selective $G$-Bispectra Made Practical
Authors: Johan Mathe, Adele Myers, Simon Mataigne, Nina Miolane,
Abstract要約: 7つの異なるグループアクションに対して選択的な$G$-bispectraを実装するオープンソースライブラリであるbispectrumを提示する。 3つの古典的ベンチマークデータセット上のディープラーニングアーキテクチャに層をプールするために$G$-bispectraを組み込むことの利点を評価する。
参考スコア（独自算出の注目度）: 10.148766645678954
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Many machine learning tasks are invariant under the action of a group $G$ of transformations: signal classification can be invariant under translations, image classification under 2D rotations, and spherical-image classification under 3D rotations. The $G$-bispectrum is a principled complete invariant of a signal (retaining all all signal's information up to the group action) with proven benefits in machine learning and as a pooling layer in deep networks. However, its deployment has been hampered by high computational cost and a patchwork of group-specific implementations. We present bispectrum, an open-source, fully unit-tested PyTorch library that implements selective $G$-bispectra for seven different group actions, as differentiable modules that can be directly incorporated into machine learning pipelines and deep learning architectures. For finite groups $G$, selectivity reduces the computational cost from $O(|G|^2)$ to $O(|G|)$. For planar rotations, we leverage the disk bispectrum. For spherical 3D rotations, we introduce an augmented selective bispectrum at band-limit $L$ which reduces the cost from $O(L^3)$ to $Θ(L^2)$ coefficients. We profile the entire library (for which we implemented various compute optimizations), showing that it delivers near-exact $G$-invariance with its selective $G$-bispectra computed in sub-millisecond time on GPU (up to commonly used bandlimits). We evaluate the benefits of incorporating $G$-bispectra as pooling layers into deep learning architectures on three classical benchmark datasets --comparing against norm pooling, gated pooling, Fourier-ELU pooling, max pooling, and (non-equivariant) data-augmented convolutional baselines. Results show that $G$-bispectra consistently outperform alternatives in the low-data, moderate-capacity regime.
Abstract（参考訳）: 多くの機械学習タスクは変換群$G$の作用の下で不変であり、信号分類は変換の下で不変、画像分類は2次元回転で、球面画像分類は3次元回転で不変である。 G$-bispectrumは、(グループアクションまですべての信号の情報を保持する)信号の原則化された完全な不変量であり、機械学習およびディープネットワークにおけるプール層として証明された利点がある。しかし、その展開は高い計算コストとグループ固有の実装のパッチワークによって妨げられている。オープンソースで完全に単体テストされたPyTorchライブラリであるBispectrumを,7つのグループアクションに対して選択可能な$G$-bispectraを実装した,マシンラーニングパイプラインやディープラーニングアーキテクチャに直接組み込むことのできる,差別化可能なモジュールとして紹介する。有限群$G$の場合、選択性は計算コストを$O(|G|^2)$から$O(|G|)$に下げる。平面回転にはディスクバイスペクトルを利用する。球面3次元回転に対しては、帯域制限$L$で拡張選択双スペクトルを導入し、コストを$O(L^3)$から$O(L^2)$係数に削減する。我々は、ライブラリ全体(様々な計算最適化を実装した)をプロファイルし、GPU上でミリ秒以下の時間で計算される選択的な$G$-bispectra(一般的に使用されるバンドリミットまで)とほぼ同等の$G$-invarianceを提供することを示した。我々は、標準プーリング、ゲートプーリング、フーリエ-ELUプーリング、最大プーリング、および(不変ではない)データ拡張畳み込みベースラインと比較して、3つの古典的なベンチマークデータセットのディープラーニングアーキテクチャに、レイヤをプールするために$G$-bispectraを組み込むことの利点を評価する。結果は、$G$-bispectraが低データ、中程度の容量のシステムにおいて、一貫して代替よりも優れていることを示している。

論文の概要: bispectrum: Selective $G$-Bispectra Made Practical

関連論文リスト