Fugu-MT 論文翻訳(概要): Sampling Parallelism for Fast and Efficient Bayesian Learning

論文の概要: Sampling Parallelism for Fast and Efficient Bayesian Learning

arxiv url: http://arxiv.org/abs/2604.04736v1
Date: Mon, 06 Apr 2026 15:03:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:19.239418
Title: Sampling Parallelism for Fast and Efficient Bayesian Learning
Title（参考訳）: 高速かつ効率的なベイズ学習のための並列性サンプリング
Authors: Asena Karolin Özdemir, Lars H. Heyen, Arvid Weyrauch, Achim Streit, Markus Götz, Charlotte Debus,
Abstract要約: 本稿では,サンプリングに基づくベイズ学習のボトルネックを狙った,シンプルながら強力な並列化戦略であるサンプリング並列化を導入する。実験では,サンプル数が計算資源に比例してスケールされた場合,ほぼ完全なスケーリングを示し,試料評価がクリーンに並列化することを確認した。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Machine learning models, and deep neural networks in particular, are increasingly deployed in risk-sensitive domains such as healthcare, environmental forecasting, and finance, where reliable quantification of predictive uncertainty is essential. However, many uncertainty quantification (UQ) methods remain difficult to apply due to their substantial computational cost. Sampling-based Bayesian learning approaches, such as Bayesian neural networks (BNNs), are particularly expensive since drawing and evaluating multiple parameter samples rapidly exhausts memory and compute resources. These constraints have limited the accessibility and exploration of Bayesian techniques thus far. To address these challenges, we introduce sampling parallelism, a simple yet powerful parallelization strategy that targets the primary bottleneck of sampling-based Bayesian learning: the samples themselves. By distributing sample evaluations across multiple GPUs, our method reduces memory pressure and training time without requiring architectural changes or extensive hyperparameter tuning. We detail the methodology and evaluate its performance on a few example tasks and architectures, comparing against distributed data parallelism (DDP) as a baseline. We further demonstrate that sampling parallelism is complementary to existing strategies by implementing a hybrid approach that combines sample and data parallelism. Our experiments show near-perfect scaling when the sample number is scaled proportionally to the computational resources, confirming that sample evaluations parallelize cleanly. Although DDP achieves better raw speedups under scaling with constant workload, sampling parallelism has a notable advantage: by applying independent stochastic augmentations to the same batch on each GPU, it increases augmentation diversity and thus reduces the number of epochs required for convergence.
Abstract（参考訳）: マシンラーニングモデルやディープニューラルネットワークは、医療、環境予知、金融といったリスクに敏感な領域にますます多くデプロイされており、予測の不確実性の信頼できる定量化が不可欠である。しかし、多くの不確実量化法(UQ)は、その計算コストが大きいため、適用が難しいままである。ベイズニューラルネットワーク(BNN)のようなサンプリングベースのベイズ学習アプローチは、複数のパラメータサンプルの描画と評価がメモリと計算資源を急速に消費するため、特に高価である。これらの制約は、これまでベイズ的手法のアクセシビリティと探索を制限してきた。これらの課題に対処するために,サンプリングに基づくベイズ学習の主なボトルネックであるサンプリング並列化戦略であるサンプリング並列化を導入する。複数のGPUにサンプル評価を分散することにより,アーキテクチャ変更や広範なハイパーパラメータチューニングを必要とせずに,メモリの圧力とトレーニング時間を削減できる。本稿では,分散データ並列処理(DDP)をベースラインとして,その方法論を詳述し,その性能評価を行う。さらに,サンプル並列化とデータ並列化を組み合わせたハイブリッドアプローチを導入することで,サンプリング並列化が既存の戦略を補完することを示す。実験では,サンプル数が計算資源に比例してスケールされた場合,ほぼ完全なスケーリングを示し,試料評価がクリーンに並列化することを確認した。 DDPは、一定のワークロードでスケールする際の生のスピードアップを改善するが、サンプリング並列性は顕著な利点がある: 各GPU上で同じバッチに独立確率拡張を適用することにより、増大の多様性が増大し、収束に必要なエポックの数が減少する。

論文の概要: Sampling Parallelism for Fast and Efficient Bayesian Learning

関連論文リスト