Fugu-MT 論文翻訳(概要): AdaPtis: Reducing Pipeline Bubbles with Adaptive Pipeline Parallelism on Heterogeneous Models

論文の概要: AdaPtis: Reducing Pipeline Bubbles with Adaptive Pipeline Parallelism on Heterogeneous Models

arxiv url: http://arxiv.org/abs/2509.23722v1
Date: Sun, 28 Sep 2025 08:05:13 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.399325
Title: AdaPtis: Reducing Pipeline Bubbles with Adaptive Pipeline Parallelism on Heterogeneous Models
Title（参考訳）: AdaPtis: 異種モデル上での適応的なパイプライン並列化によるパイプラインバブル低減
Authors: Jihu Guo, Tenghui Ma, Wei Gao, Peng Sun, Jiaxing Li, Xun Chen, Yuyang Jin, Dahua Lin,
Abstract要約: AdaPtisは、適応パイプライン並列性をサポートする大規模言語モデル(LLM)のトレーニングシステムである。大規模な実験により、AdaPtisはMegatron-LM I-1F1Bよりも平均1.42倍(最大2.14倍)のスピードアップを達成した。
参考スコア（独自算出の注目度）: 59.7059443712562
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Pipeline parallelism is widely used to train large language models (LLMs). However, increasing heterogeneity in model architectures exacerbates pipeline bubbles, thereby reducing training efficiency. Existing approaches overlook the co-optimization of model partition, model placement, and workload scheduling, resulting in limited efficiency improvement or even performance degradation. To respond, we propose AdaPtis, an LLM training system that supports adaptive pipeline parallelism. First, we develop a pipeline performance model to accurately estimate training throughput. Second, AdaPtis jointly optimizes model partition, model placement, and workload scheduling policies guided by this performance model. Third, we design a unified pipeline executor that efficiently supports the execution of diverse pipeline strategies. Extensive experiments show that AdaPtis achieves an average speedup of 1.42x (up to 2.14x) over Megatron-LM I-1F1B across various LLM architectures and scales.
Abstract（参考訳）: パイプライン並列性は、大きな言語モデル(LLM)のトレーニングに広く使用されている。しかし、モデルアーキテクチャの不均一性の増加はパイプラインバブルを悪化させ、トレーニング効率を低下させる。既存のアプローチでは、モデルパーティション、モデル配置、ワークロードスケジューリングの共最適化を見落としている。そこで本研究では,適応的なパイプライン並列処理をサポートするLLMトレーニングシステムであるAdaPtisを提案する。まず,トレーニングスループットを正確に推定するパイプライン性能モデルを開発する。第2に、AdaPtisはモデルパーティション、モデル配置、およびこのパフォーマンスモデルによってガイドされるワークロードスケジューリングポリシーを共同で最適化する。第3に,多様なパイプライン戦略の実行を効率的にサポートする統合パイプラインエグゼキュータを設計する。大規模な実験により、AdaPtisはMegatron-LM I-1F1Bよりも平均1.42倍(最大2.14倍)のスピードアップを達成した。

論文の概要: AdaPtis: Reducing Pipeline Bubbles with Adaptive Pipeline Parallelism on Heterogeneous Models

関連論文リスト