Fugu-MT 論文翻訳(概要): MONET: Modeling and Optimization of neural NEtwork Training from Edge to Data Centers

論文の概要: MONET: Modeling and Optimization of neural NEtwork Training from Edge to Data Centers

arxiv url: http://arxiv.org/abs/2603.15002v1
Date: Mon, 16 Mar 2026 09:06:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 18:28:57.896911
Title: MONET: Modeling and Optimization of neural NEtwork Training from Edge to Data Centers
Title（参考訳）: MONET:エッジからデータセンタへのニューラルネトワークトレーニングのモデリングと最適化
Authors: Jérémy Morlier, Robin Geens, Stef Cuyckens, Arne Symons, Marian Verhelst, Vincent Gripon, Mathieu Léonardon,
Abstract要約: MONETは、異種データフローアクセラレータ上でのニューラルネットワークのトレーニングをモデル化するためのフレームワークである。より広い設計空間のために、ニューラルネットワークトレーニングにおいてより複雑になる問題について検討する。当社のフレームワークは、遺伝的アルゴリズムの助けを借りて、アクティベーションチェックポイントにおける興味深いトレードオフを見つけるために使用しています。
参考スコア（独自算出の注目度）: 5.086004180978689
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While hardware-software co-design has significantly improved the efficiency of neural network inference, modeling the training phase remains a critical yet underexplored challenge. Training workloads impose distinct constraints, particularly regarding memory footprint and backpropagation complexity, which existing inference-focused tools fail to capture. This paper introduces MONET, a framework designed to model the training of neural networks on heterogeneous dataflow accelerators. MONET builds upon Stream, an experimentally verified framework that that models the inference of neural networks on heterogeneous dataflow accelerators with layer fusion. Using MONET, we explore the design space of ResNet-18 and a small GPT-2, demonstrating the framework's capability to model training workflows and find better hardware architectures. We then further examine problems that become more complex in neural network training due to the larger design space, such as determining the best layer-fusion configuration. Additionally, we use our framework to find interesting trade-offs in activation checkpointing, with the help of a genetic algorithm. Our findings highlight the importance of a holistic approach to hardware-software co-design for scalable and efficient deep learning deployment.
Abstract（参考訳）: ハードウェアとソフトウェアの共同設計は、ニューラルネットワーク推論の効率を大幅に向上させたが、トレーニングフェーズのモデル化は、依然として重要で未調査の課題である。トレーニングワークロードは、特にメモリフットプリントと、既存の推論中心のツールがキャプチャーに失敗するバックプロパゲーションの複雑さに関して、明確な制約を課している。本稿では、異種データフローアクセラレータ上でのニューラルネットワークのトレーニングをモデル化するフレームワークであるMONETを紹介する。 MONETは、層融合を伴う異種データフローアクセラレータ上でのニューラルネットワークの推論をモデル化する実験的に検証されたフレームワークであるStreamの上に構築されている。 MONETを用いて、ResNet-18と小さなGPT-2の設計空間を探索し、トレーニングワークフローをモデル化し、より良いハードウェアアーキテクチャを見つけるためのフレームワークの能力を実証する。さらに、最適な層融合構成を決定するなど、より大きな設計空間のために、ニューラルネットワークトレーニングにおいてより複雑になる問題についても検討する。さらに、我々のフレームワークを使用して、遺伝的アルゴリズムの助けを借りて、アクティベーションチェックポイントにおける興味深いトレードオフを見つけます。我々の発見は、スケーラブルで効率的なディープラーニングデプロイメントのためのハードウェアとソフトウェアの共同設計に対する全体論的アプローチの重要性を強調した。

論文の概要: MONET: Modeling and Optimization of neural NEtwork Training from Edge to Data Centers

関連論文リスト