Fugu-MT 論文翻訳(概要): GPU Memory Requirement Prediction for Deep Learning Task Based on Bidirectional Gated Recurrent Unit Optimization Transformer

論文の概要: GPU Memory Requirement Prediction for Deep Learning Task Based on Bidirectional Gated Recurrent Unit Optimization Transformer

arxiv url: http://arxiv.org/abs/2510.20985v1
Date: Thu, 23 Oct 2025 20:20:35 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 09:00:15.315067
Title: GPU Memory Requirement Prediction for Deep Learning Task Based on Bidirectional Gated Recurrent Unit Optimization Transformer
Title（参考訳）: 双方向Gated Recurrent Unit Optimization Transformerに基づくディープラーニングタスクのGPUメモリ要求予測
Authors: Chao Wang, Zhizhao Wen, Ruoxin Zhang, Puyang Xu, Yifan Jiang,
Abstract要約: 本稿では,双方向ゲートリカレントユニット(BiGRU)を統合し,トランスフォーマーアーキテクチャを最適化するディープラーニングモデルを提案する。平均絶対誤差(MAE)と決定係数(R2)の指標では、モデルも良好に動作し、結果はバランスよく安定している。予測精度は従来の機械学習手法に比べて大幅に改善されている。
参考スコア（独自算出の注目度）: 6.443211386140362
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In response to the increasingly critical demand for accurate prediction of GPU memory resources in deep learning tasks, this paper deeply analyzes the current research status and innovatively proposes a deep learning model that integrates bidirectional gated recurrent units (BiGRU) to optimize the Transformer architecture, aiming to improve the accuracy of memory demand prediction. To verify the effectiveness of the model, a carefully designed comparative experiment was conducted, selecting four representative basic machine learning models: decision tree, random forest, Adaboost, and XGBoost as benchmarks. The detailed experimental results show that the BiGRU Transformer optimization model proposed in this paper exhibits significant advantages in key evaluation indicators: in terms of mean square error (MSE) and root mean square error (RMSE), the model achieves the lowest value among all comparison models, and its predicted results have the smallest deviation from the actual values; In terms of mean absolute error (MAE) and coefficient of determination (R2) indicators, the model also performs well and the results are balanced and stable, with comprehensive predictive performance far exceeding the benchmark machine learning methods compared. In summary, the Transformer model based on bidirectional gated recurrent unit optimization successfully constructed in this study can efficiently and accurately complete GPU memory demand prediction tasks in deep learning tasks, and its prediction accuracy has been significantly improved compared to traditional machine learning methods. This research provides strong technical support and reliable theoretical basis for optimizing resource scheduling and management of deep learning tasks, and improving the utilization efficiency of computing clusters.
Abstract（参考訳）: ディープラーニングタスクにおけるGPUメモリリソースの正確な予測に対する要求がますます高まる中で,本論文では,現在の研究状況を深く分析し,双方向ゲートリカレントユニット(BiGRU)を統合してTransformerアーキテクチャを最適化する深層学習モデルを提案する。モデルの有効性を検証するため、慎重に設計された比較実験を行い、決定木、ランダムフォレスト、アダブースト、XGBoostの4つの基本的な機械学習モデルをベンチマークとして選択した。本稿では, 平均二乗誤差 (MSE) とルート平均二乗誤差 (RMSE) において, 平均二乗誤差 (RMSE) において, 平均三乗誤差 (MAE) と決定係数 (R2) との差が最小であり, 平均三乗誤差 (MAE) と決定係数 (R2) の差が最小であり, モデルの性能も良好であり, 予測性能はベンチマーク機械学習手法をはるかに上回った。要約して,本研究で成功した双方向ゲート再帰単位最適化に基づくトランスフォーマーモデルは,ディープラーニングタスクにおけるGPUメモリ要求予測タスクを効率的かつ正確に完了させることができ,従来の機械学習手法と比較して予測精度が大幅に向上した。本研究は、ディープラーニングタスクのリソーススケジューリングと管理を最適化し、計算クラスタの利用効率を向上させるための、強力な技術的支援と信頼性のある理論的基盤を提供する。

論文の概要: GPU Memory Requirement Prediction for Deep Learning Task Based on Bidirectional Gated Recurrent Unit Optimization Transformer

関連論文リスト