Fugu-MT 論文翻訳(概要): Evaluating Large Language Models for Workload Mapping and Scheduling in Heterogeneous HPC Systems

論文の概要: Evaluating Large Language Models for Workload Mapping and Scheduling in Heterogeneous HPC Systems

arxiv url: http://arxiv.org/abs/2511.11612v1
Date: Tue, 04 Nov 2025 03:04:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-23 18:31:12.296019
Title: Evaluating Large Language Models for Workload Mapping and Scheduling in Heterogeneous HPC Systems
Title（参考訳）: 不均一HPCシステムにおけるワークロードマッピングとスケジューリングのための大規模言語モデルの評価
Authors: Aasish Kumar Sharma, Julian Kunkel,
Abstract要約: 大規模言語モデル(LLM)は、その推論能力のためにますます研究されている。本研究は、代表的ワークロードマッピングとスケジューリング問題に基づいて、利用可能なLLMを21個評価する。
参考スコア（独自算出の注目度）: 0.03819617852128932
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) are increasingly explored for their reasoning capabilities, yet their ability to perform structured, constraint-based optimization from natural language remains insufficiently understood. This study evaluates twenty-one publicly available LLMs on a representative heterogeneous high-performance computing (HPC) workload mapping and scheduling problem. Each model received the same textual description of system nodes, task requirements, and scheduling constraints, and was required to assign tasks to nodes, compute the total makespan, and explain its reasoning. A manually derived analytical optimum of nine hours and twenty seconds served as the ground truth reference. Three models exactly reproduced the analytical optimum while satisfying all constraints, twelve achieved near-optimal results within two minutes of the reference, and six produced suboptimal schedules with arithmetic or dependency errors. All models generated feasible task-to-node mappings, though only about half maintained strict constraint adherence. Nineteen models produced partially executable verification code, and eighteen provided coherent step-by-step reasoning, demonstrating strong interpretability even when logical errors occurred. Overall, the results define the current capability boundary of LLM reasoning in combinatorial optimization: leading models can reconstruct optimal schedules directly from natural language, but most still struggle with precise timing, data transfer arithmetic, and dependency enforcement. These findings highlight the potential of LLMs as explainable co-pilots for optimization and decision-support tasks rather than autonomous solvers.
Abstract（参考訳）: 大規模言語モデル(LLM)はその推論能力について研究が進んでいるが、自然言語から構造化された制約ベースの最適化を行う能力はまだ十分に理解されていない。本研究では,代表的ヘテロジニアスハイパフォーマンスコンピューティング(HPC)のワークロードマッピングとスケジューリング問題に対して,利用可能なLLMを21個評価する。各モデルは、システムノード、タスク要求、スケジューリングの制約に関する同じテキスト記述を受け取り、ノードにタスクを割り当て、全体のメースパンを計算し、その推論を説明する必要があった。 9時間20秒という手作業による分析の最適化が、根拠となる真理の基準となった。 3つのモデルでは、全ての制約を満たしながら解析的最適を正確に再現し、基準から2分以内に12のほぼ最適結果を得た。すべてのモデルで実行可能なタスク・ツー・ノードマッピングが生成されたが、厳密な制約順守は半分程度しか維持されなかった。 19のモデルが部分的に実行可能な検証コードを作成し、18のモデルが一貫性のあるステップ・バイ・ステップの推論を提供し、論理的誤りが発生した場合でも強い解釈可能性を示した。先行モデルでは、自然言語から直接最適なスケジュールを再構築することができるが、ほとんどの場合、正確なタイミング、データ転送演算、依存性の強制に苦慮している。これらの結果は、自律的解法よりも最適化と意思決定支援のための説明可能なコパイロットとしてのLCMの可能性を強調した。

論文の概要: Evaluating Large Language Models for Workload Mapping and Scheduling in Heterogeneous HPC Systems

関連論文リスト