Fugu-MT Paper Translation (Abstract): Efficient Hyperparameter Tuning via Trajectory Invariance Principle

Paper overview: Efficient Hyperparameter Tuning via Trajectory Invariance Principle

arxiv url: http://arxiv.org/abs/2509.25049v1
Date: Mon, 29 Sep 2025 17:01:19 GMT
Status: Translation complete
System update date: 2025-09-30 22:32:20.14831
Title: Efficient Hyperparameter Tuning via Trajectory Invariance Principle
Authors: Bingrui Li, Jiaxin Wen, Zhanpeng Zhou, Jun Zhu, Jianfei Chen
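Abstract summary: We identify a phenomenon called trajectory invariance, in which pre-training loss curves, gradient noise, and gradient norm closely overlap with respect to a quantity that combines learning rate and weight decay. This phenomenon effectively reduces the original two-dimensional hyperparameter space to one dimension, yielding an efficient tuning rule. Overall, this work proposes new principles for efficient tuning and inspires future research on scaling laws.
Reference score (independently computed attention level): 35.90572735438328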
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As hyperparameter tuning becomes increasingly costly at scale, efficient tuning methods are essential. Yet principles for guiding hyperparameter tuning remain limited. In this work, we seek to establish such principles by considering a broad range of hyperparameters, including batch size, learning rate, and weight decay. We identify a phenomenon we call trajectory invariance, where pre-training loss curves, gradient noise, and gradient norm exhibit invariance--closely overlapping--with respect to a quantity that combines learning rate and weight decay. This phenomenon effectively reduces the original two-dimensional hyperparameter space to one dimension, yielding an efficient tuning rule: follow the salient direction revealed by trajectory invariance. Furthermore, we refine previous scaling laws and challenge several existing viewpoints. Overall, our work proposes new principles for efficient tuning and inspires future research on scaling laws.
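The reduction from two dimensions to one can be illustrated with a small sketch. The abstract does not state the form of the combined quantity, so the code below assumes, purely for illustration, that it is the product of learning rate and weight decay. The function names `combined_quantity` and `group_by_combined` and the grid values are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from itertools import product
from math import log10


def combined_quantity(lr: float, wd: float) -> float:
    """Hypothetical combined quantity: the product lr * wd.

    This form is an assumption made for illustration only; the abstract
    does not specify how learning rate and weight decay are combined.
    """
    return lr * wd


def group_by_combined(lrs, wds, ndigits=6):
    """Group every (lr, wd) pair of a 2D grid by its combined quantity.

    Keys are log10 of the combined quantity, rounded so that pairs whose
    products differ only by floating-point error share one group.
    """
    groups = defaultdict(list)
    for lr, wd in product(lrs, wds):
        key = round(log10(combined_quantity(lr, wd)), ndigits)
        groups[key].append((lr, wd))
    return dict(groups)


# A 3 x 3 grid of (learning rate, weight decay) settings.
lrs = [1e-4, 1e-3, 1e-2]
wds = [1e-2, 1e-1, 1.0]
groups = group_by_combined(lrs, wds)

for key in sorted(groups):
    print(f"log10(lr * wd) = {key:+.1f}: {groups[key]}")
```

Under this assumption, settings in the same group would be expected to show closely overlapping trajectories, so a search could sample one representative per group instead of every cell of the grid. Whether the product is the right quantity for a given optimizer and setup is something to check against the paper itself.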

Related papers