Fugu-MT 論文翻訳(概要): Automatic Stability and Recovery for Neural Network Training

論文の概要: Automatic Stability and Recovery for Neural Network Training

arxiv url: http://arxiv.org/abs/2601.17483v1
Date: Sat, 24 Jan 2026 15:14:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-27 15:23:07.8119
Title: Automatic Stability and Recovery for Neural Network Training
Title（参考訳）: ニューラルネットワークトレーニングのための自動安定性と回復
Authors: Barak Or,
Abstract要約: 現代のニューラルネットワークのトレーニングはますます脆弱になり、まれだが深刻な不安定な更新は、しばしば不可逆的なばらつきやサイレントな劣化を引き起こす。既存の最適化手法は、安定性プローブに埋め込まれた防止機構に依存しており、不安定性を検出して回復する能力に制限がある。最適化を制御実行プロセスとして扱う監視フレームワークを導入する。
参考スコア（独自算出の注目度）: 1.9544213396776273
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Training modern neural networks is increasingly fragile, with rare but severe destabilizing updates often causing irreversible divergence or silent performance degradation. Existing optimization methods primarily rely on preventive mechanisms embedded within the optimizer, offering limited ability to detect and recover from instability once it occurs. We introduce a supervisory runtime stability framework that treats optimization as a controlled stochastic process. By isolating an innovation signal derived from secondary measurements, such as validation probes, the framework enables automatic detection and recovery from destabilizing updates without modifying the underlying optimizer. We provide theoretical runtime safety guarantees that formalize bounded degradation and recovery. Our implementation incurs minimal overhead and is compatible with memory-constrained training settings.
Abstract（参考訳）: 現代のニューラルネットワークのトレーニングは、希少ながら深刻な不安定な更新によって、不可逆的なばらつきやサイレントなパフォーマンス低下を引き起こしているため、ますます脆弱になっている。既存の最適化手法は、主にオプティマイザに埋め込まれた防止機構に依存しており、一度不安定性を検知し、回復する能力に制限がある。本稿では,最適化を制御確率過程として扱うスーパーバイザリーランタイム安定化フレームワークを提案する。このフレームワークは、バリデーションプローブなどの二次測定から導かれるイノベーション信号を分離することにより、基盤となるオプティマイザを変更することなく、更新の不安定化から自動検出とリカバリを可能にする。境界劣化と回復を形式化する理論的ランタイム安全保証を提供する。我々の実装は最小限のオーバーヘッドを発生させ、メモリ制約のあるトレーニング設定と互換性がある。

論文の概要: Automatic Stability and Recovery for Neural Network Training

関連論文リスト