Fugu-MT 論文翻訳(概要): Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach

論文の概要: Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach

arxiv url: http://arxiv.org/abs/2604.09716v1
Date: Wed, 08 Apr 2026 12:41:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:15.625177
Title: Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach
Title（参考訳）: ダイナミックシステムアプローチによる損失と正確性を超えた深層視覚ネットワークのトレーニング
Authors: Hai La Quang, Hassan Ugail, Newton Howard, Cong Tran Tien, Nam Vu Hoai, Hung Nguyen Viet,
Abstract要約: 本稿では,力学系のレンズを用いてトレーニングを行うことにより,その過程を補完的に研究する手法を提案する。トレーニングエポック間で収集されたレイヤアクティベーションから,3つの尺度を定義する。このフレームワークをモデルアーキテクチャとデータセットの9つの組み合わせに適用する。
参考スコア（独自算出の注目度）: 0.6229352230128421
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Deep visual recognition models are usually trained and evaluated using metrics such as loss and accuracy. While these measures show whether a model is improving, they reveal very little about how its internal representations change during training. This paper introduces a complementary way to study that process by examining training through the lens of dynamical systems. Drawing on ideas from signal analysis originally used to study biological neural activity, we define three measures from layer activations collected across training epochs: an integration score that reflects long-range coordination across layers, a metastability score that captures how flexibly the network shifts between more and less synchronised states, and a combined dynamical stability index. We apply this framework to nine combinations of model architecture and dataset, including several ResNet variants, DenseNet-121, MobileNetV2, VGG-16, and a pretrained Vision Transformer on CIFAR-10 and CIFAR-100. The results suggest three main patterns. First, the integration measure consistently distinguishes the easier CIFAR-10 setting from the more difficult CIFAR-100 setting. Second, changes in the volatility of the stability index may provide an early sign of convergence before accuracy fully plateaus. Third, the relationship between integration and metastability appears to reflect different styles of training behaviour. Overall, this study offers an exploratory but promising new way to understand deep visual training beyond loss and accuracy.
Abstract（参考訳）: ディープ・ヴィジュアライゼーション・モデルは通常、損失や精度などの指標を用いて訓練され、評価される。これらの尺度は、モデルが改善されているかどうかを示しているが、トレーニング中に内部表現がどのように変化するかについてはほとんど明らかにしていない。本稿では,力学系のレンズを用いてトレーニングを行うことにより,その過程を補完的に研究する手法を提案する。生体神経活動の研究に使用された信号分析のアイデアに基づいて、我々は、トレーニングエポック間で収集されたレイヤアクティベーションから、層間の長距離調整を反映する統合スコア、ネットワークがより多くの同期状態間でどのように柔軟にシフトするかを捉えたメタスタビリティスコア、そして、組み合わせた動的安定性指数の3つの尺度を定義した。このフレームワークは、DenseNet-121、MobileNetV2、VGG-16、CIFAR-10およびCIFAR-100上の事前訓練されたビジョン変換器など、モデルアーキテクチャとデータセットの9つの組み合わせに適用する。結果は3つの主要なパターンを示唆している。第1に、CIFAR-10設定がより難しいCIFAR-100設定と一貫して区別される。第二に、安定性指数のボラティリティの変化は、完全なプラトースの前に収束の早期兆候を与える可能性がある。第三に、統合性とメタスタビリティの関係は、トレーニング行動の異なるスタイルを反映しているように見える。全体として、この研究は、損失と正確性を超えた深い視覚トレーニングを理解するための、探索的だが有望な新しい方法を提供する。

論文の概要: Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach

関連論文リスト