Fugu-MT 論文翻訳(概要): Cumulative-Goodness Free-Riding in Forward-Forward Networks: Real, Repairable, but Not Accuracy-Dominant

論文の概要: Cumulative-Goodness Free-Riding in Forward-Forward Networks: Real, Repairable, but Not Accuracy-Dominant

arxiv url: http://arxiv.org/abs/2605.06240v1
Date: Thu, 07 May 2026 13:24:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-08 22:27:11.842759
Title: Cumulative-Goodness Free-Riding in Forward-Forward Networks: Real, Repairable, but Not Accuracy-Dominant
Title（参考訳）: 前向きネットワークにおける累積ゴドネスフリーライディング:現実的, 修復可能, 正確性に乏しい
Authors: Amirhossein Yousefiramandi,
Abstract要約: フォワード・フォワード(FF)トレーニングでは、各レイヤが局所的な善良さの基準から学ぶことができる。累積・良さのバリエーションでは、後続のレイヤは、以前のレイヤがすでに部分的に分離したタスクを継承することができる。我々はこの現象を層フリーライディングとして定式化する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Forward-Forward (FF) training allows each layer to learn from a local goodness criterion. In cumulative-goodness variants, however, later layers can inherit a task that earlier layers have already partially separated. We formalize this phenomenon as layer free-riding: under the softplus FF criterion, the class-discrimination gradient reaching block $d$ decays exponentially with the positive margin accumulated by preceding blocks. We then study three local remedies -- per-block, hardness-gated, and depth-scaled -- that recover current-layer separation measures without relying on backpropagated gradients. On CIFAR-10 and CIFAR-100, these remedies dramatically improve layer-separation statistics, with $4\times$--$45\times$ gains in deeper layers, while changing accuracy by less than one percentage point for non-degenerate training procedures. Tiny ImageNet provides a tougher cross-dataset check for our selected block-wise configuration and reveals the same qualitative gap between layer-health diagnostics and final accuracy. Calibration experiments further show that architecture and augmentation choices have a larger effect on final accuracy than the training-rule modifications studied here. Cumulative free-riding is therefore a real and repairable optimization pathology. Nonetheless, for the FF training rules, architectures, and datasets we study, it is not the dominant factor limiting achievable accuracy.
Abstract（参考訳）: フォワード・フォワード(FF)トレーニングでは、各レイヤが局所的な善良さの基準から学ぶことができる。しかし累積的・良さのバリエーションでは、後続のレイヤは、以前のレイヤがすでに部分的に分離したタスクを継承することができる。我々はこの現象を層フリーライディングとして定式化する: ソフトプラスFF基準の下では、クラス判別勾配がブロック$d$崩壊に指数関数的に到達し、前のブロックで蓄積された正のマージンが指数関数的に減少する。次に、バックプロパゲート勾配に頼ることなく、電流層分離対策を回復する3つの局所的対策(ブロックごと、硬度、深さスケール)について検討する。 CIFAR-10とCIFAR-100では、これらの治療法は階層分離統計を劇的に改善し、より深い層では4/times$-45\times$ゲインを、非退化トレーニング手順では1ポイント未満の精度で変更した。 Tiny ImageNetは、選択したブロックワイズ構成に対するより厳しいクロスデータセットチェックを提供し、レイヤーヘルス診断と最終的な精度の質的なギャップを明らかにします。校正実験は、アーキテクチャと拡張の選択が、ここで研究されたトレーニングルールの修正よりも最終的な精度により大きな影響を及ぼすことを示した。したがって、累積フリーライディングは真の、そして修復可能な最適化パスロジーである。それでも、私たちが研究しているFFトレーニングルール、アーキテクチャ、データセットは、達成可能な精度を制限する主要な要因ではありません。

論文の概要: Cumulative-Goodness Free-Riding in Forward-Forward Networks: Real, Repairable, but Not Accuracy-Dominant

関連論文リスト