Fugu-MT 論文翻訳(概要): Juvenile state hypothesis: What we can learn from lottery ticket hypothesis researches?

論文の概要: Juvenile state hypothesis: What we can learn from lottery ticket hypothesis researches?

arxiv url: http://arxiv.org/abs/2109.03862v1
Date: Wed, 8 Sep 2021 18:22:00 GMT
ステータス: 翻訳完了
システム内更新日: 2021-09-11 10:26:06.325137
Title: Juvenile state hypothesis: What we can learn from lottery ticket hypothesis researches?
Title（参考訳）: 少年国家仮説:宝くじ券仮説の研究から何が学べるのか?
Authors: Di Zhang
Abstract要約: オリジナルの抽選チケット仮説は、トレーニング収束後のプルーニングとウェイトリセットを実行する。本稿では,ニューラルネットワーク構造探索とプルーニングアルゴリズムを併用してこの問題を緩和する戦略を提案する。
参考スコア（独自算出の注目度）: 1.701869491238765
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The proposition of lottery ticket hypothesis revealed the relationship between network structure and initialization parameters and the learning potential of neural networks. The original lottery ticket hypothesis performs pruning and weight resetting after training convergence, exposing it to the problem of forgotten learning knowledge and potential high cost of training. Therefore, we propose a strategy that combines the idea of neural network structure search with a pruning algorithm to alleviate this problem. This algorithm searches and extends the network structure on existing winning ticket sub-network to producing new winning ticket recursively. This allows the training and pruning process to continue without compromising performance. A new winning ticket sub-network with deeper network structure, better generalization ability and better test performance can be obtained in this recursive manner. This method can solve: the difficulty of training or performance degradation of the sub-networks after pruning, the forgetting of the weights of the original lottery ticket hypothesis and the difficulty of generating winning ticket sub-network when the final network structure is not given. We validate this strategy on the MNIST and CIFAR-10 datasets. And after relating it to similar biological phenomena and relevant lottery ticket hypothesis studies in recent years, we will further propose a new hypothesis to discuss which factors that can keep a network juvenile, i.e., those possible factors that influence the learning potential or generalization performance of a neural network during training.
Abstract（参考訳）: 抽選券仮説の提案は、ネットワーク構造と初期化パラメータとニューラルネットワークの学習ポテンシャルの関係を明らかにした。オリジナルの抽選券仮説は、トレーニング収束後のプルーニングとウェイトリセットを行い、忘れられた学習知識の問題と潜在的な訓練コストに公開する。そこで本稿では,ニューラルネットワーク構造探索とプルーニングアルゴリズムを併用してこの問題を緩和する戦略を提案する。このアルゴリズムは、既存の入賞券サブネットワーク上のネットワーク構造を検索して拡張し、新たな入賞券を再帰的に生成する。これにより、トレーニングとプルーニングプロセスがパフォーマンスを損なうことなく継続できる。この再帰的な方法で、より深いネットワーク構造、より良い一般化能力、より良いテスト性能を有する新しい入賞切符サブネットワークを得ることができる。プルーニング後のサブネットワークの訓練や性能低下の難しさ、元の抽選券仮説の重み付けの忘れ、最終ネットワーク構造が与えられていない場合の当選券サブネットワークの生成の難しさを解決できる。 MNISTとCIFAR-10データセットでこの戦略を検証する。そして,近年の同様の生物学的現象と関連する宝くじの仮説を関連づけた上で,ネットワークの幼少期を維持する要因,すなわちトレーニング中のニューラルネットワークの学習可能性や一般化性能に影響を与える要因について議論する新たな仮説を提案する。

論文の概要: Juvenile state hypothesis: What we can learn from lottery ticket hypothesis researches?

関連論文リスト