Paper summary: Partial Search in a Frozen Network is Enough to Find a Strong Lottery
- arxiv url: http://arxiv.org/abs/2402.14029v1
- Date: Tue, 20 Feb 2024 03:14:45 GMT
- Status: processing complete
- Last updated in system: 2024-02-23 17:30:16.598870
- Title: Partial Search in a Frozen Network is Enough to Find a Strong Lottery
- Title (reference translation): Partial search in a frozen network is sufficient to find a strong lottery ticket
- Authors: Hikari Otsuka, Daiki Chijiwa, Ángel López García-Arias, Yasuyuki
Okoshi, Kazushi Kawamura, Thiem Van Chu, Daichi Fujiki, Susumu Takeuchi,
Masato Motomura
- Abstract summary: Randomly initialized dense networks contain subnetworks that achieve high accuracy without weight learning -- strong lottery tickets (SLTs).
- Reference score (independently computed attention measure): 4.296242531100888
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Randomly initialized dense networks contain subnetworks that achieve high
accuracy without weight learning -- strong lottery tickets (SLTs). Recently,
Gadhikar et al. (2023) demonstrated theoretically and experimentally that SLTs
can also be found within a randomly pruned source network, thus reducing the
SLT search space. However, this limits the search to SLTs that are even sparser
than the source, leading to worse accuracy due to unintentionally high
sparsity. This paper proposes a method that reduces the SLT search space by an
arbitrary ratio that is independent of the desired SLT sparsity. A random
subset of the initial weights is excluded from the search space by freezing it
-- i.e., by either permanently pruning them or locking them as a fixed part of
the SLT. Indeed, the SLT existence in such a reduced search space is
theoretically guaranteed by our subset-sum approximation with randomly frozen
variables. In addition to reducing search space, the random freezing pattern
can also be exploited to reduce model size in inference. Furthermore,
experimental results show that the proposed method finds SLTs with better
accuracy and model size trade-off than the SLTs obtained from dense or randomly
pruned source networks. In particular, the SLT found in a frozen graph neural
network achieves higher accuracy than its weight trained counterpart while
reducing model size by $40.3\times$.
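The freezing idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `make_freeze_pattern` and `select_mask`, the ratios, and the use of top-k selection over placeholder scores as the "search" step are all assumptions made for illustration, since the abstract does not specify the search algorithm. The sketch only shows how a random freezing pattern (some weights permanently pruned, some permanently locked into the SLT) shrinks the search space independently of the target SLT sparsity.

```python
import numpy as np

# Pattern codes (hypothetical convention for this sketch):
#   0 = frozen as pruned, 1 = frozen as locked (always kept), -1 = searchable
def make_freeze_pattern(shape, prune_ratio, lock_ratio, rng):
    """Randomly freeze a subset of weights as pruned or locked."""
    n = int(np.prod(shape))
    n_prune = int(round(prune_ratio * n))
    n_lock = int(round(lock_ratio * n))
    pattern = np.full(n, -1, dtype=np.int8)
    perm = rng.permutation(n)
    pattern[perm[:n_prune]] = 0                  # permanently pruned
    pattern[perm[n_prune:n_prune + n_lock]] = 1  # permanently part of the SLT
    return pattern.reshape(shape)

def select_mask(scores, pattern, target_sparsity):
    """Build a binary mask with the target overall sparsity.

    Frozen entries are fixed by the pattern; only searchable entries are
    chosen, here by top-k over scores (an assumed stand-in for the search).
    """
    n = pattern.size
    n_keep_total = n - int(round(target_sparsity * n))
    flat_pattern = pattern.ravel()
    n_locked = int((flat_pattern == 1).sum())
    free_idx = np.flatnonzero(flat_pattern == -1)
    k = n_keep_total - n_locked  # weights still to pick from the search space
    if not 0 <= k <= free_idx.size:
        raise ValueError("target sparsity is incompatible with the freeze pattern")
    mask = (flat_pattern == 1).astype(np.int8)
    if k > 0:
        order = np.argsort(scores.ravel()[free_idx])
        mask[free_idx[order[-k:]]] = 1
    return mask.reshape(pattern.shape)

rng = np.random.default_rng(0)
weights = rng.standard_normal((4, 8))          # random, never-trained weights
pattern = make_freeze_pattern(weights.shape, prune_ratio=0.25, lock_ratio=0.25, rng=rng)
scores = rng.random(weights.shape)             # placeholder for learned scores
mask = select_mask(scores, pattern, target_sparsity=0.5)
subnetwork = weights * mask                    # candidate SLT weights
```

In this example half of the 32 weights are frozen, so the search covers only 16 entries, while the target sparsity of 0.5 is set separately. Because the frozen pattern is random, it could in principle be regenerated from a seed rather than stored, which is one way to read the abstract's remark about reducing model size at inference.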
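- On the Sparsity of the Strong Lottery Ticket Hypothesis [8.47014750905382]
Recent work has shown that a random neural network $N$ contains subnetworks that can accurately approximate an arbitrary neural network.
We provide the first proof of the Strong Lottery Ticket Hypothesis in classical settings.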
Paper / reference translation (metadata) (2024-10-18T06:57:37Z)
- ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment [95.04504362111314]
Paper / reference translation (metadata) (2023-12-11T22:44:05Z)
- Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost [28.70692607078139]
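Paper / reference translation (metadata) (2022-08-23T09:50:55Z)
- Not All Lotteries Are Made Equal [0.0]
This work investigates the relationship between model size and the ease of finding these sparse subnetworks.
Surprisingly, we show through experiments that, under a finite budget, smaller models benefit more from Ticket Search (TS).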
Paper / reference translation (metadata) (2022-06-16T13:41:36Z)
- Dual Lottery Ticket Hypothesis [71.95937879869334]
The Lottery Ticket Hypothesis (LTH) provides a new perspective for investigating sparse network training while maintaining its capacity.
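Paper / reference translation (metadata) (2022-03-08T18:06:26Z)
- Efficient Lottery Ticket Finding: Less Data is More [87.13642800792077]
The lottery ticket hypothesis (LTH) reveals the existence of winning tickets (sparse but critical) in dense networks.
Finding winning tickets requires burdensome computation in the train-prune-retrain process.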
Paper / reference translation (metadata) (2021-06-06T19:58:17Z)
- Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network [13.193734014710582]
Paper / reference translation (metadata) (2021-03-17T00:31:24Z)
- Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch [75.69506249886622]
Paper / reference translation (metadata) (2021-02-08T05:55:47Z)
- Good Students Play Big Lottery Better [84.6111281091602]
This paper proposes a method for retraining subnetworks called KD tickets (Knowledge Distillation tickets).
Paper / reference translation (metadata) (2021-01-08T23:33:53Z)
- Proving the Lottery Ticket Hypothesis: Pruning is All You Need [56.25432563818297]
Paper / reference translation (metadata) (2020-02-03T07:23:11Z)