Fugu-MT Paper Translation (Abstract): Cherry Hypothesis: Identifying the Cherry on the Cake for Dynamic Networks

Paper summary: Cherry Hypothesis: Identifying the Cherry on the Cake for Dynamic Networks

arxiv url: http://arxiv.org/abs/2211.05528v1
Date: Thu, 10 Nov 2022 12:42:43 GMT
Status: Translation complete
Last updated in system: 2022-11-11 15:39:52.040723
Title: Cherry Hypothesis: Identifying the Cherry on the Cake for Dynamic Networks
Title (reference translation): Cherry Hypothesis: Identification of the Cherry on the Cake for Dynamic Networks
Authors: Shwai He, Liang Ding, Daize Dong, Boan Liu, Fuqiang Yu, Dacheng Tao
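Abstract summary: The common practice is to convert static layers into fully dynamic layers in which all parameters are dynamic and vary with the input. Such a fully dynamic setting may cause redundant parameters and high deployment costs. We propose a brain-inspired partially dynamic network, namely PAD-Net, to transform redundant dynamic parameters into static ones.
Reference score (independently computed attention measure): 72.85480289152719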
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Dynamic networks have been extensively explored as they can considerably improve the model's representation power with acceptable computational cost. The common practice in implementing dynamic networks is to convert given static layers into fully dynamic ones where all parameters are dynamic and vary with the input. Recent studies empirically show the trend that the more dynamic layers contribute to ever-increasing performance. However, such a fully dynamic setting 1) may cause redundant parameters and high deployment costs, limiting the applicability of dynamic networks to a broader range of tasks and models, and more importantly, 2) contradicts the previous discovery in the human brain that "when human brains process an attention-demanding task, only partial neurons in the task-specific areas are activated by the input, while the rest neurons leave in a baseline state." Critically, there is no effort to understand and resolve the above contradictory finding, leaving the primal question -- to make the computational parameters fully dynamic or not? -- unanswered. The main contributions of our work are challenging the basic commonsense in dynamic networks, and, proposing and validating the Cherry Hypothesis -- "A fully dynamic network contains a subset of dynamic parameters that when transforming other dynamic parameters into static ones, can maintain or even exceed the performance of the original network." Technically, we propose a brain-inspired partially dynamic network, namely PAD-Net, to transform the redundant dynamic parameters into static ones. Also, we further design Iterative Mode Partition to partition the dynamic- and static-subnet, which alleviates the redundancy in traditional fully dynamic networks. Our hypothesis and method are comprehensively supported by large-scale experiments with typical advanced dynamic methods.
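Abstract (reference translation): Dynamic networks have been widely studied because they can substantially improve a model's representation power at an acceptable computational cost. The common practice for implementing dynamic networks is to convert static layers into fully dynamic layers in which all parameters are dynamic and vary with the input. Recent studies empirically show a trend in which more dynamic layers contribute to improved performance. However, such a fully dynamic setting 1) may cause redundant parameters and high deployment costs, limiting the applicability of dynamic networks to a broader range of tasks and models, and 2) contradicts an earlier finding about the human brain: when the brain processes an attention-demanding task, only some of the neurons in the task-specific areas are activated by the input, while the remaining neurons stay in a baseline state. Importantly, no effort has been made to understand and resolve this contradiction, which leaves the fundamental question, whether or not to make the computational parameters fully dynamic, unanswered. The main contributions of our work are to challenge the basic common sense of dynamic networks and to propose and validate the Cherry Hypothesis: a fully dynamic network contains a subset of dynamic parameters such that, when the other dynamic parameters are transformed into static ones, the performance of the original network can be maintained or even exceeded. Technically, we propose a brain-inspired partially dynamic network, namely PAD-Net, to transform redundant dynamic parameters into static ones. We also design Iterative Mode Partition to partition the dynamic and static subnets, which alleviates the redundancy of conventional fully dynamic networks. Our hypothesis and method are comprehensively supported by large-scale experiments with typical dynamic methods.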
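
To make the idea of a partially dynamic layer concrete, here is a minimal pure-Python sketch. It is not the authors' implementation. The abstract does not say how dynamic parameters are computed or how Iterative Mode Partition chooses the dynamic subset, so everything below is an assumption made for illustration: the dynamic weights are a softmax-gated mixture of expert matrices, the binary `mask` is set by hand, and all function and variable names are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_weight(x, experts, router):
    """Fully dynamic weight: an input-dependent mixture of expert matrices.

    Assumption: gate k is softmax(router[k] . x). This is one common way to
    build dynamic parameters, not necessarily the one used in the paper.
    """
    scores = [sum(r * v for r, v in zip(row, x)) for row in router]
    gates = softmax(scores)
    n_rows, n_cols = len(experts[0]), len(experts[0][0])
    return [
        [sum(g * e[i][j] for g, e in zip(gates, experts)) for j in range(n_cols)]
        for i in range(n_rows)
    ]


def partially_dynamic_weight(x, experts, router, static_w, mask):
    """Use the dynamic entry where mask is 1 and the static entry where it is 0."""
    dyn = dynamic_weight(x, experts, router)
    return [
        [dyn[i][j] if mask[i][j] else static_w[i][j] for j in range(len(dyn[i]))]
        for i in range(len(dyn))
    ]


def linear(w, x):
    """Plain matrix-vector product y = W x."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]


# Toy setup: two 2x2 experts, a hand-picked mask keeping one entry dynamic.
experts = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]]]
router = [[1.0, 0.0], [0.0, 1.0]]
static_w = [[0.5, 0.5], [0.5, 0.5]]
mask = [[1, 0], [0, 0]]  # hypothetical partition, chosen by hand

x_a, x_b = [2.0, 0.0], [0.0, 2.0]
w_a = partially_dynamic_weight(x_a, experts, router, static_w, mask)
w_b = partially_dynamic_weight(x_b, experts, router, static_w, mask)
y_a, y_b = linear(w_a, x_a), linear(w_b, x_b)
```

In this toy setup only the entry at position (0, 0) changes between `w_a` and `w_b`; the three masked-out entries stay at their static values whatever the input is. That is the sense in which a partially dynamic layer carries fewer input-dependent parameters than a fully dynamic one.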

Related papers