Fugu-MT 論文翻訳(概要): Task structure and nonlinearity jointly determine learned representational geometry

論文の概要: Task structure and nonlinearity jointly determine learned representational geometry

arxiv url: http://arxiv.org/abs/2401.13558v1
Date: Wed, 24 Jan 2024 16:14:38 GMT
ステータス: 翻訳完了
システム内更新日: 2024-01-25 14:06:27.995063
Title: Task structure and nonlinearity jointly determine learned representational geometry
Title（参考訳）: 学習表現幾何学を共同で決定するタスク構造と非線形性
Authors: Matteo Alleman, Jack W Lindsey, Stefano Fusi
Abstract要約: 本稿では,Tanhネットワークが対象出力の構造を反映した表現を学習する傾向を示し,ReLUネットワークは生入力の構造についてより多くの情報を保持することを示した。我々の研究結果は、入力出力幾何学、非線形性、ニューラルネットワークにおける学習表現との相互作用に光を当てた。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The utility of a learned neural representation depends on how well its geometry supports performance in downstream tasks. This geometry depends on the structure of the inputs, the structure of the target outputs, and the architecture of the network. By studying the learning dynamics of networks with one hidden layer, we discovered that the network's activation function has an unexpectedly strong impact on the representational geometry: Tanh networks tend to learn representations that reflect the structure of the target outputs, while ReLU networks retain more information about the structure of the raw inputs. This difference is consistently observed across a broad class of parameterized tasks in which we modulated the degree of alignment between the geometry of the task inputs and that of the task labels. We analyzed the learning dynamics in weight space and show how the differences between the networks with Tanh and ReLU nonlinearities arise from the asymmetric asymptotic behavior of ReLU, which leads feature neurons to specialize for different regions of input space. By contrast, feature neurons in Tanh networks tend to inherit the task label structure. Consequently, when the target outputs are low dimensional, Tanh networks generate neural representations that are more disentangled than those obtained with a ReLU nonlinearity. Our findings shed light on the interplay between input-output geometry, nonlinearity, and learned representations in neural networks.
Abstract（参考訳）: 学習したニューラルネットワーク表現の有用性は、そのジオメトリがダウンストリームタスクのパフォーマンスをどのようにサポートするかに依存する。この幾何学は、入力の構造、ターゲット出力の構造、ネットワークのアーキテクチャに依存する。 1つの隠れた層でネットワークの学習ダイナミクスを研究することで、ネットワークの活性化関数が表現幾何学に予期せぬほど強い影響を与えることが判明した: tanhネットワークは、ターゲット出力の構造を反映した表現を学習し、reluネットワークは生の入力の構造に関するより多くの情報を保持する傾向がある。この違いは、タスク入力の幾何とタスクラベルのアライメントの程度を変調する幅広いパラメータ化されたタスクのクラスを通して一貫して観察される。重み空間における学習ダイナミクスを分析し,tanh と relu の非線形性を持つネットワーク間の差異が relu の非対称漸近的挙動からどのように生じるかを示した。対照的に、Tanhネットワークの特徴ニューロンはタスクラベル構造を継承する傾向がある。その結果、ターゲット出力が低次元である場合、TanhネットワークはReLU非線形性で得られるものよりも絡み合った神経表現を生成する。本研究は,ニューラルネットワークにおける入出力幾何,非線形性,学習表現の相互作用に光を当てた。

関連論文リスト

Coding schemes in neural networks learning classification tasks [52.22978725954347]
完全接続型広義ニューラルネットワーク学習タスクについて検討する。ネットワークが強力なデータ依存機能を取得することを示す。驚くべきことに、内部表現の性質は神経の非線形性に大きく依存する。
論文参考訳（メタデータ） (2024-06-24T14:50:05Z)
Graph Metanetworks for Processing Diverse Neural Architectures [33.686728709734105]
Graph Metanetworks(GMN)は、競合するメソッドが苦労するニューラルネットワークに一般化する。 GMNは,入力ニューラルネットワーク関数を残したパラメータ置換対称性と等価であることを示す。
論文参考訳（メタデータ） (2023-12-07T18:21:52Z)
Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network Compression [28.851519959657466]
本稿では,ベクトル値ニューラルネットワークの解析のための新しい理論的枠組みを提案する。この研究の重要な貢献は、ベクトル値変動空間に対する表現定理の開発である。これらのベクトル値変動空間に関連するノルムは、複数のタスクに有用な特徴の学習を促進する。
論文参考訳（メタデータ） (2023-05-25T23:32:10Z)
Quasi-orthogonality and intrinsic dimensions as measures of learning and generalisation [55.80128181112308]
ニューラルネットワークの特徴空間の次元性と準直交性は、ネットワークの性能差別と共同して機能する可能性があることを示す。本研究は, ネットワークの最終的な性能と, ランダムに初期化された特徴空間の特性との関係を示唆する。
論文参考訳（メタデータ） (2022-03-30T21:47:32Z)
Data-driven emergence of convolutional structure in neural networks [83.4920717252233]
識別タスクを解くニューラルネットワークが、入力から直接畳み込み構造を学習できることを示す。データモデルを慎重に設計することにより、このパターンの出現は、入力の非ガウス的、高次局所構造によって引き起こされることを示す。
論文参考訳（メタデータ） (2022-02-01T17:11:13Z)
A neural anisotropic view of underspecification in deep learning [60.119023683371736]
ニューラルネットが問題の未特定化を扱う方法が,データ表現に大きく依存していることを示す。深層学習におけるアーキテクチャ的インダクティブバイアスの理解は,これらのシステムの公平性,堅牢性,一般化に対処する上で基本的であることを強調した。
論文参考訳（メタデータ） (2021-04-29T14:31:09Z)
Learning Connectivity of Neural Networks from a Topological Perspective [80.35103711638548]
本稿では,ネットワークを解析のための完全なグラフに表現するためのトポロジ的視点を提案する。接続の規模を反映したエッジに学習可能なパラメータを割り当てることにより、学習プロセスを異なる方法で行うことができる。この学習プロセスは既存のネットワークと互換性があり、より大きな検索空間と異なるタスクへの適応性を持っている。
論文参考訳（メタデータ） (2020-08-19T04:53:31Z)
Neural networks adapting to datasets: learning network size and topology [77.34726150561087]
ニューラルネットワークは、勾配に基づくトレーニングの過程で、そのサイズとトポロジの両方を学習できるフレキシブルなセットアップを導入します。結果として得られるネットワークは、特定の学習タスクとデータセットに合わせたグラフの構造を持つ。
論文参考訳（メタデータ） (2020-06-22T12:46:44Z)
Internal representation dynamics and geometry in recurrent neural networks [10.016265742591674]
本稿では,バニラRNNがネットワークのダイナミクスを解析することにより,単純な分類タスクを実現する方法を示す。初期の内部表現はデータの実際のラベルから解放されるが、この情報は出力層に直接アクセスできない。
論文参考訳（メタデータ） (2020-01-09T23:19:21Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。