Fugu-MT 論文翻訳(概要): Modular Deep Learning

論文の概要: Modular Deep Learning

arxiv url: http://arxiv.org/abs/2302.11529v1
Date: Wed, 22 Feb 2023 18:11:25 GMT
ステータス: 翻訳完了
システム内更新日: 2023-02-23 14:18:54.716499
Title: Modular Deep Learning
Title（参考訳）: モジュール型ディープラーニング
Authors: Jonas Pfeiffer, Sebastian Ruder, Ivan Vuli\'c, Edoardo Maria Ponti
Abstract要約: トランスファーラーニングは近年、機械学習の主要なパラダイムとなっている。負の干渉を伴わずに複数のタスクを専門とするモデルを開発する方法はまだ不明である。これらの課題に対する有望な解決策として、モジュール型ディープラーニングが登場した。
参考スコア（独自算出の注目度）: 93.79001031755476
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Transfer learning has recently become the dominant paradigm of machine learning. Pre-trained models fine-tuned for downstream tasks achieve better performance with fewer labelled examples. Nonetheless, it remains unclear how to develop models that specialise towards multiple tasks without incurring negative interference and that generalise systematically to non-identically distributed tasks. Modular deep learning has emerged as a promising solution to these challenges. In this framework, units of computation are often implemented as autonomous parameter-efficient modules. Information is conditionally routed to a subset of modules and subsequently aggregated. These properties enable positive transfer and systematic generalisation by separating computation from routing and updating modules locally. We offer a survey of modular architectures, providing a unified view over several threads of research that evolved independently in the scientific literature. Moreover, we explore various additional purposes of modularity, including scaling language models, causal inference, programme induction, and planning in reinforcement learning. Finally, we report various concrete applications where modularity has been successfully deployed such as cross-lingual and cross-modal knowledge transfer. Related talks and projects to this survey, are available at https://www.modulardeeplearning.com/.
Abstract（参考訳）: トランスファーラーニングは近年、機械学習の主要なパラダイムとなっている。下流タスク用に微調整された事前訓練されたモデルは、ラベル付き例を少なくしてより良いパフォーマンスを実現する。にもかかわらず、負の干渉を起こさずに複数のタスクに特化して、体系的に非同一の分散タスクに一般化するモデルを開発する方法はまだ不明である。モジュラーディープラーニングは、これらの課題に対する有望な解決策として現れてきた。このフレームワークでは、計算単位はしばしば自律的なパラメーター効率のモジュールとして実装される。情報は条件付きでモジュールのサブセットにルーティングされ、その後集約される。これらの特性は、計算をルーティングから分離し、モジュールをローカルに更新することで、正の転送と体系的な一般化を可能にする。我々はモジュラーアーキテクチャの調査を行い、科学文献の中で独立に進化したいくつかの研究のスレッドを統一した見解を提供する。さらに,言語モデルのスケーリング,因果推論,プログラム誘導,強化学習の計画など,モジュール性に関するさまざまな目的についても検討する。最後に,モジュール性が言語間やモーダル間の知識伝達など,様々な具体的な応用について報告する。この調査に関連する講演やプロジェクトは、https://www.modulardeeplearning.com/で見ることができる。

論文の概要: Modular Deep Learning

関連論文リスト