Fugu-MT 論文翻訳(概要): Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language

論文の概要: Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language

arxiv url: http://arxiv.org/abs/2210.00482v2
Date: Wed, 5 Oct 2022 21:14:46 GMT
ステータス: 翻訳完了
システム内更新日: 2022-10-07 14:50:05.165129
Title: Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language
Title（参考訳）: 教師なし構成表現学習における構成一般化 : 絡み合いと創発言語に関する研究
Authors: Zhenlin Xu, Marc Niethammer, Colin Raffel
Abstract要約: 合成一般化を直接テストできる2つのデータセット上で、3つの教師なし表現学習アルゴリズムについて検討する。単純なモデルと少ないラベルでボトルネック表現を直接使用すると、学習された表現そのものの前後のレイヤからの表現よりも、より悪い一般化につながる可能性がある。驚くべきことに、不整合表現を生成する圧力の増加は、より悪い一般化を伴う表現を生成するのに対し、ELモデルからの表現は強い合成一般化を示す。
参考スコア（独自算出の注目度）: 48.37815764394315
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep learning models struggle with compositional generalization, i.e. the ability to recognize or generate novel combinations of observed elementary concepts. In hopes of enabling compositional generalization, various unsupervised learning algorithms have been proposed with inductive biases that aim to induce compositional structure in learned representations (e.g. disentangled representation and emergent language learning). In this work, we evaluate these unsupervised learning algorithms in terms of how well they enable compositional generalization. Specifically, our evaluation protocol focuses on whether or not it is easy to train a simple model on top of the learned representation that generalizes to new combinations of compositional factors. We systematically study three unsupervised representation learning algorithms - $\beta$-VAE, $\beta$-TCVAE, and emergent language (EL) autoencoders - on two datasets that allow directly testing compositional generalization. We find that directly using the bottleneck representation with simple models and few labels may lead to worse generalization than using representations from layers before or after the learned representation itself. In addition, we find that the previously proposed metrics for evaluating the levels of compositionality are not correlated with actual compositional generalization in our framework. Surprisingly, we find that increasing pressure to produce a disentangled representation produces representations with worse generalization, while representations from EL models show strong compositional generalization. Taken together, our results shed new light on the compositional generalization behavior of different unsupervised learning algorithms with a new setting to rigorously test this behavior, and suggest the potential benefits of delevoping EL learning algorithms for more generalizable representations.
Abstract（参考訳）: 深層学習モデルは構成的一般化、すなわち観察された基本概念の新たな組み合わせを認識または生成する能力に苦慮する。構成一般化を可能にするために、学習表現における構成構造を誘導する帰納的バイアス(例えば、非絡み合い表現や創発的言語学習)を用いて、様々な教師なし学習アルゴリズムが提案されている。本研究では,これらの教師なし学習アルゴリズムを,構成一般化の有効性の観点から評価する。具体的には,合成因子の新たな組み合わせに一般化する学習表現の上に,簡単なモデルを訓練することが容易かどうかに焦点をあてる。我々は,構成一般化を直接テスト可能な2つのデータセット上で,教師なし表現学習アルゴリズム($\beta$-VAE,$\beta$-TCVAE,および創発言語(EL)オートエンコーダ)を体系的に研究した。単純なモデルと少ないラベルでボトルネック表現を直接使用すると、学習された表現そのものの前後のレイヤからの表現よりも、より悪い一般化につながる可能性がある。また,これまでに提案した構成性評価指標は,本フレームワークの実際の構成一般化と相関しないことがわかった。驚くべきことに、不整合表現を生成する圧力の増加は、より悪い一般化を伴う表現を生成するのに対し、ELモデルからの表現は強い構成一般化を示す。本研究の結果は, 異なる教師なし学習アルゴリズムの合成一般化行動に新たな光を当て, より一般化可能な表現のために, EL学習アルゴリズムを高度に活用することの潜在的メリットを示唆するものである。

論文の概要: Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language

関連論文リスト