Fugu-MT 論文翻訳(概要): Heat Death of Generative Models in Closed-Loop Learning

論文の概要: Heat Death of Generative Models in Closed-Loop Learning

arxiv url: http://arxiv.org/abs/2404.02325v2
Date: Wed, 28 Aug 2024 22:54:15 GMT
ステータス: 翻訳完了
システム内更新日: 2024-08-30 19:18:41.380719
Title: Heat Death of Generative Models in Closed-Loop Learning
Title（参考訳）: 閉ループ学習における生成モデルの熱死
Authors: Matteo Marchi, Stefano Soatto, Pratik Chaudhari, Paulo Tabuada,
Abstract要約: 本研究では、独自の学習データセットに加えて、生成したコンテンツをフィードバックする生成モデルの学習ダイナミクスについて検討する。各イテレーションで十分な量の外部データが導入されない限り、非自明な温度がモデルを退化させることを示す。
参考スコア（独自算出の注目度）: 63.83608300361159
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Improvement and adoption of generative machine learning models is rapidly accelerating, as exemplified by the popularity of LLMs (Large Language Models) for text, and diffusion models for image generation. As generative models become widespread, data they generate is incorporated into shared content through the public web. This opens the question of what happens when data generated by a model is fed back to the model in subsequent training campaigns. This is a question about the stability of the training process, whether the distribution of publicly accessible content, which we refer to as "knowledge", remains stable or collapses. Small scale empirical experiments reported in the literature show that this closed-loop training process is prone to degenerating. Models may start producing gibberish data, or sample from only a small subset of the desired data distribution (a phenomenon referred to as mode collapse). So far there has been only limited theoretical understanding of this process, in part due to the complexity of the deep networks underlying these generative models. The aim of this paper is to provide insights into this process (that we refer to as "generative closed-loop learning") by studying the learning dynamics of generative models that are fed back their own produced content in addition to their original training dataset. The sampling of many of these models can be controlled via a "temperature" parameter. Using dynamical systems tools, we show that, unless a sufficient amount of external data is introduced at each iteration, any non-trivial temperature leads the model to asymptotically degenerate. In fact, either the generative distribution collapses to a small set of outputs or becomes uniform over a large set of outputs.
Abstract（参考訳）: テキスト用LLM(Large Language Models)や画像生成用拡散モデルなどにより,生成機械学習モデルの改良と採用が急速に加速している。生成モデルが普及するにつれて、生成したデータは公開ウェブを通じて共有コンテンツに組み込まれる。このことは、その後のトレーニングキャンペーンでモデルによって生成されたデータがモデルにフィードバックされたときに何が起こるのかという疑問を提起する。これは、トレーニングプロセスの安定性に関する問題であり、私たちが「知識」と呼ぶ公開コンテンツの分布が安定しているか、崩壊しているかである。論文で報告された小規模な実験実験により、この閉ループトレーニングプロセスは縮退しがちであることが示された。モデルは、望まれるデータ分布の小さなサブセット(モード崩壊と呼ばれる現象)からのみ、ジベリッシュなデータを生成する。これまでのところ、これらの生成モデルの基礎となる深層ネットワークの複雑さのために、このプロセスに関する理論的理解は限られている。本研究の目的は、本プロセス(「生成クローズドループ学習」と呼ぶ)について、独自の学習データセットに加えて、独自の生成したコンテンツをフィードバックする生成モデルの学習ダイナミクスを研究することである。これらのモデルのサンプリングは、"温度"パラメータによって制御できる。動的システムツールを用いて、各イテレーションで十分な量の外部データが導入されない限り、非自明な温度がモデルを漸近的に退化させることを示す。実際、生成分布は小さな出力セットに崩壊するか、あるいは大きな出力セットに対して一様になる。

論文の概要: Heat Death of Generative Models in Closed-Loop Learning

関連論文リスト