Fugu-MT Paper Translation (Abstract): Residual Learning of Neural Text Generation with $n$-gram Language Model

Paper overview: Residual Learning of Neural Text Generation with $n$-gram Language Model

arxiv url: http://arxiv.org/abs/2210.14431v1
Date: Wed, 26 Oct 2022 02:42:53 GMT
Status: Translation complete
System update date: 2022-10-27 13:01:25.962127
Title: Residual Learning of Neural Text Generation with $n$-gram Language Model
Title (reference translation): Residual learning of neural text generation using an $n$-gram language model
Authors: Huayang Li, Deng Cai, Jin Xu, Taro Watanabe
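Abstract summary: We learn a neural LM that fits the residual between an $n$-gram LM and the real-data distribution. Our approach consistently attains additional performance gains over popular standalone neural models.
Reference score (independently computed attention score): 41.26228768053928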
License: http://creativecommons.org/licenses/by/4.0/
Abstract: $N$-gram language models (LMs) have been largely superseded by neural LMs as the latter exhibit better performance. However, we find that $n$-gram models can achieve satisfactory performance on a large proportion of testing cases, indicating they have already captured abundant knowledge of the language with relatively low computational cost. With this observation, we propose to learn a neural LM that fits the residual between an $n$-gram LM and the real-data distribution. The combination of $n$-gram and neural LMs not only allows the neural part to focus on the deeper understanding of language but also provides a flexible way to customize an LM by switching the underlying $n$-gram model without changing the neural model. Experimental results on three typical language tasks (i.e., language modeling, machine translation, and summarization) demonstrate that our approach attains additional performance gains over popular standalone neural models consistently. We also show that our approach allows for effective domain adaptation by simply switching to a domain-specific $n$-gram model, without any extra training. Our code is released at https://github.com/ghrua/NgramRes.
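Abstract (reference translation): $N$-gram language models (LMs) have been superseded by neural LMs because the latter perform better. However, we find that $n$-gram models achieve good performance on a large share of test cases, which indicates that they have already acquired substantial knowledge of the language at relatively low computational cost. Based on this observation, we propose learning a neural LM that fits the residual between an $n$-gram LM and the real-data distribution. Combining $n$-gram and neural LMs not only lets the neural part concentrate on a deeper understanding of language, but also offers a flexible way to customize an LM by switching the underlying $n$-gram model without changing the neural model. Experimental results on three typical language tasks (language modeling, machine translation, and summarization) show that our approach consistently improves on popular standalone neural models. We also show that the approach enables effective domain adaptation, with no extra training, simply by switching to a domain-specific $n$-gram model. Our code is released at https://github.com/ghrua/NgramRes.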
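The abstract does not state how the two models are combined, so the following is only a minimal sketch of one plausible reading: the neural part outputs a per-token residual score that is added to the $n$-gram log-probability, and the sum is renormalized. The function names, the toy vocabulary, the probabilities, and the residual values are all hypothetical and are not taken from the paper or from the released NgramRes code.

```python
import math


def log_softmax(scores):
    """Numerically stable log-softmax over a list of floats."""
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_z for s in scores]


def combine(ngram_probs, neural_residual, eps=1e-9):
    """Add residual scores to n-gram log-probabilities and renormalize.

    Assumption: the combination is additive in log space. The abstract
    only says the neural LM "fits the residual"; it gives no formula.
    """
    scores = [math.log(p + eps) + r for p, r in zip(ngram_probs, neural_residual)]
    return [math.exp(lp) for lp in log_softmax(scores)]


# Toy next-token distributions from two hypothetical n-gram models
# over the same three-word vocabulary.
vocab = ["the", "cat", "gene"]
general_ngram = [0.6, 0.3, 0.1]
biomedical_ngram = [0.5, 0.1, 0.4]

# Hypothetical residual scores from the neural part for one context.
# They stay fixed: only the n-gram model is swapped.
residual = [0.0, 0.5, -0.5]

p_general = combine(general_ngram, residual)
p_biomedical = combine(biomedical_ngram, residual)

for word, pg, pb in zip(vocab, p_general, p_biomedical):
    print(f"{word:5s} general={pg:.3f} biomedical={pb:.3f}")
```

Under this assumed formulation, a residual of all zeros returns the $n$-gram distribution unchanged, and swapping `general_ngram` for `biomedical_ngram` shifts probability toward the domain term while the residual scores are left untouched, which mirrors the domain-adaptation claim in the abstract.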

Related papers