Fugu-MT 論文翻訳(概要): Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models

論文の概要: Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models

arxiv url: http://arxiv.org/abs/2411.01141v1
Date: Sat, 02 Nov 2024 05:10:50 GMT
ステータス: 翻訳完了
システム内更新日: 2024-11-28 17:07:43.62514
Title: Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models
Title（参考訳）: 多言語大言語モデルを用いた多言語推論のための辞書挿入プロンプト
Authors: Hongyuan Lu, Zixuan Li, Wai Lam,
Abstract要約: textbfDictionary textbfInsertion textbfPrompting (textbfDIP) という,新規かつシンプルで効果的な方法を提案する。非英語のプロンプトを提供する際、DIPは単語辞書を調べ、単語の英語のプロンプトをLLMのプロンプトに挿入する。そして、英語へのより良い翻訳とより良い英語モデル思考のステップを可能にし、明らかにより良い結果をもたらす。
参考スコア（独自算出の注目度）: 52.00446751692225
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As current training data for Large Language Models (LLMs) are dominated by English corpus, they are English-centric and they present impressive performance on English reasoning tasks.\footnote{This paper primarily studies English-centric models, but our method could be universal by using the centric language in the dictionary for non-English-centric LLMs.} Yet, they usually suffer from lower performance in other languages. There are about 7,000 languages over the world, and many are low-resourced on English-centric LLMs. For the sake of people who primarily speak these languages, it is especially urgent to enable our LLMs in those languages. Model training is usually effective, but computationally expensive and requires experienced NLP practitioners. This paper presents a novel and simple yet effective method called \textbf{D}ictionary \textbf{I}nsertion \textbf{P}rompting (\textbf{DIP}). When providing a non-English prompt, DIP looks up a word dictionary and inserts words' English counterparts into the prompt for LLMs. It then enables better translation into English and better English model thinking steps which leads to obviously better results. We experiment with about 200 languages from FLORES-200. Since there are no adequate datasets, we use the NLLB translator to create synthetic multilingual benchmarks from the existing 4 English reasoning benchmarks such as GSM8K and AQuA. Despite the simplicity and computationally lightweight, we surprisingly found the effectiveness of DIP on math and commonsense reasoning tasks on multiple open-source and close-source LLMs.\footnote{Our dictionaries, code, and synthetic benchmarks will be open-sourced to facilitate future research.}
Abstract（参考訳）: 現在、LLM(Large Language Models)のトレーニングデータは英語コーパスに支配されているため、それらは英語中心であり、英語の推論タスクにおいて印象的なパフォーマンスを示す。この論文は、主に英語中心のモデルを研究するが、英語中心のLLMの辞書における中心言語を用いて、我々の手法は普遍的である可能性がある。しかし、彼らは通常、他の言語での低いパフォーマンスに悩まされる。世界中に約7000の言語があり、その多くが英語中心のLLMで低リソース化されている。主にこれらの言語を話す人々のために、これらの言語でLLMを有効にすることは特に緊急です。モデルトレーニングは通常効果的であるが、計算に高価であり、経験豊富なNLP実践者を必要とする。本稿では, 新規かつシンプルで効果的な方法である \textbf{D}ictionary \textbf{I}nsertion \textbf{P}rompting (\textbf{DIP})を提案する。非英語のプロンプトを提供する際、DIPは単語辞書を調べ、単語の英語のプロンプトをLLMのプロンプトに挿入する。そして、英語へのより良い翻訳と、より良い英語モデル思考のステップを可能にし、明らかにより良い結果をもたらす。 FLORES-200の約200言語について実験を行った。適切なデータセットがないため、GSM8KやAQuAといった既存の4つの英語推論ベンチマークから合成多言語ベンチマークを作成するために、NLLBトランスレータを使用する。単純で計算量も軽量であるにもかかわらず,複数のオープンソースおよびオープンソース LLM 上でのDIP の算数および常識推論タスクに対する効果は驚くほど高いことがわかった。われわれの辞書、コード、および合成ベンチマークは、将来の研究を促進するためにオープンソース化される。 ※

論文の概要: Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models

関連論文リスト