Fugu-MT Paper Translation (Abstract): Open Machine Translation for Esperanto

Paper overview: Open Machine Translation for Esperanto

arxiv url: http://arxiv.org/abs/2603.29345v1
Date: Tue, 31 Mar 2026 07:17:24 GMT
Status: Translation complete
Last updated in system: 2026-04-01 15:25:03.2454
Title: Open Machine Translation for Esperanto
Title (reference translation): Open Machine Translation for Esperanto
Authors: Ona de Gibert, Lluís de Gibert
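Abstract summary: The authors conduct a comprehensive evaluation of open-source machine translation systems for Esperanto, comparing rule-based systems, encoder-decoder models, and LLMs across model sizes. The results show that the NLLB family achieves the best performance in all language pairs.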
Reference score (independently computed attention score): 2.1836499601883754
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Esperanto is a widespread constructed language, known for its regular grammar and productive word formation. Besides having substantial resources available thanks to its online community, it remains relatively underexplored in the context of modern machine translation (MT) approaches. In this work, we present the first comprehensive evaluation of open-source MT systems for Esperanto, comparing rule-based systems, encoder-decoder models, and LLMs across model sizes. We evaluate translation quality across six language directions involving English, Spanish, Catalan, and Esperanto using multiple automatic metrics as well as human evaluation. Our results show that the NLLB family achieves the best performance in all language pairs, followed closely by our trained compact models and a fine-tuned general-purpose LLM. Human evaluation confirms this trend, with NLLB translations preferred in approximately half of the comparisons, although noticeable errors remain. In line with Esperanto's tradition of openness and international collaboration, we release our code and best-performing models publicly.
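Abstract (reference translation): Esperanto is a widely used constructed language, known for its regular grammar and productive word formation. Despite the substantial resources available thanks to its online community, it remains relatively underexplored in the context of modern machine translation (MT) approaches. In this work, we present a comprehensive evaluation of open-source MT systems for Esperanto, comparing rule-based systems, encoder-decoder models, and LLMs across model sizes. We evaluate translation quality across six language directions involving English, Spanish, Catalan, and Esperanto, using multiple automatic metrics as well as human evaluation. Our results show that the NLLB family achieves the best performance in all language pairs, followed closely by our trained compact models and a fine-tuned general-purpose LLM. Human evaluation confirms this trend: NLLB translations are preferred in approximately half of the comparisons, although noticeable errors remain. In line with Esperanto's tradition of openness and international collaboration, we release our code and best-performing models publicly.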

Related papers