Fugu-MT 論文翻訳(概要): Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

論文の概要: Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

arxiv url: http://arxiv.org/abs/2605.18504v1
Date: Mon, 18 May 2026 14:56:44 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:49.808356
Title: Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models
Title（参考訳）: 古代ギリシア語から現代ギリシア語への機械翻訳: LLMとNMTモデルに関する新しいベンチマークと微調整実験
Authors: Spyridon Mavromatis, Sokratis Sofianopoulos, Prokopis Prokopidis, Maria Giagkou,
Abstract要約: 我々はAG-MG Parallel Corpusを132,481対の文列を持つ新しいリソースとして紹介する。ウェブスクラッピングされた抜粋レベルのデータと多段階の文レベルのアライメントを組み合わせた新しいコーパス生成パイプラインを提案する。本研究は,3つの微調整戦略の評価を行い,最新のMTモデルの総合ベンチマークを行った。
参考スコア（独自算出の注目度）: 0.24042587920175496
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Machine Translation (MT) for Ancient Greek (AG) to Modern Greek (MG) is a low-resource task, constrained by the lack of large-scale, high-quality parallel data. We address this gap by introducing the AG-MG Parallel Corpus, a new resource containing 132,481 sentence-aligned pairs derived from literary, historical, and biblical texts. We present a novel corpus creation pipeline that combines web-scraped, excerpt-level data with a multi-stage sentence-level alignment, and refinement process. Our method uses VecAlign with LaBSE embeddings, which we first fine-tune on a manually-aligned AG-MG subset, followed by an LLM-based error/misalignment correction phase using Gemini 2.5 Flash to ensure high alignment quality. Furthermore, we provide the first comprehensive benchmark of modern MT models on this task, evaluating three fine-tuning strategies across NMT models (NLLB, M2M100) and a Greek LLM (Llama-Krikri-8B). Our experiments show that fine-tuning yields significant improvements over base models, increasing performance by up to +10.3 BLEU points. Specifically, full-parameter fine-tuning of Llama-Krikri-8B achieves the highest overall performance with a BLEU score of 13.16, while the QLoRA-adapted M2M100-1.2B model demonstrates the largest relative gains and highly competitive results. Our dataset and models represent a significant contribution to Greek NLP.
Abstract（参考訳）: 機械翻訳(MT)は、古代ギリシア語(AG)から現代ギリシア語(MG)への機械翻訳であり、大規模で高品質な並列データがないために制約される。 AG-MG並列コーパス(AG-MG Parallel Corpus)は、文学、歴史、聖書のテキストから132,481対の文を並べた新しいリソースである。ウェブスクラッピングされた抜粋レベルのデータと多段階の文レベルのアライメントと改良プロセスを組み合わせた新しいコーパス生成パイプラインを提案する。提案手法では,手動のAG-MGサブセットにVecAlignとLaBSEを組み込み,次にGemini 2.5 Flashを用いたLLMベースのエラー/ミスアライメント補正フェーズを用いて高アライメント品質を確保する。さらに,NMTモデル (NLLB, M2M100) とギリシャのLLM (Llama-Krikri-8B) の3つの微調整戦略を評価する。実験の結果, 微調整はベースモデルよりも大幅に改善され, 最大+10.3BLEUポイントの性能が向上した。具体的には、Llama-Krikri-8Bのフルパラメータの微調整はBLEUスコアが13.16であるのに対して、QLoRAに適応したM2M100-1.2Bモデルは最大の相対的な利得と高い競争結果を示す。我々のデータセットとモデルは、ギリシャのNLPに大きな貢献をしている。

論文の概要: Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

関連論文リスト