Fugu-MT 論文翻訳(概要): KD4MT: A Survey of Knowledge Distillation for Machine Translation

論文の概要: KD4MT: A Survey of Knowledge Distillation for Machine Translation

arxiv url: http://arxiv.org/abs/2602.15845v1
Date: Thu, 22 Jan 2026 14:26:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-23 12:01:13.642609
Title: KD4MT: A Survey of Knowledge Distillation for Machine Translation
Title（参考訳）: KD4MT:機械翻訳の知識蒸留に関する調査
Authors: Ona de Gibert, Joseph Attieh, Timothee Mickus, Yves Scherrer, Jörg Tiedemann,
Abstract要約: この調査は105論文にわたる機械翻訳のための知識蒸留(2025年10月1日)を合成する。我々は,KD4MTの方法論的貢献と実践的応用に基づいて,KD4MT文献の進歩を分類した。具体的設定でKD法を選択するためのガイドラインを提案し, MTへのKDの適用に伴う潜在的なリスクを明らかにする。
参考スコア（独自算出の注目度）: 13.688061965899562
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Knowledge Distillation (KD) as a research area has gained a lot of traction in recent years as a compression tool to address challenges related to ever-larger models in NLP. Remarkably, Machine Translation (MT) offers a much more nuanced take on this narrative: in MT, KD also functions as a general-purpose knowledge transfer mechanism that shapes supervision and translation quality as well as efficiency. This survey synthesizes KD for MT (KD4MT) across 105 papers (through October 1, 2025). We begin by introducing both MT and KD for non-experts, followed by an overview of the standard KD approaches relevant to MT applications. Subsequently, we categorize advances in the KD4MT literature based on (i) their methodological contributions and (ii) their practical applications. Our qualitative and quantitative analyses identify common trends in the field and highlight key research gaps as well as the absence of unified evaluation practice for KD methods in MT. We further provide practical guidelines for selecting a KD method in concrete settings and highlight potential risks associated with the application of KD to MT such as increased hallucination and bias amplification. Finally, we discuss the role of LLMs in re-shaping the KD4MT field. To support further research, we complement our survey with a publicly available database summarizing the main characteristics of the surveyed KD methods and a glossary of key terms.
Abstract（参考訳）: 研究分野としての知識蒸留(KD)は近年,NLPの超大型モデルに関わる課題に対処するための圧縮ツールとして,多くの注目を集めている。 MTでは、KDは、監督と翻訳の質と効率を形作る汎用的な知識伝達メカニズムとしても機能します。 MT用KD(KD4MT)を105紙(2025年10月1日)で合成する。 MT と KD の両方を非専門家向けに導入し,続いて MT アプリケーションに関連する標準 KD アプローチの概要を紹介する。その後、我々はKD4MT文献の進歩を分類する。一方法論上の貢献及び貢献 (二)実践的応用我々は,KD法を具体的設定で選択するための実践的ガイドラインや,幻覚や偏見の増幅など,KDのMTへの適用に伴う潜在的なリスクを明らかにするために,現場における共通トレンドを特定し,重要な研究ギャップを浮き彫りにする。最後に、KD4MTフィールドの再形成におけるLLMの役割について論じる。今後の研究を支援するために,調査対象のKD手法の主な特徴を要約したデータベースと,重要な用語の用語集を補完する。

論文の概要: KD4MT: A Survey of Knowledge Distillation for Machine Translation

関連論文リスト