Fugu-MT Paper Translation (Abstract): ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation

Paper abstract: ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation

arxiv url: http://arxiv.org/abs/2603.14903v1
Date: Mon, 16 Mar 2026 07:04:55 GMT
Status: Translation complete
Last updated in system: 2026-03-17 16:19:36.121602
Title: ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation
Title (reference translation): ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation
Authors: Yuzhe Shang, Pengzhi Gao, Yazheng Yang, Jiayao Ma, Wei Liu, Jian Luan, Jingsong Su
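Abstract summary: Large language models (LLMs) have recently shown promising performance in simultaneous machine translation (SimulMT). Applying decoder-only LLMs to SimulMT causes a positional mismatch, creating a dilemma between decoding efficiency and positional consistency. Existing approaches often depend on specific positional encodings or carefully designed prompting schemes, and so cannot achieve inference efficiency, positional consistency, and broad model compatibility at the same time. We propose ExPosST, a general framework that resolves this dilemma through explicit position allocation. ExPosST reserves fixed positional slots for incoming source tokens, enabling efficient decoding with a KV cache across different positional encoding methods.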
Reference score (independently calculated notability): 19.365349936996584
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) have recently demonstrated promising performance in simultaneous machine translation (SimulMT). However, applying decoder-only LLMs to SimulMT introduces a positional mismatch, which leads to a dilemma between decoding efficiency and positional consistency. Existing approaches often rely on specific positional encodings or carefully designed prompting schemes, and thus fail to simultaneously achieve inference efficiency, positional consistency, and broad model compatibility. In this work, we propose ExPosST, a general framework that resolves this dilemma through explicit position allocation. ExPosST reserves fixed positional slots for incoming source tokens, enabling efficient decoding with KV cache across different positional encoding methods. To further bridge the gap between fine-tuning and inference, we introduce a policy-consistent fine-tuning strategy that aligns training with inference-time decoding behavior. Experiments across multiple language pairs demonstrate that ExPosST effectively supports simultaneous translation under diverse policies.
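Abstract (reference translation): Large language models (LLMs) have recently shown promising performance in simultaneous machine translation (SimulMT). However, applying decoder-only LLMs to SimulMT causes a positional mismatch, creating a dilemma between decoding efficiency and positional consistency. Existing approaches often depend on specific positional encodings or carefully designed prompting schemes, and so cannot achieve inference efficiency, positional consistency, and broader model compatibility at the same time. This paper proposes ExPosST, a general framework that resolves this dilemma through explicit position allocation. ExPosST reserves fixed positional slots for incoming source tokens, enabling efficient decoding with a KV cache across different positional encoding methods. In addition, to close the gap between fine-tuning and inference, it introduces a policy-consistent fine-tuning strategy that aligns training with inference-time decoding behavior. Experiments on multiple language pairs show that ExPosST effectively supports simultaneous translation under diverse policies.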
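
The positional dilemma described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the slot layout, the masking scheme, or how prompts are handled, so the function names, the `reserved_slots` parameter, and the "source block first, target after" layout below are all assumptions made for illustration. The sketch only shows why contiguous position numbering invalidates cached target entries when a new source token arrives, and why reserving a fixed block of source positions avoids that.

```python
from typing import List, Tuple


def contiguous_positions(num_source: int, num_target: int) -> Tuple[List[int], List[int]]:
    """Baseline numbering: target positions start right after the source read so far."""
    source = list(range(num_source))
    target = list(range(num_source, num_source + num_target))
    return source, target


def slotted_positions(
    num_source: int, num_target: int, reserved_slots: int
) -> Tuple[List[int], List[int]]:
    """Hypothetical slot layout (an assumption, not taken from the paper):
    source tokens fill a reserved block [0, reserved_slots) in arrival order,
    and target tokens are numbered from reserved_slots onward."""
    if num_source > reserved_slots:
        raise ValueError("source length exceeds the reserved slot budget")
    source = list(range(num_source))
    target = list(range(reserved_slots, reserved_slots + num_target))
    return source, target


# Simulated step: 3 source tokens read, 1 target token written,
# then 1 more source token arrives.
_, target_before = contiguous_positions(3, 1)
_, target_after = contiguous_positions(4, 1)
# The already-generated target token moves from position 3 to 4, so a
# key/value entry cached under the old position no longer matches.
contiguous_shifted = target_before != target_after

_, slot_before = slotted_positions(3, 1, reserved_slots=8)
_, slot_after = slotted_positions(4, 1, reserved_slots=8)
# With a reserved block the target token keeps position 8, so its cached
# entry stays consistent after the new source token is read.
slotted_stable = slot_before == slot_after

print(contiguous_shifted, slotted_stable)  # prints: True True
```

Under these assumptions, the cost of the slotted layout is a gap of unused positions between the last source token read and the first target token, which is the kind of trade-off a fixed slot budget implies.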

Related papers