Fugu-MT 論文翻訳(概要): G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

論文の概要: G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

arxiv url: http://arxiv.org/abs/2606.18989v1
Date: Wed, 17 Jun 2026 12:09:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-18 17:16:51.155124
Title: G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment
Title（参考訳）: G-IdiomAlign: 言語間IdiomアライメントのためのGloss-Pivotedベンチマーク
Authors: Fengying Ye, Yanming Sun, Runzhe Zhan, Zheqi Zhang, Lidia S. Chao, Derek F. Wong,
Abstract要約: We present G-IdiomAlign, a gloss-ed benchmark where each idiom are anchored by a English gloss from Wiktionary。 G-IdiomAlignは,(1)非グロスとWith-glosの入力を対比したGloss-Contrastive Generationと,(2)非グロスとWith-glosの2つのプロトコルをサポートし,明示的なセマンティックピボットの効果を分離する。
参考スコア（独自算出の注目度）: 37.72396108456943
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Idioms are difficult to transfer across languages due to their non-compositionality and weak surface-form grounding, making literal mappings unreliable. We present G-IdiomAlign, a gloss-pivoted benchmark where each idiom is anchored by an English gloss from Wiktionary. We further construct a high-confidence reference alignment set for reproducible evaluation. G-IdiomAlign supports two protocols: (1) a controlled Multiple-Choice Idiom Equivalence with typed distractors for error attribution; and (2) a Gloss-Contrastive Generation contrasting No-gloss and With-gloss inputs to isolate the effect of an explicit semantic pivot. Across diverse LLMs, a bias to literal translation is a dominant failure mode, especially when the target is a low-resource language. Glosses consistently improve Gloss-Contrastive Generation under an embedding-based semantic proxy, but performance remains modest, indicating substantial headroom in the open output space. Subsequent analysis on Qwen3-8B further suggests that cross-condition differences are concentrated more in attention heads than in layers, while better With-gloss generations coincide with stronger gloss anchoring.
Abstract（参考訳）: 慣用句は非合成性や表面形状の弱い接地のため言語間での移動が困難であり、リテラルマッピングは信頼できない。我々は、G-IdiomAlignというGloss-pivotedベンチマークを示し、各イディオムはWiktionaryの英語の光沢によって固定される。さらに,再現性評価のための高信頼基準アライメントセットを構築した。 G-IdiomAlign は,(1) 誤り帰属のための型付きイントラクタによる制御されたマルチコースイディオム等価性,(2) 明示的なセマンティックピボットの効果を分離するために,No-glos と With-glos の入力を対比したGloss-Contrastive Generation をサポートする。多様なLLM全体において、リテラル翻訳へのバイアスは、特にターゲットが低リソース言語である場合、主要な障害モードである。 Glossesは埋め込みベースのセマンティックプロキシの下でGross-Contrastive Generationを継続的に改善するが、パフォーマンスは控えめであり、オープンな出力空間におけるかなりのヘッドルームを示している。その後のQwen3-8B分析では、層よりもクロスコンディションの違いの方が注目度が高いことが示唆された。

論文の概要: G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

関連論文リスト