Fugu-MT Paper Translation (Abstract): Hebrew Diacritics Restoration using Visual Representation

Paper summary: Hebrew Diacritics Restoration using Visual Representation

arxiv url: http://arxiv.org/abs/2510.26521v1
Date: Thu, 30 Oct 2025 14:15:16 GMT
Status: Translation complete
Last updated in system: 2025-10-31 16:05:09.852131
Title: Hebrew Diacritics Restoration using Visual Representation
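Title (reference translation): Restoring Hebrew Diacritics Using Visual Representation
Authors: Yair Elboher, Yuval Pinter
Abstract summary: We propose DIVRIT, a Hebrew diacritization system that frames the task as a zero-shot classification problem. The approach operates at the word level, selecting the most appropriate diacritization pattern for each word. A key innovation of DIVRIT is its use of a Hebrew Visual Language Model, which processes undiacritized text as an image.
Reference score (independently computed attention score): 8.254230288283258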
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Diacritics restoration in Hebrew is a fundamental task for ensuring accurate word pronunciation and disambiguating textual meaning. Despite the language's high degree of ambiguity when unvocalized, recent machine learning approaches have significantly advanced performance on this task. In this work, we present DIVRIT, a novel system for Hebrew diacritization that frames the task as a zero-shot classification problem. Our approach operates at the word level, selecting the most appropriate diacritization pattern for each undiacritized word from a dynamically generated candidate set, conditioned on the surrounding textual context. A key innovation of DIVRIT is its use of a Hebrew Visual Language Model, which processes undiacritized text as an image, allowing diacritic information to be embedded directly within the input's vector representation. Through a comprehensive evaluation across various configurations, we demonstrate that the system effectively performs diacritization without relying on complex, explicit linguistic analysis. Notably, in an "oracle" setting where the correct diacritized form is guaranteed to be among the provided candidates, DIVRIT achieves a high level of accuracy. Furthermore, strategic architectural enhancements and optimized training methodologies yield significant improvements in the system's overall generalization capabilities. These findings highlight the promising potential of visual representations for accurate and automated Hebrew diacritization.
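Abstract (reference translation): Restoring diacritics in Hebrew is a fundamental task for ensuring accurate word pronunciation and disambiguating textual meaning. Although the language is highly ambiguous when unvocalized, recent machine learning approaches have substantially advanced performance on this task. This paper describes DIVRIT, a new system for Hebrew diacritization. The method operates at the word level, selecting the most appropriate diacritization pattern for each word from a dynamically generated candidate set, conditioned on the surrounding textual context. A key innovation of DIVRIT is its use of a Hebrew Visual Language Model. Through a comprehensive evaluation across various configurations, we show that the system diacritizes effectively without relying on complex, explicit linguistic analysis. In particular, in an "oracle" setting where the correct diacritized form is guaranteed to be among the provided candidates, DIVRIT achieves high accuracy. Furthermore, strategic architectural enhancements and optimized training methods substantially improve the system's overall generalization. These findings indicate the potential of visual representations for accurate, automated Hebrew diacritization.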
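
The word-level selection described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how candidates are scored, so everything here is an assumption: `select_candidate`, `oracle_accuracy`, and `toy_scorer` are hypothetical names, the scorer is a hand-written stand-in for the Hebrew Visual Language Model, and the words are transliterated toy placeholders rather than real diacritized Hebrew.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical scorer signature: (context words, index of target word, candidate) -> score.
# In the paper this role is played by a learned model; here it is just a callable.
Scorer = Callable[[Sequence[str], int, str], float]
Example = Tuple[Sequence[str], int, Sequence[str], str]


def select_candidate(context: Sequence[str], index: int,
                     candidates: Sequence[str], scorer: Scorer) -> str:
    """Pick the highest-scoring candidate for the word at `index`.

    Ties are resolved in favour of the earliest candidate in the list.
    """
    if not candidates:
        raise ValueError("candidate set must not be empty")
    return max(candidates, key=lambda cand: scorer(context, index, cand))


def oracle_accuracy(examples: Sequence[Example], scorer: Scorer) -> float:
    """Word-level accuracy when the gold form is guaranteed to be a candidate."""
    if not examples:
        raise ValueError("need at least one example")
    correct = 0
    for context, index, candidates, gold in examples:
        if gold not in candidates:
            raise ValueError("oracle setting requires the gold form among the candidates")
        if select_candidate(context, index, candidates, scorer) == gold:
            correct += 1
    return correct / len(examples)


def toy_scorer(context: Sequence[str], index: int, candidate: str) -> float:
    """Toy stand-in for a model: counts context words listed as cues for the candidate."""
    cues = {"sefer": {"read"}, "sapar": {"haircut"}}  # assumed, illustrative only
    neighbours = set(context[:index]) | set(context[index + 1:])
    return float(len(cues.get(candidate, set()) & neighbours))


if __name__ == "__main__":
    examples = [
        (["read", "spr"], 1, ["sapar", "sefer"], "sefer"),
        (["haircut", "spr"], 1, ["sapar", "sefer"], "sapar"),
    ]
    print(oracle_accuracy(examples, toy_scorer))
```

Passing the scorer in as a parameter keeps the selection logic independent of how the score is produced, so a real model could be substituted without changing `select_candidate`.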

Related papers