Fugu-MT Paper Translation (Abstract): Ukrainian Visual Word Sense Disambiguation Benchmark

Paper overview: Ukrainian Visual Word Sense Disambiguation Benchmark

arxiv url: http://arxiv.org/abs/2603.23627v1
Date: Tue, 24 Mar 2026 18:09:24 GMT
Status: Translation complete
Last updated in system: 2026-03-26 21:06:10.981897
Title: Ukrainian Visual Word Sense Disambiguation Benchmark
Title (reference translation): Ukrainian Visual Word Sense Disambiguation Benchmark
Authors: Yurii Laba, Yaryna Mohytych, Ivanna Rohulia, Halyna Kyryleyza, Hanna Dydyk-Meush, Oles Dobosevych, Rostyslav Hryniv
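Abstract summary: This study presents a benchmark for evaluating the Visual Word Sense Disambiguation (Visual-WSD) task in Ukrainian. The main goal of the Visual-WSD task is to identify, with minimal contextual information, the most appropriate representation of a given ambiguous word. Our analysis revealed a significant performance gap in the Visual-WSD task between Ukrainian and English.
Reference score (independently computed attention score): 0.7203557048672377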
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: This study presents a benchmark for evaluating the Visual Word Sense Disambiguation (Visual-WSD) task in Ukrainian. The main goal of the Visual-WSD task is to identify, with minimal contextual information, the most appropriate representation of a given ambiguous word from a set of ten images. To construct this benchmark, we followed a methodology similar to that proposed by (CITATION), who previously introduced benchmarks for the Visual-WSD task in English, Italian, and Farsi. This approach allows us to incorporate the Ukrainian benchmark into a broader framework for cross-language model performance comparisons. We collected the benchmark data semi-automatically and refined it with input from domain experts. We then assessed eight multilingual and multimodal large language models using this benchmark. All tested models performed worse than the zero-shot CLIP-based baseline model (CITATION) used by (CITATION) for the English Visual-WSD task. Our analysis revealed a significant performance gap in the Visual-WSD task between Ukrainian and English.
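Abstract (reference translation): This study presents a benchmark for evaluating the Visual Word Sense Disambiguation (Visual-WSD) task in Ukrainian. The main goal of the Visual-WSD task is to identify, with minimal contextual information, the most appropriate representation of a given ambiguous word from a set of ten images. To construct this benchmark, we followed a methodology similar to that of (CITATION), who previously introduced benchmarks for the Visual-WSD task in English, Italian, and Farsi. This approach allows us to incorporate the Ukrainian benchmark into a broader framework for cross-language model performance comparisons. We collected the benchmark data semi-automatically and refined it with input from domain experts. We then assessed eight multilingual and multimodal large language models using this benchmark. All tested models performed worse than the zero-shot CLIP-based baseline model (CITATION) used for the English Visual-WSD task. Our analysis revealed a significant performance gap in the Visual-WSD task between Ukrainian and English.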
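
Illustrative sketch (not from the paper): the abstract describes choosing, for an ambiguous word with minimal context, the best match among a set of candidate images. One common way to frame this with a CLIP-style model is to embed the text and each image in a shared space and rank the images by cosine similarity. The snippet below shows only that ranking step. It assumes the embeddings have already been computed by some encoder and are non-zero vectors; the function names and the toy vectors are hypothetical, and three candidates stand in for the ten used in the task.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_candidates(text_embedding, image_embeddings):
    """Return candidate indices ordered from most to least similar to the text."""
    scores = [cosine(text_embedding, img) for img in image_embeddings]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


# Toy, hand-written vectors (assumption: real embeddings would come from an encoder).
text_vec = [1.0, 0.0, 0.0]
candidates = [
    [0.0, 1.0, 0.0],  # orthogonal to the text vector
    [0.9, 0.1, 0.0],  # closest in direction
    [0.5, 0.5, 0.0],  # in between
]

ranking = rank_candidates(text_vec, candidates)
print(ranking)  # index of the predicted image comes first
```

The first index in the returned ranking is the predicted image; a benchmark would compare it against the gold image for each word.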

Related papers