Fugu-MT 論文翻訳(概要): MKJ at SemEval-2026 Task 9: A Comparative Study of Generalist, Specialist, and Ensemble Strategies for Multilingual Polarization

論文の概要: MKJ at SemEval-2026 Task 9: A Comparative Study of Generalist, Specialist, and Ensemble Strategies for Multilingual Polarization

arxiv url: http://arxiv.org/abs/2604.21370v1
Date: Thu, 23 Apr 2026 07:34:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:06.36495
Title: MKJ at SemEval-2026 Task 9: A Comparative Study of Generalist, Specialist, and Ensemble Strategies for Multilingual Polarization
Title（参考訳）: MKJ at SemEval-2026 Task 9: A Comparison Study of Generalist, Specialist, and Ensemble Strategies for Multilingual Polarization (特集:一般)
Authors: Maziar Kianimoghadam Jouneghani,
Abstract要約: 本研究では,SemEval-2026タスク9(Subtask 1)における22言語間の多言語分極検出の系統的研究を行う。単一のユニバーサルアーキテクチャを強制するのではなく、多言語ジェネラリスト、言語固有のスペシャリスト、および開発パフォーマンスに基づいたハイブリッドアンサンブルを切り替える言語適応フレームワークを採用する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We present a systematic study of multilingual polarization detection across 22 languages for SemEval-2026 Task 9 (Subtask 1), contrasting multilingual generalists with language-specific specialists and hybrid ensembles. While a standard generalist like XLM-RoBERTa suffices when its tokenizer aligns with the target text, it may struggle with distinct scripts (e.g., Khmer, Odia) where monolingual specialists yield significant gains. Rather than enforcing a single universal architecture, we adopt a language-adaptive framework that switches between multilingual generalists, language-specific specialists, and hybrid ensembles based on development performance. Additionally, cross-lingual augmentation via NLLB-200 yielded mixed results, often underperforming native architecture selection and degrading morphologically rich tracks. Our final system achieves an overall macro-averaged F1 score of 0.796 and an average accuracy of 0.826 across all 22 tracks. Code and final test predictions are publicly available at: https://github.com/Maziarkiani/SemEval2026-Task9-Subtask1-Polarization.
Abstract（参考訳）: 本研究では,SemEval-2026 Task 9(Subtask 1)における22言語間の多言語分極検出の体系的研究を行い,言語固有の専門家やハイブリッドアンサンブルとの対比を行った。 XLM-RoBERTaのような標準的なジェネラリストは、トークン化ツールがターゲットのテキストと整列するときに十分であるが、モノリンガルの専門家が大きな利益をもたらす異なるスクリプト(例えば、Khmer、Odia)と競合する可能性がある。単一のユニバーサルアーキテクチャを強制するのではなく、多言語ジェネラリスト、言語固有のスペシャリスト、および開発パフォーマンスに基づいたハイブリッドアンサンブルを切り替える言語適応フレームワークを採用する。さらに、NLLB-200による言語間の拡張により、混合結果が得られ、しばしばネイティブアーキテクチャの選択が低下し、形態学的にリッチなトラックが劣化した。最終システムは、マクロ平均F1スコアが0.796、平均精度が0.826である。コードと最終的なテスト予測は、https://github.com/Maziarkiani/SemEval2026-Task9-Subtask1-Polarizationで公開されている。

論文の概要: MKJ at SemEval-2026 Task 9: A Comparative Study of Generalist, Specialist, and Ensemble Strategies for Multilingual Polarization

関連論文リスト