Fugu-MT 論文翻訳(概要): "AGI" team at SHROOM-CAP: Data-Centric Approach to Multilingual Hallucination Detection using XLM-RoBERTa

論文の概要: "AGI" team at SHROOM-CAP: Data-Centric Approach to Multilingual Hallucination Detection using XLM-RoBERTa

arxiv url: http://arxiv.org/abs/2511.18301v1
Date: Sun, 23 Nov 2025 05:48:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-25 18:34:24.75774
Title: "AGI" team at SHROOM-CAP: Data-Centric Approach to Multilingual Hallucination Detection using XLM-RoBERTa
Title（参考訳）: SHROOM-CAP「AGI」チーム:XLM-RoBERTaを用いた多言語幻覚検出のためのデータ中心的アプローチ
Authors: Harsh Rathva, Pruthwik Mishra, Shrikant Malviya,
Abstract要約: 本稿では,SHROOM-CAP 2025の9言語にわたる科学的幻覚検出タスクについて述べる。既存の5つのデータセットを統合して、124,821のサンプル(50%の正解、50%の幻覚)からなる総合的なトレーニングコーパスを作成します。我々の結果は、体系的なデータキュレーションがアーキテクチャの革新を単独で著しく上回ることを示した。
参考スコア（独自算出の注目度）: 2.444311666637296
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The detection of hallucinations in multilingual scientific text generated by Large Language Models (LLMs) presents significant challenges for reliable AI systems. This paper describes our submission to the SHROOM-CAP 2025 shared task on scientific hallucination detection across 9 languages. Unlike most approaches that focus primarily on model architecture, we adopted a data-centric strategy that addressed the critical issue of training data scarcity and imbalance. We unify and balance five existing datasets to create a comprehensive training corpus of 124,821 samples (50% correct, 50% hallucinated), representing a 172x increase over the original SHROOM training data. Our approach fine-tuned XLM-RoBERTa-Large with 560 million parameters on this enhanced dataset, achieves competitive performance across all languages, including \textbf{2nd place in Gujarati} (zero-shot language) with Factuality F1 of 0.5107, and rankings between 4th-6th place across the remaining 8 languages. Our results demonstrate that systematic data curation can significantly outperform architectural innovations alone, particularly for low-resource languages in zero-shot settings.
Abstract（参考訳）: LLM(Large Language Models)が生成する多言語科学テキストにおける幻覚の検出は,信頼性の高いAIシステムにおいて重要な課題である。本稿では,SHROOM-CAP 2025の9言語にわたる科学的幻覚検出タスクについて述べる。主にモデルアーキテクチャに焦点を当てた多くのアプローチとは異なり、データ不足と不均衡をトレーニングする上で重要な問題に対処する、データ中心の戦略を採用しました。既存の5つのデータセットを統一してバランスを取り、124,821個のサンプル(50%は正しい、50%は幻覚的)からなる総合的なトレーニングコーパスを作成しました。我々のアプローチでは、XLM-RoBERTa-Largeを5億6000万のパラメータで微調整し、残りの8言語で4位から6位までのランク付けを行い、すべての言語で競合性能を実現しました。以上の結果から,構造化データキュレーションは,特にゼロショット環境での低リソース言語において,アーキテクチャの革新性に優れることが示された。

論文の概要: "AGI" team at SHROOM-CAP: Data-Centric Approach to Multilingual Hallucination Detection using XLM-RoBERTa

関連論文リスト