Fugu-MT Paper Translation (Abstract): Exploring the Capability Boundaries of LLMs in Mastering of Chinese Chouxiang Language

Paper overview: Exploring the Capability Boundaries of LLMs in Mastering of Chinese Chouxiang Language

arxiv url: http://arxiv.org/abs/2604.15841v2
Date: Mon, 20 Apr 2026 08:26:20 GMT
Status: Translation complete
System update date: 2026-04-21 13:51:31.200071
Title: Exploring the Capability Boundaries of LLMs in Mastering of Chinese Chouxiang Language
Title (reference translation): Exploring the Capability Boundaries of LLMs in Mastering the Chinese Chouxiang Language
Authors: Dianqing Lin, Tian Lan, Jiali Zhu, Jiang Li, Wei Chen, Xu Liu, Aruukhan, Xiangdong Su, Hongxu Hou, Guanglai Gao
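Abstract summary: We introduce Mouse, a specialized benchmark designed to evaluate the capabilities of large language models (LLMs) on NLP tasks involving Chouxiang Language. Experiments show that current state-of-the-art (SOTA) LLMs exhibit clear limitations on multiple tasks, while performing well on tasks that involve contextual semantic understanding. The study aims to promote further research in the NLP community on multicultural integration and the dynamics of evolving internet languages.
Reference score (independently computed attention score): 26.275675761892654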
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While large language models (LLMs) have achieved remarkable success in general language tasks, their performance on Chouxiang Language, a representative subcultural language in the Chinese internet context, remains largely unexplored. In this paper, we introduce Mouse, a specialized benchmark designed to evaluate the capabilities of LLMs on six NLP tasks involving Chouxiang Language. Experimental results show that current state-of-the-art (SOTA) LLMs exhibit clear limitations on multiple tasks, while performing well on tasks that involve contextual semantic understanding. In addition, we discuss the reasons behind the generally low performance of SOTA LLMs on Chouxiang Language, examine whether the LLM-as-a-judge approach adopted for translation tasks aligns with human judgments and values, and analyze the key factors that influence Chouxiang translation. Our study aims to promote further research in the NLP community on multicultural integration and the dynamics of evolving internet languages. Our code and data are publicly available.
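Abstract (reference translation): Large language models (LLMs) have achieved remarkable success on general language tasks, but their performance on Chouxiang Language, a representative subcultural language of the Chinese internet, remains largely unexplored. This paper introduces Mouse, a specialized benchmark for evaluating the capabilities of LLMs on six NLP tasks involving Chouxiang Language. The experimental results show that current state-of-the-art (SOTA) LLMs exhibit clear limitations on multiple tasks, while performing well on tasks that involve contextual semantic understanding. The paper also discusses the reasons behind the generally low performance of SOTA LLMs on Chouxiang Language, examines whether the LLM-as-a-judge approach adopted for translation tasks aligns with human judgments and values, and analyzes the key factors that influence Chouxiang translation. The study aims to promote further research in the NLP community on multicultural integration and the dynamics of evolving internet languages. The code and data are publicly available.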

Related papers