Fugu-MT 論文翻訳(概要): OLA: Output Language Alignment in Code-Switched LLM Interactions

論文の概要: OLA: Output Language Alignment in Code-Switched LLM Interactions

arxiv url: http://arxiv.org/abs/2601.03589v1
Date: Wed, 07 Jan 2026 05:07:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-09 02:15:23.274724
Title: OLA: Output Language Alignment in Code-Switched LLM Interactions
Title（参考訳）: OLA: Code-Switched LLM インタラクションにおける出力言語アライメント
Authors: Juhyun Oh, Haneul Yoo, Faiz Ghifari Haznitrama, Alice Oh,
Abstract要約: 会話の中で言語を交互に切り替えるコードスイッチングは、多言語ユーザにとっては自然なことですが、大きな言語モデルには根本的な課題があります。コード切替インタラクションにおけるLLMの出力言語アライメントを評価するためのベンチマークであるOLAを紹介する。 OLAは韓国語-英語のコードスイッチングに重点を置いており、命令-内容ミスマッチへの単純な文内ミキシングにまたがっている。
参考スコア（独自算出の注目度）: 31.119553472916234
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Code-switching, alternating between languages within a conversation, is natural for multilingual users, yet poses fundamental challenges for large language models (LLMs). When a user code-switches in their prompt to an LLM, they typically do not specify the expected language of the LLM response, and thus LLMs must infer the output language from contextual and pragmatic cues. We find that current LLMs systematically fail to align with this expectation, responding in undesired languages even when cues are clear to humans. We introduce OLA, a benchmark to evaluate LLMs' Output Language Alignment in code-switched interactions. OLA focuses on Korean--English code-switching and spans simple intra-sentential mixing to instruction-content mismatches. Even frontier models frequently misinterpret implicit language expectation, exhibiting a bias toward non-English responses. We further show this bias generalizes beyond Korean to Chinese and Indonesian pairs. Models also show instability through mid-response switching and language intrusions. Chain-of-Thought prompting fails to resolve these errors, indicating weak pragmatic reasoning about output language. However, Code-Switching Aware DPO with minimal data (about 1K examples) substantially reduces misalignment, suggesting these failures stem from insufficient alignment rather than fundamental limitations. Our results highlight the need to align multilingual LLMs with users' implicit expectations in real-world code-switched interactions.
Abstract（参考訳）: 会話の中で言語を交互に切り替えるコードスイッチングは、多言語ユーザにとっては自然なことですが、大きな言語モデル(LLM)には根本的な課題があります。ユーザがLLMへのプロンプトをコードスイッチすると、通常、LLM応答の期待言語は指定されないため、LLMは文脈的および実用的なキューから出力言語を推論しなければならない。現状のLLMは,人間に明快な言語であっても,望ましくない言語に反応するが,体系的にこの期待と一致しないことがわかった。コード切替インタラクションにおけるLLMの出力言語アライメントを評価するためのベンチマークであるOLAを紹介する。 OLAは韓国語-英語のコードスイッチングに重点を置いており、命令-内容ミスマッチへの単純な文内ミキシングにまたがっている。フロンティアモデルでさえ、暗黙の言語期待をしばしば誤解し、非英語の反応に対する偏見を示す。さらに、このバイアスは韓国から中国とインドネシアのペアまで一般化されていることを示す。モデルはまた、ミッドレスポンススイッチングと言語侵入を通じて不安定を示す。 Chain-of-Thoughtプロンプトはこれらのエラーを解決するのに失敗し、出力言語に関する実践的な推論が弱いことを示している。しかし、最小限のデータ(約1Kの例)を持つコードスイッチング DPOは、ミスアライメントを大幅に減らし、これらの失敗は基本的な制限ではなく、アライメントが不十分であることを示している。この結果から,マルチリンガル LLM と実世界のコード交換インタラクションにおけるユーザの暗黙の期待を一致させる必要性が浮き彫りになった。

論文の概要: OLA: Output Language Alignment in Code-Switched LLM Interactions

関連論文リスト