Fugu-MT 論文翻訳(概要): Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English

論文の概要: Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English

arxiv url: http://arxiv.org/abs/2604.07067v1
Date: Wed, 08 Apr 2026 13:17:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 17:30:51.548757
Title: Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English
Title（参考訳）: バイリンガルモデルにおける言語間移動は人間に似ているか? オランダ語と英語の重なり合う単語形式を用いた研究
Authors: Iza Škrjanec, Irene Elisabeth Winther, Vera Demberg, Stefan L. Frank,
Abstract要約: コニャート(友人)は典型的にはファシリテーションを引き起こすが、言語間ホモグラフ(偽の友人)は干渉や効果を生じない。バイリンガル言語モデルにおける言語間アクティベーションがこれらのパターンを反映しているかどうかを検討する。
参考スコア（独自算出の注目度）: 12.996963143295654
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Bilingual speakers show cross-lingual activation during reading, especially for words with shared surface form. Cognates (friends) typically lead to facilitation, whereas interlingual homographs (false friends) cause interference or no effect. We examine whether cross-lingual activation in bilingual language models mirrors these patterns. We train Dutch-English causal Transformers under four vocabulary-sharing conditions that manipulate whether (false) friends receive shared or language-specific embeddings. Using psycholinguistic stimuli from bilingual reading studies, we evaluate the models through surprisal and embedding similarity analyses. The models largely maintain language separation, and cross-lingual effects arise primarily when embeddings are shared. In these cases, both friends and false friends show facilitation relative to controls. Regression analyses reveal that these effects are mainly driven by frequency rather than consistency in form-meaning mapping. Only when just friends share embeddings are the qualitative patterns of bilinguals reproduced. Overall, bilingual language models capture some cross-linguistic activation effects. However, their alignment with human processing seems to critically depend on how lexical overlap is encoded, possibly limiting their explanatory adequacy as models of bilingual reading.
Abstract（参考訳）: バイリンガル話者は読み上げ中に言語間のアクティベーションを示す。コニャート(友人)は典型的にはファシリテーションを引き起こすが、言語間ホモグラフ(偽の友人)は干渉や効果を生じない。バイリンガル言語モデルにおける言語間アクティベーションがこれらのパターンを反映しているかどうかを検討する。オランダ語と英語の因果変換器を4つの語彙共有条件で訓練し、(偽)友人が共有または言語固有の埋め込みを受けるかどうかを制御した。バイリンガル読解研究の心理言語学的刺激を用いて,予備的および埋め込み的類似性分析によるモデルの評価を行った。モデルは言語分離を主に維持し、埋め込みを共有する際には言語間効果が主に生じる。これらのケースでは、友人と偽の友人の両方が、コントロールに対してファシリテーションを示す。回帰分析により、これらの効果は主に形式的意味のマッピングにおける一貫性よりも周波数によって引き起こされることが明らかとなった。友達が埋め込みを共有するときだけ、バイリンガルの質的なパターンが再現される。全体として、バイリンガル言語モデルは言語間のアクティベーション効果を捉えている。しかしながら、人間の処理との整合性は、どのように語彙的重複がコード化されているかに大きく依存しているようで、おそらくはバイリンガル読解のモデルとしての説明的妥当性を制限している。

論文の概要: Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English

関連論文リスト