Fugu-MT 論文翻訳(概要): Filling in the Mechanisms: How do LMs Learn Filler-Gap Dependencies under Developmental Constraints?

論文の概要: Filling in the Mechanisms: How do LMs Learn Filler-Gap Dependencies under Developmental Constraints?

arxiv url: http://arxiv.org/abs/2604.14459v1
Date: Wed, 15 Apr 2026 22:32:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.63646
Title: Filling in the Mechanisms: How do LMs Learn Filler-Gap Dependencies under Developmental Constraints?
Title（参考訳）: メカニズムの充足: LMはどのようにして発達制約下でFiller-Gap依存を学習するか?
Authors: Atrey Desai, Sathvik Nair,
Abstract要約: 我々は,ホイルクエストとトピック化の間で,フィラーギャップ依存性の表現が伝達されるかどうかを評価する。この結果から,学習データに制限がある場合,共有的かつアイテムに敏感なメカニズムが発達する可能性が示唆された。さらに重要なのは、LMは人間よりもはるかに多くのデータを必要とすることだ。
参考スコア（独自算出の注目度）: 0.31511847280063693
License: http://creativecommons.org/licenses/by/4.0/
Abstract: For humans, filler-gap dependencies require a shared representation across different syntactic constructions. Although causal analyses suggest this may also be true for LLMs (Boguraev et al., 2025), it is still unclear if such a representation also exists for language models trained on developmentally feasible quantities of data. We applied Distributed Alignment Search (DAS, Geiger et al. (2024)) to LMs trained on varying amounts of data from the BabyLM challenge (Warstadt et al., 2023), to evaluate whether representations of filler-gap dependencies transfer between wh-questions and topicalization, which greatly vary in terms of their input frequency. Our results suggest shared, yet item-sensitive mechanisms may develop with limited training data. More importantly, LMs still require far more data than humans to learn comparable generalizations, highlighting the need for language-specific biases in models of language acquisition.
Abstract（参考訳）: 人間にとって、フィラーギャップ依存は、異なる構文構造をまたいだ共有表現を必要とする。因果解析から LLM にも当てはまる可能性が示唆されているが (Boguraev et al , 2025) 、そのような表現が発達可能なデータ量で訓練された言語モデルにも存在するかどうかは不明である。 We applied Distributed Alignment Search (DAS, Geiger et al (2024)) to the BabyLM Challenge (Warstadt et al , 2023) to evaluate on various amount of data from the BabyLM Challenge (Warstadt et al , 2023)。この結果から,学習データに制限がある場合,共有的かつアイテムに敏感なメカニズムが発達する可能性が示唆された。さらに重要なのは、LMは匹敵する一般化を学ぶために人間よりもはるかに多くのデータを必要としており、言語習得のモデルにおける言語固有のバイアスの必要性を強調している。

論文の概要: Filling in the Mechanisms: How do LMs Learn Filler-Gap Dependencies under Developmental Constraints?

関連論文リスト