Fugu-MT 論文翻訳(概要): Different types of syntactic agreement recruit the same units within large language models

論文の概要: Different types of syntactic agreement recruit the same units within large language models

arxiv url: http://arxiv.org/abs/2512.03676v1
Date: Wed, 03 Dec 2025 11:07:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-04 20:02:55.260045
Title: Different types of syntactic agreement recruit the same units within large language models
Title（参考訳）: 異なるタイプの構文合意は、大きな言語モデル内で同じ単位を募集する
Authors: Daria Kryvosheieva, Andrea de Varda, Evelina Fedorenko, Greta Tuckute,
Abstract要約: 大規模言語モデルにおいて、異なる構文現象が共有または異なるコンポーネントを採用できるかどうかを検討する。認知神経科学にインスパイアされた機能的局所化アプローチを用いて、67の英構文現象に最も反応するLLM単位を同定した。
参考スコア（独自算出の注目度）: 6.459078990392929
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Large language models (LLMs) can reliably distinguish grammatical from ungrammatical sentences, but how grammatical knowledge is represented within the models remains an open question. We investigate whether different syntactic phenomena recruit shared or distinct components in LLMs. Using a functional localization approach inspired by cognitive neuroscience, we identify the LLM units most responsive to 67 English syntactic phenomena in seven open-weight models. These units are consistently recruited across sentences containing the phenomena and causally support the models' syntactic performance. Critically, different types of syntactic agreement (e.g., subject-verb, anaphor, determiner-noun) recruit overlapping sets of units, suggesting that agreement constitutes a meaningful functional category for LLMs. This pattern holds in English, Russian, and Chinese; and further, in a cross-lingual analysis of 57 diverse languages, structurally more similar languages share more units for subject-verb agreement. Taken together, these findings reveal that syntactic agreement-a critical marker of syntactic dependencies-constitutes a meaningful category within LLMs' representational spaces.
Abstract（参考訳）: 大規模言語モデル(LLM)は、文法的な文と非文法的な文とを確実に区別することができるが、モデル内の文法的知識がどのように表現されているかは未解決の問題である。本研究は,LLMの相違点と相違点の相違点について検討する。認知神経科学にインスパイアされた機能的局在化アプローチを用いて、7つのオープンウェイトモデルにおいて67の英構文現象に最も反応するLLM単位を同定した。これらのユニットは、現象を含む文間で一貫して採用され、モデルの構文的パフォーマンスを因果的に支援する。批判的に、異なるタイプの統語的合意(例えば、主語-動詞、アナパル、決定子-名詞)は、重なり合う単位の集合を採用し、この合意がLLMにとって有意義な機能圏を構成することを示唆している。このパターンは英語、ロシア語、中国語で成り立ち、さらに57の多言語を言語横断で分析すると、構造的に類似した言語が主語と動詞の合意のためにより多くの単位を共有している。これらの結果は,LLMの表現空間における意味のあるカテゴリを構成する構文的コンセンサス(syntactic agreement)が,構文的依存の重要なマーカーであることを示している。

論文の概要: Different types of syntactic agreement recruit the same units within large language models

関連論文リスト