Investigating Cross-Linguistic Adjective Ordering Tendencies with a
Latent-Variable Model
- URL: http://arxiv.org/abs/2010.04755v1
- Date: Fri, 9 Oct 2020 18:27:55 GMT
- Title: Investigating Cross-Linguistic Adjective Ordering Tendencies with a
Latent-Variable Model
- Authors: Jun Yen Leung, Guy Emerson, Ryan Cotterell
- Abstract summary: We present the first purely corpus-driven model of multi-lingual adjective ordering in the form of a latent-variable model.
We provide strong converging evidence for the existence of universal, cross-linguistic, hierarchical adjective ordering tendencies.
- Score: 66.84264870118723
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Across languages, multiple consecutive adjectives modifying a noun (e.g. "the
big red dog") follow certain unmarked ordering rules. While explanatory
accounts have been put forward, much of the work done in this area has relied
primarily on the intuitive judgment of native speakers, rather than on corpus
data. We present the first purely corpus-driven model of multi-lingual
adjective ordering in the form of a latent-variable model that can accurately
order adjectives across 24 different languages, even when the training and
testing languages are different. We utilize this novel statistical model to
provide strong converging evidence for the existence of universal,
cross-linguistic, hierarchical adjective ordering tendencies.
Related papers
- What an Elegant Bridge: Multilingual LLMs are Biased Similarly in Different Languages [51.0349882045866]
This paper investigates biases of Large Language Models (LLMs) through the lens of grammatical gender.
We prompt a model to describe nouns with adjectives in various languages, focusing specifically on languages with grammatical gender.
We find that a simple classifier can not only predict noun gender above chance but also exhibit cross-language transferability.
arXiv Detail & Related papers (2024-07-12T22:10:16Z) - Computational Modelling of Plurality and Definiteness in Chinese Noun
Phrases [13.317456093426808]
We focus on the omission of the plurality and definiteness markers in Chinese noun phrases (NPs)
We build a corpus of Chinese NPs, each of which is accompanied by its corresponding context, and by labels indicating its singularity/plurality and definiteness/indefiniteness.
We train a bank of computational models using both classic machine learning models and state-of-the-art pre-trained language models to predict the plurality and definiteness of each NP.
arXiv Detail & Related papers (2024-03-07T10:06:54Z) - A Cross-Linguistic Pressure for Uniform Information Density in Word
Order [79.54362557462359]
We use computational models to test whether real orders lead to greater information uniformity than counterfactual orders.
Among SVO languages, real word orders consistently have greater uniformity than reverse word orders.
Only linguistically implausible counterfactual orders consistently exceed the uniformity of real orders.
arXiv Detail & Related papers (2023-06-06T14:52:15Z) - Montague semantics and modifier consistency measurement in neural
language models [1.6799377888527685]
This work proposes a methodology for measuring compositional behavior in contemporary language models.
Specifically, we focus on adjectival modifier phenomena in adjective-noun phrases.
Our experimental results indicate that current neural language models behave according to the expected linguistic theories to a limited extent only.
arXiv Detail & Related papers (2022-10-10T18:43:16Z) - Universality and diversity in word patterns [0.0]
We present an analysis of lexical statistical connections for eleven major languages.
We find that the diverse manners that languages utilize to express word relations give rise to unique pattern distributions.
arXiv Detail & Related papers (2022-08-23T20:03:27Z) - Scalar Adjective Identification and Multilingual Ranking [4.915907527975786]
We introduce a new multilingual dataset in order to promote research on scalar adjectives in new languages.
We perform a series of experiments and set performance baselines on this dataset, using monolingual and multilingual contextual language models.
We introduce a new binary classification task for English scalar adjective identification.
arXiv Detail & Related papers (2021-05-03T21:32:41Z) - Word Frequency Does Not Predict Grammatical Knowledge in Language Models [2.1984302611206537]
We investigate whether there are systematic sources of variation in the language models' accuracy.
We find that certain nouns are systematically understood better than others, an effect which is robust across grammatical tasks and different language models.
We find that a novel noun's grammatical properties can be few-shot learned from various types of training data.
arXiv Detail & Related papers (2020-10-26T19:51:36Z) - On the Language Neutrality of Pre-trained Multilingual Representations [70.93503607755055]
We investigate the language-neutrality of multilingual contextual embeddings directly and with respect to lexical semantics.
Our results show that contextual embeddings are more language-neutral and, in general, more informative than aligned static word-type embeddings.
We show how to reach state-of-the-art accuracy on language identification and match the performance of statistical methods for word alignment of parallel sentences.
arXiv Detail & Related papers (2020-04-09T19:50:32Z) - On the Importance of Word Order Information in Cross-lingual Sequence
Labeling [80.65425412067464]
Cross-lingual models that fit into the word order of the source language might fail to handle target languages.
We investigate whether making models insensitive to the word order of the source language can improve the adaptation performance in target languages.
arXiv Detail & Related papers (2020-01-30T03:35:44Z) - A Simple Joint Model for Improved Contextual Neural Lemmatization [60.802451210656805]
We present a simple joint neural model for lemmatization and morphological tagging that achieves state-of-the-art results on 20 languages.
Our paper describes the model in addition to training and decoding procedures.
arXiv Detail & Related papers (2019-04-04T02:03:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.