Quantifying Cognitive Factors in Lexical Decline
- URL: http://arxiv.org/abs/2110.05775v1
- Date: Tue, 12 Oct 2021 07:12:56 GMT
- Title: Quantifying Cognitive Factors in Lexical Decline
- Authors: David Francis, Ella Rabinovich, Farhan Samir, David Mortensen, Suzanne
Stevenson
- Abstract summary: We propose a variety of psycholinguistic factors -- semantic, distributional, and phonological -- that we hypothesize are predictive of lexical decline.
We find that most of our proposed factors show a significant difference in the expected direction between each curated set of declining words and their matched stable words.
Further diachronic analysis reveals that declining words tend to decrease in the diversity of their lexical contexts over time, gradually narrowing their 'ecological niches'
- Score: 2.4424095531386234
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We adopt an evolutionary view on language change in which cognitive factors
(in addition to social ones) affect the fitness of words and their success in
the linguistic ecosystem. Specifically, we propose a variety of
psycholinguistic factors -- semantic, distributional, and phonological -- that
we hypothesize are predictive of lexical decline, in which words greatly
decrease in frequency over time. Using historical data across three languages
(English, French, and German), we find that most of our proposed factors show a
significant difference in the expected direction between each curated set of
declining words and their matched stable words. Moreover, logistic regression
analyses show that semantic and distributional factors are significant in
predicting declining words. Further diachronic analysis reveals that declining
words tend to decrease in the diversity of their lexical contexts over time,
gradually narrowing their 'ecological niches'.
Related papers
- Syntactic Language Change in English and German: Metrics, Parsers, and Convergences [56.47832275431858]
The current paper looks at diachronic trends in syntactic language change in both English and German, using corpora of parliamentary debates from the last c. 160 years.
We base our observations on five dependencys, including the widely used Stanford Core as well as 4 newer alternatives.
We show that changes in syntactic measures seem to be more frequent at the tails of sentence length distributions.
arXiv Detail & Related papers (2024-02-18T11:46:16Z) - Quantifying the Dialect Gap and its Correlates Across Languages [69.18461982439031]
This work will lay the foundation for furthering the field of dialectal NLP by laying out evident disparities and identifying possible pathways for addressing them through mindful data collection.
arXiv Detail & Related papers (2023-10-23T17:42:01Z) - Neighboring Words Affect Human Interpretation of Saliency Explanations [65.29015910991261]
Word-level saliency explanations are often used to communicate feature-attribution in text-based models.
Recent studies found that superficial factors such as word length can distort human interpretation of the communicated saliency scores.
We investigate how the marking of a word's neighboring words affect the explainee's perception of the word's importance in the context of a saliency explanation.
arXiv Detail & Related papers (2023-05-04T09:50:25Z) - Subdiffusive semantic evolution in Indo-European languages [0.0]
We find that semantic evolution is strongly subdiffusive across five major Indo-European languages.
We show that words follow trajectories in meaning space with an anomalous diffusion exponent.
We furthermore show that strong subdiffusion is a robust phenomenon under a wide variety of choices in data analysis and interpretation.
arXiv Detail & Related papers (2022-09-10T15:57:32Z) - Contextualized language models for semantic change detection: lessons
learned [4.436724861363513]
We present a qualitative analysis of the outputs of contextualized embedding-based methods for detecting diachronic semantic change.
Our findings show that contextualized methods can often predict high change scores for words which are not undergoing any real diachronic semantic shift.
Our conclusion is that pre-trained contextualized language models are prone to confound changes in lexicographic senses and changes in contextual variance.
arXiv Detail & Related papers (2022-08-31T23:35:24Z) - Slangvolution: A Causal Analysis of Semantic Change and Frequency
Dynamics in Slang [18.609276255676175]
We study slang, an informal language that is typically restricted to a specific group or social setting.
We analyze the semantic change and frequency shift of slang words and compare them to those of standard, nonslang words.
We show that slang words undergo less semantic change but tend to have larger frequency shifts over time.
arXiv Detail & Related papers (2022-03-09T11:34:43Z) - How individuals change language [1.2437226707039446]
We introduce a very general mathematical model that encompasses a wide variety of individual-level linguistic behaviours.
We compare the likelihood of empirically-attested changes in definite and indefinite articles in multiple languages under different assumptions.
We find that accounts of language change that appeal primarily to errors in childhood language acquisition are very weakly supported by the historical data.
arXiv Detail & Related papers (2021-04-20T19:02:49Z) - Disambiguatory Signals are Stronger in Word-initial Positions [48.18148856974974]
We point out the confounds in existing methods for comparing the informativeness of segments early in the word versus later in the word.
We find evidence across hundreds of languages that indeed there is a cross-linguistic tendency to front-load information in words.
arXiv Detail & Related papers (2021-02-03T18:19:16Z) - Mechanisms for Handling Nested Dependencies in Neural-Network Language
Models and Humans [75.15855405318855]
We studied whether a modern artificial neural network trained with "deep learning" methods mimics a central aspect of human sentence processing.
Although the network was solely trained to predict the next word in a large corpus, analysis showed the emergence of specialized units that successfully handled local and long-distance syntactic agreement.
We tested the model's predictions in a behavioral experiment where humans detected violations in number agreement in sentences with systematic variations in the singular/plural status of multiple nouns.
arXiv Detail & Related papers (2020-06-19T12:00:05Z) - Where New Words Are Born: Distributional Semantic Analysis of Neologisms
and Their Semantic Neighborhoods [51.34667808471513]
We investigate the importance of two factors, semantic sparsity and frequency growth rates of semantic neighbors, formalized in the distributional semantics paradigm.
We show that both factors are predictive word emergence although we find more support for the latter hypothesis.
arXiv Detail & Related papers (2020-01-21T19:09:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.