A Unified Feature Representation for Lexical Connotations
- URL: http://arxiv.org/abs/2006.00635v2
- Date: Mon, 1 Mar 2021 14:14:21 GMT
- Title: A Unified Feature Representation for Lexical Connotations
- Authors: Emily Allaway and Kathleen McKeown
- Abstract summary: We use distant labeling to create a new lexical resource representing connotation aspects for nouns and adjectives.
Our analysis shows that it aligns well with human judgments.
- Score: 13.153001795077227
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Ideological attitudes and stance are often expressed through subtle meanings
of words and phrases. Understanding these connotations is critical to
recognizing the cultural and emotional perspectives of the speaker. In this
paper, we use distant labeling to create a new lexical resource representing
connotation aspects for nouns and adjectives. Our analysis shows that it aligns
well with human judgments. Additionally, we present a method for creating
lexical representations that captures connotations within the embedding space
and show that using the embeddings provides a statistically significant
improvement on the task of stance detection when data is limited.
Related papers
- Explaining Representation Learning with Perceptual Components [14.10876324116018]
Self-supervised models create representation spaces that lack clear semantic meaning.
We introduce a novel method to analyze representation spaces using three key perceptual components: color, shape, and texture.
Our approach enhances the interpretability of the representation space, offering explanations that resonate with human visual perception.
arXiv Detail & Related papers (2024-06-11T04:08:37Z) - Quantifying the redundancy between prosody and text [67.07817268372743]
We use large language models to estimate how much information is redundant between prosody and the words themselves.
We find a high degree of redundancy between the information carried by the words and prosodic information across several prosodic features.
Still, we observe that prosodic features can not be fully predicted from text, suggesting that prosody carries information above and beyond the words.
arXiv Detail & Related papers (2023-11-28T21:15:24Z) - Distributed Marker Representation for Ambiguous Discourse Markers and
Entangled Relations [50.31129784616845]
We learn a Distributed Marker Representation (DMR) by utilizing the unlimited discourse marker data with a latent discourse sense.
Our method also offers a valuable tool to understand complex ambiguity and entanglement among discourse markers and manually defined discourse relations.
arXiv Detail & Related papers (2023-06-19T00:49:51Z) - Adverbs, Surprisingly [1.9075820340282936]
We show that adverbs are neglected in computational linguistics.
We suggest that using Frame Semantics for characterizing word meaning, as in FrameNet, provides a promising approach to adverb analysis.
arXiv Detail & Related papers (2023-05-31T08:30:08Z) - Natural Language Decompositions of Implicit Content Enable Better Text
Representations [56.85319224208865]
We introduce a method for the analysis of text that takes implicitly communicated content explicitly into account.
We use a large language model to produce sets of propositions that are inferentially related to the text that has been observed.
Our results suggest that modeling the meanings behind observed language, rather than the literal text alone, is a valuable direction for NLP.
arXiv Detail & Related papers (2023-05-23T23:45:20Z) - Neighboring Words Affect Human Interpretation of Saliency Explanations [65.29015910991261]
Word-level saliency explanations are often used to communicate feature-attribution in text-based models.
Recent studies found that superficial factors such as word length can distort human interpretation of the communicated saliency scores.
We investigate how the marking of a word's neighboring words affect the explainee's perception of the word's importance in the context of a saliency explanation.
arXiv Detail & Related papers (2023-05-04T09:50:25Z) - Shades of meaning: Uncovering the geometry of ambiguous word
representations through contextualised language models [6.760960482418417]
Lexical ambiguity presents a profound and enduring challenge to the language sciences.
Our work offers new insight into psychological understanding of lexical ambiguity through a series of simulations.
arXiv Detail & Related papers (2023-04-26T14:47:38Z) - Lost in Context? On the Sense-wise Variance of Contextualized Word
Embeddings [11.475144702935568]
We quantify how much the contextualized embeddings of each word sense vary across contexts in typical pre-trained models.
We find that word representations are position-biased, where the first words in different contexts tend to be more similar.
arXiv Detail & Related papers (2022-08-20T12:27:25Z) - Latent Topology Induction for Understanding Contextualized
Representations [84.7918739062235]
We study the representation space of contextualized embeddings and gain insight into the hidden topology of large language models.
We show there exists a network of latent states that summarize linguistic properties of contextualized representations.
arXiv Detail & Related papers (2022-06-03T11:22:48Z) - Aspectuality Across Genre: A Distributional Semantics Approach [25.816944882581343]
The interpretation of the lexical aspect of verbs in English plays a crucial role for recognizing textual entailment and learning discourse-level inferences.
We show that two elementary dimensions of aspectual class, states vs. events, and telic vs. atelic events, can be modelled effectively with distributional semantics.
arXiv Detail & Related papers (2020-10-31T19:37:22Z) - On Vocabulary Reliance in Scene Text Recognition [79.21737876442253]
Methods perform well on images with words within vocabulary but generalize poorly to images with words outside vocabulary.
We call this phenomenon "vocabulary reliance"
We propose a simple yet effective mutual learning strategy to allow models of two families to learn collaboratively.
arXiv Detail & Related papers (2020-05-08T11:16:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.