Related papers: On the Context-Free Ambiguity of Emoji: A Data-Driven Study of 1,289 Emojis

On the Context-Free Ambiguity of Emoji: A Data-Driven Study of 1,289 Emojis

URL: http://arxiv.org/abs/2201.06302v1
Date: Mon, 17 Jan 2022 09:33:29 GMT
Title: On the Context-Free Ambiguity of Emoji: A Data-Driven Study of 1,289 Emojis
Authors: Justyna Czestochowska, Kristina Gligoric, Maxime Peyrard, Yann Mentha, Michal Bien, Andrea Grutter, Anita Auer, Aris Xanthos, Robert West
Abstract summary: We collect a crowdsourced dataset of one-word emoji descriptions for 1,289 emojis presented to participants with no surrounding text. We find that with 30 annotations per emoji, 16 emojis are completely unambiguous, whereas 55 emojis are so ambiguous that their descriptions are indistinguishable from randomly chosen descriptions.
Score: 28.04805745702487
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Emojis come with prepacked semantics making them great candidates to create new forms of more accessible communications. Yet, little is known about how much of this emojis semantic is agreed upon by humans, outside of textual contexts. Thus, we collected a crowdsourced dataset of one-word emoji descriptions for 1,289 emojis presented to participants with no surrounding text. The emojis and their interpretations were then examined for ambiguity. We find that with 30 annotations per emoji, 16 emojis (1.2%) are completely unambiguous, whereas 55 emojis (4.3%) are so ambiguous that their descriptions are indistinguishable from randomly chosen descriptions. Most of studied emojis are spread out between the two extremes. Furthermore, investigating the ambiguity of different types of emojis, we find that an important factor is the extent to which an emoji has an embedded symbolical meaning drawn from an established code-book of symbols. We conclude by discussing design implications.

Related papers

The Prosody of Emojis [73.70220975424597]
This study examines how emojis influence prosodic realisation in speech and how listeners interpret prosodic cues to recover emoji meanings.<n>Unlike previous work, we directly link prosody and emoji by analysing actual human speech data, collected through structured but open-ended production and perception tasks.<n>Results show that speakers adapt their prosody based on emoji cues, listeners can often identify the intended emoji from prosodic variation alone, and greater semantic differences between emojis correspond to increased prosodic divergence.
arXiv Detail & Related papers (2025-08-01T11:24:12Z)
Irony in Emojis: A Comparative Study of Human and LLM Interpretation [53.66354612549173]
This study examines the ability of GPT-4o to interpret irony in emojis. By prompting GPT-4o to evaluate the likelihood of specific emojis being used to express irony on social media, we aim to bridge the gap between machine and human understanding.
arXiv Detail & Related papers (2025-01-20T03:02:00Z)
Semantics Preserving Emoji Recommendation with Large Language Models [47.94761630160614]
Existing emoji recommendation methods are primarily evaluated based on their ability to match the exact emoji a user chooses in the original text. We propose a new semantics preserving evaluation framework for emoji recommendation, which measures a model's ability to recommend emojis that maintain the semantic consistency with the user's text.
arXiv Detail & Related papers (2024-09-16T22:27:46Z)
Emojinize: Enriching Any Text with Emoji Translations [10.674155943520729]
Emojinize is a method for translating arbitrary text phrases into sequences of one or more emoji without requiring human input. By leveraging the power of large language models, Emojinize can choose appropriate emoji by disambiguating based on context. Emojinize's translations increase the human guessability of masked words by 55%, whereas human-picked emoji translations do so by only 29%.
arXiv Detail & Related papers (2024-03-06T17:06:17Z)
EmojiLM: Modeling the New Emoji Language [44.23076273155259]
We develop a text-emoji parallel corpus, Text2Emoji, from a large language model. Based on the parallel corpus, we distill a sequence-to-sequence model, EmojiLM, which is specialized in the text-emoji bidirectional translation. Our proposed model outperforms strong baselines and the parallel corpus benefits emoji-related downstream tasks.
arXiv Detail & Related papers (2023-11-03T07:06:51Z)
Emojich -- zero-shot emoji generation using Russian language: a technical report [52.77024349608834]
"Emojich" is a text-to-image neural network that generates emojis using captions in Russian language as a condition. We aim to keep the generalization ability of a pretrained big model ruDALL-E Malevich (XL) 1.3B parameters at the fine-tuning stage.
arXiv Detail & Related papers (2021-12-04T23:37:32Z)
How to Do Things without Words: Modeling Semantic Drift of Emoji [0.2538209532048866]
We model and analyze the semantic drift of emoji and discuss the features that may be contributing to the drift. This evolution could be addressed through the framework of semantic drifts.
arXiv Detail & Related papers (2021-10-08T12:45:26Z)
Black or White but never neutral: How readers perceive identity from yellow or skin-toned emoji [90.14874935843544]
Recent work established a connection between expression of identity and emoji usage on social media. This work asks if, as with language, readers are sensitive to such acts of self-expression and use them to understand the identity of authors.
arXiv Detail & Related papers (2021-05-12T18:23:51Z)
Semantic Journeys: Quantifying Change in Emoji Meaning from 2012-2018 [66.28665205489845]
We offer the first longitudinal study of how emoji semantics changes over time, applying techniques from computational linguistics to six years of Twitter data. We identify five patterns in emoji semantic development and find evidence that the less abstract an emoji is, the more likely it is to undergo semantic change. To aid future work on emoji and semantics, we make our data publicly available along with a web-based interface that anyone can use to explore semantic change in emoji.
arXiv Detail & Related papers (2021-05-03T13:35:10Z)
Emoji Prediction: Extensions and Benchmarking [30.642840676899734]
The emoji prediction task aims at predicting the proper set of emojis associated with a piece of text. We extend the existing setting of the emoji prediction task to include a richer set of emojis and to allow multi-label classification. We propose novel models for multi-class and multi-label emoji prediction based on Transformer networks.
arXiv Detail & Related papers (2020-07-14T22:41:20Z)
Word-Emoji Embeddings from large scale Messaging Data reflect real-world Semantic Associations of Expressive Icons [7.032245866317618]
We train word-emoji embeddings on large scale messaging data obtained from the Jodel online social network. Our data set contains more than 40 million sentences, of which 11 million sentences are annotated with a subset of the Unicode 13.0 standard Emoji list.
arXiv Detail & Related papers (2020-05-19T19:55:56Z)

This list is automatically generated from the titles and abstracts of the papers in this site.