Social Analysis of Young Basque Speaking Communities in Twitter
- URL: http://arxiv.org/abs/2109.03487v1
- Date: Wed, 8 Sep 2021 08:19:08 GMT
- Title: Social Analysis of Young Basque Speaking Communities in Twitter
- Authors: J. Fernandez de Landa and R. Agerri
- Abstract summary: We take into account both social and linguistic aspects to perform demographic analysis by processing a large amount of tweets in Basque language.
The study of demographic characteristics and social relationships are approached by applying machine learning and modern deep-learning Natural Language Processing (NLP) techniques.
- Score: 0.9445512376558136
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this paper we take into account both social and linguistic aspects to
perform demographic analysis by processing a large amount of tweets in Basque
language. The study of demographic characteristics and social relationships are
approached by applying machine learning and modern deep-learning Natural
Language Processing (NLP) techniques, combining social sciences with automatic
text processing. More specifically, our main objective is to combine
demographic inference and social analysis in order to detect young Basque
Twitter users and to identify the communities that arise from their
relationships or shared content. This social and demographic analysis will be
entirely based on the~automatically collected tweets using NLP to convert
unstructured textual information into interpretable knowledge.
Related papers
- From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition [59.57095498284501]
We propose a novel approach that recognizes textbfContextual textbfSocial textbfRelationships (textbfConSoR) from a social cognitive perspective.
We construct social-aware descriptive language prompts with social relationships for each image.
Impressively, ConSoR outperforms previous methods with a 12.2% gain on the People-in-Social-Context (PISC) dataset and a 9.8% increase on the People-in-Photo-Album (PIPA) benchmark.
arXiv Detail & Related papers (2024-06-12T16:02:28Z) - The Call for Socially Aware Language Technologies [94.6762219597438]
We argue that many of these issues share a common core: a lack of awareness of the factors, context, and implications of the social environment in which NLP operates.
We argue that substantial challenges remain for NLP to develop social awareness and that we are just at the beginning of a new era for the field.
arXiv Detail & Related papers (2024-05-03T18:12:39Z) - Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions [67.60397632819202]
Building socially-intelligent AI agents (Social-AI) is a multidisciplinary, multimodal research goal.
We identify a set of underlying technical challenges and open questions for researchers across computing communities to advance Social-AI.
arXiv Detail & Related papers (2024-04-17T02:57:42Z) - SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents [73.35393511272791]
We propose an interactive learning method, SOTOPIA-$pi$, improving the social intelligence of language agents.
This method leverages behavior cloning and self-reinforcement training on filtered social interaction data according to large language model (LLM) ratings.
arXiv Detail & Related papers (2024-03-13T17:17:48Z) - Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future [59.78608958395464]
We build a Social AI Data Infrastructure, which consists of a comprehensive social AI taxonomy and a data library of 480 NLP datasets.
Our infrastructure allows us to analyze existing dataset efforts, and also evaluate language models' performance in different social intelligence aspects.
We show there is a need for multifaceted datasets, increased diversity in language and culture, more long-tailed social situations, and more interactive data in future social intelligence data efforts.
arXiv Detail & Related papers (2024-02-28T00:22:42Z) - Social Orientation: A New Feature for Dialogue Analysis [15.192659799728181]
We introduce a new data set of dialogue utterances machine-labeled with social orientation tags.
We show that social orientation tags improve task performance, especially in low-resource settings.
We also demonstrate how social orientation tags help explain the outcomes of social interactions when used in neural models.
arXiv Detail & Related papers (2024-02-26T01:55:45Z) - TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for
Multilingual Tweet Representations at Twitter [31.698196219228024]
We present TwHIN-BERT, a multilingual language model productionized at Twitter.
Our model is trained on 7 billion tweets covering over 100 distinct languages.
We evaluate our model on various multilingual social recommendation and semantic understanding tasks.
arXiv Detail & Related papers (2022-09-15T19:01:21Z) - Sentiment Analysis with R: Natural Language Processing for
Semi-Automated Assessments of Qualitative Data [0.0]
This tutorial introduces the basic functions for performing a sentiment analysis with R and explains how text documents can be analysed step by step.
A comparison of two political speeches illustrates a possible use case.
arXiv Detail & Related papers (2022-06-25T13:25:39Z) - Identifying and Characterizing Active Citizens who Refute Misinformation
in Social Media [25.986531330843434]
We study the task across different social media platforms (i.e., Twitter and Weibo) and languages (i.e., English and Chinese) for the first time.
We develop and make publicly available a new dataset of Weibo users mapped into one of the two categories (i.e., misinformation posters or active citizens)
We present an extensive analysis of the differences in language use between the two user categories.
arXiv Detail & Related papers (2022-04-21T13:22:48Z) - TLA: Twitter Linguistic Analysis [0.0]
TLA(Twitter Linguistic Analysis) is a framework for collecting, labeling, and analyzing data from Twitter for a corpus of languages.
We provide a basic understanding of the framework and discuss the process of collecting, labeling, and analyzing data from Twitter for a corpus of languages.
The analysis provided by TLA will also go a long way in understanding the sentiments of different linguistic communities.
arXiv Detail & Related papers (2021-07-20T18:25:48Z) - Can You be More Social? Injecting Politeness and Positivity into
Task-Oriented Conversational Agents [60.27066549589362]
Social language used by human agents is associated with greater users' responsiveness and task completion.
The model uses a sequence-to-sequence deep learning architecture, extended with a social language understanding element.
Evaluation in terms of content preservation and social language level using both human judgment and automatic linguistic measures shows that the model can generate responses that enable agents to address users' issues in a more socially appropriate way.
arXiv Detail & Related papers (2020-12-29T08:22:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.