Identifying and Characterizing Active Citizens who Refute Misinformation
in Social Media
- URL: http://arxiv.org/abs/2204.10080v1
- Date: Thu, 21 Apr 2022 13:22:48 GMT
- Title: Identifying and Characterizing Active Citizens who Refute Misinformation
in Social Media
- Authors: Yida Mu and Pu Niu and Nikolaos Aletras
- Abstract summary: We study the task across different social media platforms (i.e., Twitter and Weibo) and languages (i.e., English and Chinese) for the first time.
We develop and make publicly available a new dataset of Weibo users mapped into one of the two categories (i.e., misinformation posters or active citizens)
We present an extensive analysis of the differences in language use between the two user categories.
- Score: 25.986531330843434
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The phenomenon of misinformation spreading in social media has developed a
new form of active citizens who focus on tackling the problem by refuting posts
that might contain misinformation. Automatically identifying and characterizing
the behavior of such active citizens in social media is an important task in
computational social science for complementing studies in misinformation
analysis. In this paper, we study this task across different social media
platforms (i.e., Twitter and Weibo) and languages (i.e., English and Chinese)
for the first time. To this end, (1) we develop and make publicly available a
new dataset of Weibo users mapped into one of the two categories (i.e.,
misinformation posters or active citizens); (2) we evaluate a battery of
supervised models on our new Weibo dataset and an existing Twitter dataset
which we repurpose for the task; and (3) we present an extensive analysis of
the differences in language use between the two user categories.
Related papers
- Modeling Political Orientation of Social Media Posts: An Extended
Analysis [0.0]
Developing machine learning models to characterize political polarization on online social media presents significant challenges.
These challenges mainly stem from various factors such as the lack of annotated data, presence of noise in social media datasets, and the sheer volume of data.
We introduce two methods that leverage on news media bias and post content to label social media posts.
We demonstrate that current machine learning models can exhibit improved performance in predicting political orientation of social media posts.
arXiv Detail & Related papers (2023-11-21T03:34:20Z) - Countering Misinformation via Emotional Response Generation [15.383062216223971]
proliferation of misinformation on social media platforms (SMPs) poses a significant danger to public health, social cohesion and democracy.
Previous research has shown how social correction can be an effective way to curb misinformation.
We present VerMouth, the first large-scale dataset comprising roughly 12 thousand claim-response pairs.
arXiv Detail & Related papers (2023-11-17T15:37:18Z) - Simulating Social Media Using Large Language Models to Evaluate
Alternative News Feed Algorithms [8.602553195689513]
Social media is often criticized for amplifying toxic discourse and discouraging constructive conversations.
This paper asks whether simulating social media can help researchers study how different news feed algorithms shape the quality of online conversations.
arXiv Detail & Related papers (2023-10-05T18:26:06Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - A Weibo Dataset for the 2022 Russo-Ukrainian Crisis [59.258530429699924]
We present the Russia-Ukraine Crisis Weibo dataset, with over 3.5M user posts and comments in the first release.
Our data is available at https://github.com/yrf1/Russia-Ukraine_weibo_dataset.
arXiv Detail & Related papers (2022-03-09T19:06:04Z) - Misinformation Detection in Social Media Video Posts [0.4724825031148411]
Short-form video by social media platforms has become a critical challenge for social media providers.
We develop methods to detect misinformation in social media posts, exploiting modalities such as video and text.
We collect 160,000 video posts from Twitter, and leverage self-supervised learning to learn expressive representations of joint visual and textual data.
arXiv Detail & Related papers (2022-02-15T20:14:54Z) - Social Analysis of Young Basque Speaking Communities in Twitter [0.9445512376558136]
We take into account both social and linguistic aspects to perform demographic analysis by processing a large amount of tweets in Basque language.
The study of demographic characteristics and social relationships are approached by applying machine learning and modern deep-learning Natural Language Processing (NLP) techniques.
arXiv Detail & Related papers (2021-09-08T08:19:08Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - Can You be More Social? Injecting Politeness and Positivity into
Task-Oriented Conversational Agents [60.27066549589362]
Social language used by human agents is associated with greater users' responsiveness and task completion.
The model uses a sequence-to-sequence deep learning architecture, extended with a social language understanding element.
Evaluation in terms of content preservation and social language level using both human judgment and automatic linguistic measures shows that the model can generate responses that enable agents to address users' issues in a more socially appropriate way.
arXiv Detail & Related papers (2020-12-29T08:22:48Z) - I Know Where You Are Coming From: On the Impact of Social Media Sources
on AI Model Performance [79.05613148641018]
We will study the performance of different machine learning models when being learned on multi-modal data from different social networks.
Our initial experimental results reveal that social network choice impacts the performance.
arXiv Detail & Related papers (2020-02-05T11:10:44Z) - Mining Disinformation and Fake News: Concepts, Methods, and Recent
Advancements [55.33496599723126]
disinformation including fake news has become a global phenomenon due to its explosive growth.
Despite the recent progress in detecting disinformation and fake news, it is still non-trivial due to its complexity, diversity, multi-modality, and costs of fact-checking or annotation.
arXiv Detail & Related papers (2020-01-02T21:01:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.