PoliWAM: An Exploration of a Large Scale Corpus of Political Discussions
on WhatsApp Messenger
- URL: http://arxiv.org/abs/2010.13263v2
- Date: Mon, 20 Sep 2021 00:47:39 GMT
- Title: PoliWAM: An Exploration of a Large Scale Corpus of Political Discussions
on WhatsApp Messenger
- Authors: Vivek Srivastava, Mayank Singh
- Abstract summary: WhatsApp Messenger is one of the most popular channels for spreading information with a current reach of more than 180 countries and 2 billion people.
In the recent past, several countries have witnessed its effectiveness and influence in political and social campaigns.
We observe a high surge in information and propaganda flow during election campaigning.
- Score: 1.2301855531996841
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: WhatsApp Messenger is one of the most popular channels for spreading
information with a current reach of more than 180 countries and 2 billion
people. Its widespread usage has made it one of the most popular media for
information propagation among the masses during any socially engaging event. In
the recent past, several countries have witnessed its effectiveness and
influence in political and social campaigns. We observe a high surge in
information and propaganda flow during election campaigning. In this paper, we
explore a high-quality large-scale user-generated dataset curated from WhatsApp
comprising of 281 groups, 31,078 unique users, and 223,404 messages shared
before, during, and after the Indian General Elections 2019, encompassing all
major Indian political parties and leaders. In addition to the raw noisy
user-generated data, we present a fine-grained annotated dataset of 3,848
messages that will be useful to understand the various dimensions of WhatsApp
political campaigning. We present several complementary insights into the
investigative and sensational news stories from the same period. Exploratory
data analysis and experiments showcase several exciting results and future
research opportunities. To facilitate reproducible research, we make the
anonymized datasets available in the public domain.
Related papers
- Prediction of the 2023 Turkish Presidential Election Results Using
Social Media Data [0.5156484100374059]
We aim to predict the vote shares of parties participating in the 2023 elections in Turkey by combining social media data with traditional polling data.
Our approach is a volume-based approach that considers the number of social media interactions rather than content.
arXiv Detail & Related papers (2023-05-28T13:17:51Z) - Unveiling the Hidden Agenda: Biases in News Reporting and Consumption [59.55900146668931]
We build a six-year dataset on the Italian vaccine debate and adopt a Bayesian latent space model to identify narrative and selection biases.
We found a nonlinear relationship between biases and engagement, with higher engagement for extreme positions.
Analysis of news consumption on Twitter reveals common audiences among news outlets with similar ideological positions.
arXiv Detail & Related papers (2023-01-14T18:58:42Z) - Design and analysis of tweet-based election models for the 2021 Mexican
legislative election [55.41644538483948]
We use a dataset of 15 million election-related tweets in the six months preceding election day.
We find that models using data with geographical attributes determine the results of the election with better precision and accuracy than conventional polling methods.
arXiv Detail & Related papers (2023-01-02T12:40:05Z) - #Secim2023: First Public Dataset for Studying Turkish General Election [2.9080451420355344]
Secim2023 is a comprehensive dataset for social media researchers to study the upcoming Turkish election.
We provide tools to prevent online manipulation, and gather novel information to inform the public.
Using the Secim2023 dataset, researchers can examine the social and communication networks between political actors, track current trends, and investigate emerging threats to election integrity.
arXiv Detail & Related papers (2022-11-22T11:42:32Z) - A Hierarchical Network-Oriented Analysis of User Participation in
Misinformation Spread on WhatsApp [0.9774299772405469]
We present a hierarchical network-oriented characterization of the users engaged in misinformation spread on WhatsApp.
Our study offers valuable insights into how WhatsApp users leverage the underlying network connecting different groups to gain large reach in the spread of misinformation on the platform.
arXiv Detail & Related papers (2021-09-22T00:00:02Z) - Reaching the bubble may not be enough: news media role in online
political polarization [58.720142291102135]
A way of reducing polarization would be by distributing cross-partisan news among individuals with distinct political orientations.
This study investigates whether this holds in the context of nationwide elections in Brazil and Canada.
arXiv Detail & Related papers (2021-09-18T11:34:04Z) - 2020 U.S. Presidential Election: Analysis of Female and Male Users on
Twitter [8.651122862855495]
Current literature mainly focuses on analyzing the content of tweets without considering the gender of users.
This research collects and analyzes a large number of tweets posted during the 2020 U.S. presidential election.
Our findings are based upon a wide range of topics, such as tax, climate change, and the COVID-19 pandemic.
arXiv Detail & Related papers (2021-08-21T01:31:03Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - Political audience diversity and news reliability in algorithmic ranking [54.23273310155137]
We propose using the political diversity of a website's audience as a quality signal.
Using news source reliability ratings from domain experts and web browsing data from a diverse sample of 6,890 U.S. citizens, we first show that websites with more extreme and less politically diverse audiences have lower journalistic standards.
arXiv Detail & Related papers (2020-07-16T02:13:55Z) - Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms.
We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features.
We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z) - The Pushshift Telegram Dataset [1.7109522466982476]
We present a dataset from one such mobile messaging platform: Telegram.
Our dataset is made up of over 27.8K channels and 317M messages from 2.2M unique users.
arXiv Detail & Related papers (2020-01-23T10:37:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.