Embeddings-Based Clustering for Target Specific Stances: The Case of a
Polarized Turkey
- URL: http://arxiv.org/abs/2005.09649v2
- Date: Thu, 24 Feb 2022 05:07:52 GMT
- Title: Embeddings-Based Clustering for Target Specific Stances: The Case of a
Polarized Turkey
- Authors: Ammar Rashed, Mucahid Kutlu, Kareem Darwish, Tamer Elsayed, Cans{\i}n
Bayrak
- Abstract summary: We present an unsupervised method for target-specific stance detection in a polarized setting, specifically Turkish politics.
We show the effectiveness of our method in properly clustering users of divergent groups across multiple targets.
We perform our analysis on a large dataset of 108M Turkish election-related tweets along with the timeline tweets of 168k Turkish users.
- Score: 6.130136112098865
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: On June 24, 2018, Turkey conducted a highly consequential election in which
the Turkish people elected their president and parliament in the first election
under a new presidential system. During the election period, the Turkish people
extensively shared their political opinions on Twitter. One aspect of
polarization among the electorate was support for or opposition to the
reelection of Recep Tayyip Erdo\u{g}an. In this paper, we present an
unsupervised method for target-specific stance detection in a polarized
setting, specifically Turkish politics, achieving 90% precision in identifying
user stances, while maintaining more than 80% recall. The method involves
representing users in an embedding space using Google's Convolutional Neural
Network (CNN) based multilingual universal sentence encoder. The
representations are then projected onto a lower dimensional space in a manner
that reflects similarities and are consequently clustered. We show the
effectiveness of our method in properly clustering users of divergent groups
across multiple targets that include political figures, different groups, and
parties. We perform our analysis on a large dataset of 108M Turkish
election-related tweets along with the timeline tweets of 168k Turkish users,
who authored 213M tweets. Given the resultant user stances, we are able to
observe correlations between topics and compute topic polarization.
Related papers
- On the Use of Proxies in Political Ad Targeting [49.61009579554272]
We show that major political advertisers circumvented mitigations by targeting proxy attributes.
Our findings have crucial implications for the ongoing discussion on the regulation of political advertising.
arXiv Detail & Related papers (2024-10-18T17:15:13Z) - Electoral Agitation Data Set: The Use Case of the Polish Election [3.671887117122512]
We present the first publicly open data set for detecting electoral agitation in the Polish language.
It contains 6,112 human-annotated tweets tagged with four legally conditioned categories.
The newly created data set was used to fine-tune a Polish Language Model called HerBERT.
arXiv Detail & Related papers (2023-07-13T18:14:43Z) - Computational Assessment of Hyperpartisanship in News Titles [55.92100606666497]
We first adopt a human-guided machine learning framework to develop a new dataset for hyperpartisan news title detection.
Overall the Right media tends to use proportionally more hyperpartisan titles.
We identify three major topics including foreign issues, political systems, and societal issues that are suggestive of hyperpartisanship in news titles.
arXiv Detail & Related papers (2023-01-16T05:56:58Z) - Design and analysis of tweet-based election models for the 2021 Mexican
legislative election [55.41644538483948]
We use a dataset of 15 million election-related tweets in the six months preceding election day.
We find that models using data with geographical attributes determine the results of the election with better precision and accuracy than conventional polling methods.
arXiv Detail & Related papers (2023-01-02T12:40:05Z) - Does Twitter know your political views? POLiTweets dataset and
semi-automatic method for political leaning discovery [0.0]
POLiTweets is the first publicly open Polish dataset for political affiliation discovery in a multiparty setup.
It consists of over 147k tweets from almost 10k Polish-writing users annotatedally and almost 40k tweets from 166 users annotated manually as a test set.
We used our data to study the aspects of domain shift in the context of topics and the type of content writers - ordinary citizens vs. professional politicians.
arXiv Detail & Related papers (2022-06-14T10:28:23Z) - Shifting Polarization and Twitter News Influencers between two U.S.
Presidential Elections [92.33485580547801]
We analyze the change of polarization between the 2016 and 2020 U.S. presidential elections.
Most of the top influencers were affiliated with media organizations during both elections.
75% of the top influencers in 2020 were not present in 2016, demonstrating that such status is difficult to retain.
arXiv Detail & Related papers (2021-11-03T20:08:54Z) - Reaching the bubble may not be enough: news media role in online
political polarization [58.720142291102135]
A way of reducing polarization would be by distributing cross-partisan news among individuals with distinct political orientations.
This study investigates whether this holds in the context of nationwide elections in Brazil and Canada.
arXiv Detail & Related papers (2021-09-18T11:34:04Z) - Mundus vult decipi, ergo decipiatur: Visual Communication of Uncertainty
in Election Polls [56.8172499765118]
We discuss potential sources of bias in nowcasting and forecasting.
Concepts are presented to attenuate the issue of falsely perceived accuracy.
One key idea is the use of Probabilities of Events instead of party shares.
arXiv Detail & Related papers (2021-04-28T07:02:24Z) - Exploring Polarization of Users Behavior on Twitter During the 2019
South American Protests [15.065938163384235]
We explore polarization on Twitter in a different context, namely the protest that paralyzed several countries in the South American region in 2019.
By leveraging users' endorsement of politicians' tweets and hashtag campaigns with defined stances towards the protest (for or against), we construct a weakly labeled stance dataset with millions of users.
We find empirical evidence of the "filter bubble" phenomenon during the event, as we not only show that the user bases are homogeneous in terms of stance, but the probability that a user transitions from media of different clusters is low.
arXiv Detail & Related papers (2021-04-05T07:13:18Z) - Political Advertising Dataset: the use case of the Polish 2020
Presidential Elections [4.560033258611709]
We present the first publicly open dataset for detecting specific text chunks and categories of political advertising in the Polish language.
It contains 1,705 human-annotated tweets tagged with nine categories, which constitute campaigning under Polish electoral law.
arXiv Detail & Related papers (2020-06-17T23:58:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.