Understanding Political Polarisation using Language Models: A dataset
and method
- URL: http://arxiv.org/abs/2301.00891v1
- Date: Mon, 2 Jan 2023 22:15:04 GMT
- Title: Understanding Political Polarisation using Language Models: A dataset
and method
- Authors: Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo
- Abstract summary: This paper aims to analyze political polarization in US political system using Language Models.
Our main contributions are a dataset extracted from Wikipedia that spans the past 120 years.
- Score: 18.40891764492186
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Our paper aims to analyze political polarization in US political system using
Language Models, and thereby help candidates make an informed decision. The
availability of this information will help voters understand their candidates
views on the economy, healthcare, education and other social issues. Our main
contributions are a dataset extracted from Wikipedia that spans the past 120
years and a Language model based method that helps analyze how polarized a
candidate is. Our data is divided into 2 parts, background information and
political information about a candidate, since our hypothesis is that the
political views of a candidate should be based on reason and be independent of
factors such as birthplace, alma mater, etc. We further split this data into 4
phases chronologically, to help understand if and how the polarization amongst
candidates changes. This data has been cleaned to remove biases. To understand
the polarization we begin by showing results from some classical language
models in Word2Vec and Doc2Vec. And then use more powerful techniques like the
Longformer, a transformer based encoder, to assimilate more information and
find the nearest neighbors of each candidate based on their political view and
their background.
Related papers
- Representation Bias in Political Sample Simulations with Large Language Models [54.48283690603358]
This study seeks to identify and quantify biases in simulating political samples with Large Language Models.
Using the GPT-3.5-Turbo model, we leverage data from the American National Election Studies, German Longitudinal Election Study, Zuobiao dataset, and China Family Panel Studies.
arXiv Detail & Related papers (2024-07-16T05:52:26Z) - L(u)PIN: LLM-based Political Ideology Nowcasting [1.124958340749622]
We present a method to analyze ideological positions of individual parliamentary representatives by leveraging the latent knowledge of LLMs.
The method allows us to evaluate the stance of politicians on an axis of our choice allowing us to flexibly measure the stance of politicians in regards to a topic/controversy of our choice.
arXiv Detail & Related papers (2024-05-12T16:14:07Z) - Whose Side Are You On? Investigating the Political Stance of Large Language Models [56.883423489203786]
We investigate the political orientation of Large Language Models (LLMs) across a spectrum of eight polarizing topics.
Our investigation delves into the political alignment of LLMs across a spectrum of eight polarizing topics, spanning from abortion to LGBTQ issues.
The findings suggest that users should be mindful when crafting queries, and exercise caution in selecting neutral prompt language.
arXiv Detail & Related papers (2024-03-15T04:02:24Z) - Mapping Election Polarization and Competitiveness using Election Results [0.18641315013048293]
We argue that voting patterns can lead to mapping effective proxies of citizen divisions on election day.
This paper perspectives two complementary concepts, Election Polarization (EP) and Election Competitiveness (EC)
We present an approach that relies solely on election data and validate it using synthetic and real-world election data across 13 countries in the Eurozone, North America, Latin America, and New Zealand.
arXiv Detail & Related papers (2023-08-16T18:06:00Z) - Detecting Political Biases of Named Entities and Hashtags on Twitter [28.02430167720734]
Ideological divisions in the United States have become increasingly prominent in daily communication.
By detecting political biases in a corpus of text, one can attempt to describe and discern the polarity of that text.
We propose the Polarity-aware Embedding Multi-task learning model.
arXiv Detail & Related papers (2022-09-16T18:00:13Z) - Millions of Co-purchases and Reviews Reveal the Spread of Polarization
and Lifestyle Politics across Online Markets [68.8204255655161]
We study the pervasiveness of polarization and lifestyle politics over different product segments in a diverse market.
We sample 234.6 million relations among 21.8 million market entities to find product categories that are politically relevant, aligned, and polarized.
Cultural products are 4 times more polarized than any other segment.
arXiv Detail & Related papers (2022-01-17T18:16:37Z) - Shifting Polarization and Twitter News Influencers between two U.S.
Presidential Elections [92.33485580547801]
We analyze the change of polarization between the 2016 and 2020 U.S. presidential elections.
Most of the top influencers were affiliated with media organizations during both elections.
75% of the top influencers in 2020 were not present in 2016, demonstrating that such status is difficult to retain.
arXiv Detail & Related papers (2021-11-03T20:08:54Z) - Reaching the bubble may not be enough: news media role in online
political polarization [58.720142291102135]
A way of reducing polarization would be by distributing cross-partisan news among individuals with distinct political orientations.
This study investigates whether this holds in the context of nationwide elections in Brazil and Canada.
arXiv Detail & Related papers (2021-09-18T11:34:04Z) - Exploring Polarization of Users Behavior on Twitter During the 2019
South American Protests [15.065938163384235]
We explore polarization on Twitter in a different context, namely the protest that paralyzed several countries in the South American region in 2019.
By leveraging users' endorsement of politicians' tweets and hashtag campaigns with defined stances towards the protest (for or against), we construct a weakly labeled stance dataset with millions of users.
We find empirical evidence of the "filter bubble" phenomenon during the event, as we not only show that the user bases are homogeneous in terms of stance, but the probability that a user transitions from media of different clusters is low.
arXiv Detail & Related papers (2021-04-05T07:13:18Z) - Political Posters Identification with Appearance-Text Fusion [49.55696202606098]
We propose a method that efficiently utilizes appearance features and text vectors to accurately classify political posters.
The majority of this work focuses on political posters that are designed to serve as a promotion of a certain political event.
arXiv Detail & Related papers (2020-12-19T16:14:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.