Happenstance: Utilizing Semantic Search to Track Russian State Media
Narratives about the Russo-Ukrainian War On Reddit
- URL: http://arxiv.org/abs/2205.14484v3
- Date: Tue, 30 May 2023 20:45:34 GMT
- Title: Happenstance: Utilizing Semantic Search to Track Russian State Media
Narratives about the Russo-Ukrainian War On Reddit
- Authors: Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric
- Abstract summary: We study Russian state media narratives touted by the Russian government to English-speaking audiences.
We first perform sentence-level topic analysis using the large-language model MPNet on articles published by ten different pro-Russian propaganda websites.
Using MPNet and a semantic search algorithm, we map these subreddits' comments to the set of topics extracted from our set of Russian websites.
- Score: 5.567674129101803
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the buildup to and in the weeks following the Russian Federation's
invasion of Ukraine, Russian state media outlets output torrents of misleading
and outright false information. In this work, we study this coordinated
information campaign in order to understand the most prominent state media
narratives touted by the Russian government to English-speaking audiences. To
do this, we first perform sentence-level topic analysis using the
large-language model MPNet on articles published by ten different pro-Russian
propaganda websites including the new Russian "fact-checking" website
waronfakes.com. Within this ecosystem, we show that smaller websites like
katehon.com were highly effective at publishing topics that were later echoed
by other Russian sites. After analyzing this set of Russian information
narratives, we then analyze their correspondence with narratives and topics of
discussion on the r/Russia and 10 other political subreddits. Using MPNet and a
semantic search algorithm, we map these subreddits' comments to the set of
topics extracted from our set of Russian websites, finding that 39.6% of
r/Russia comments corresponded to narratives from pro-Russian propaganda
websites compared to 8.86% on r/politics.
Related papers
- A Study of Partisan News Sharing in the Russian invasion of Ukraine [31.211851388657152]
Since the Russian invasion of Ukraine, a large volume of biased and partisan news has been spread via social media platforms.
We aim to characterize the role of such sharing in influencing users' communications.
We build a predictive model to identify users likely to spread partisan news.
arXiv Detail & Related papers (2023-11-26T13:25:11Z) - Russo-Ukrainian War: Prediction and explanation of Twitter suspension [47.61306219245444]
This study focuses on the Twitter suspension mechanism and the analysis of shared content and features of user accounts that may lead to this.
We have obtained a dataset containing 107.7M tweets, originating from 9.8 million users, using Twitter API.
Our results reveal scam campaigns taking advantage of trending topics regarding the Russia-Ukrainian conflict for Bitcoin fraud, spam, and advertisement campaigns.
arXiv Detail & Related papers (2023-06-06T08:41:02Z) - Partial Mobilization: Tracking Multilingual Information Flows Amongst Russian Media Outlets and Telegram [5.161088104035108]
We study how 16 Russian media outlets interacted with and utilized 732 Telegram channels throughout 2022.
We show that news outlets not only propagate existing narratives through Telegram but that they source material from the messaging platform.
For example, across the websites in our study, between 2.3% (ura.news) and 26.7% (ukraina.ru) of articles discussed content that originated/resulted from activity on Telegram.
arXiv Detail & Related papers (2023-01-25T22:27:40Z) - Fake News and Hate Speech: Language in Common [73.4764550713355]
We compute a novel index, the ingroup vs outgroup index, in three different datasets and show that both phenomena share an "us vs them" narrative.
arXiv Detail & Related papers (2022-12-05T15:35:10Z) - "A Special Operation": A Quantitative Approach to Dissecting and
Comparing Different Media Ecosystems' Coverage of the Russo-Ukrainian War [5.567674129101803]
The coverage of the Russian invasion of Ukraine has varied widely between Western, Russian, and Chinese media ecosystems.
We find that while the Western press outlets have focused on the military and humanitarian aspects of the war, Russian media have focused on the purported justifications for the "special military operation"
We measure the degree to which Russian media has influenced Chinese coverage across Chinese outlets' news articles, Weibo accounts, and Twitter accounts.
arXiv Detail & Related papers (2022-10-06T16:04:19Z) - VoynaSlov: A Data Set of Russian Social Media Activity during the 2022
Ukraine-Russia War [36.18151945028956]
We describe a new data set called VoynaSlov which contains 21M+ Russian-language social media activities.
We scraped the data from two major platforms that are widely used in Russia: Twitter and VKontakte (VK), a Russian social media platform based in Saint Petersburg commonly referred to as "Russian Facebook"
The main differences that distinguish our data from previously released data related to the ongoing war are its focus on Russian media and consideration of state-affiliation.
arXiv Detail & Related papers (2022-05-24T21:59:10Z) - Twitter Dataset on the Russo-Ukrainian War [68.713984286035]
We have initiated an ongoing dataset acquisition from Twitter API.
The dataset has reached the amount of 57.3 million tweets, originating from 7.7 million users.
We apply an initial volume and sentiment analysis, while the dataset can be used to further exploratory investigation towards topic analysis, hate speech, propaganda recognition, or even show potential malicious entities like botnets.
arXiv Detail & Related papers (2022-04-07T12:33:06Z) - The Rise and Fall of Fake News sites: A Traffic Analysis [62.51737815926007]
We investigate the online presence of fake news websites and characterize their behavior in comparison to real news websites.
Based on our findings, we build a content-agnostic ML for automatic detection of fake news websites.
arXiv Detail & Related papers (2021-03-16T18:10:22Z) - Russian trolls speaking Russian: Regional Twitter operations and MH17 [68.8204255655161]
In 2018, Twitter released data on accounts identified as Russian trolls.
We analyze the Russian-language operations of these trolls.
We find that trolls' information campaign on the MH17 crash was the largest in terms of tweet count.
arXiv Detail & Related papers (2020-05-13T19:48:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.