Twitter Dataset on the Russo-Ukrainian War
- URL: http://arxiv.org/abs/2204.08530v1
- Date: Thu, 7 Apr 2022 12:33:06 GMT
- Title: Twitter Dataset on the Russo-Ukrainian War
- Authors: Alexander Shevtsov, Christos Tzagkarakis, Despoina Antonakaki,
Polyvios Pratikakis, Sotiris Ioannidis
- Abstract summary: We have initiated an ongoing dataset acquisition from Twitter API.
The dataset has reached the amount of 57.3 million tweets, originating from 7.7 million users.
We apply an initial volume and sentiment analysis, while the dataset can be used to further exploratory investigation towards topic analysis, hate speech, propaganda recognition, or even show potential malicious entities like botnets.
- Score: 68.713984286035
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: On 24 February 2022, Russia invaded Ukraine, also known now as
Russo-Ukrainian War. We have initiated an ongoing dataset acquisition from
Twitter API. Until the day this paper was written the dataset has reached the
amount of 57.3 million tweets, originating from 7.7 million users. We apply an
initial volume and sentiment analysis, while the dataset can be used to further
exploratory investigation towards topic analysis, hate speech, propaganda
recognition, or even show potential malicious entities like botnets.
Related papers
- Russo-Ukrainian War: Prediction and explanation of Twitter suspension [47.61306219245444]
This study focuses on the Twitter suspension mechanism and the analysis of shared content and features of user accounts that may lead to this.
We have obtained a dataset containing 107.7M tweets, originating from 9.8 million users, using Twitter API.
Our results reveal scam campaigns taking advantage of trending topics regarding the Russia-Ukrainian conflict for Bitcoin fraud, spam, and advertisement campaigns.
arXiv Detail & Related papers (2023-06-06T08:41:02Z) - MiDe22: An Annotated Multi-Event Tweet Dataset for Misinformation Detection [4.799822253865053]
We construct a new human-annotated dataset, called MiDe22, having 5,284 English and 5,064 Turkish tweets with their misinformation labels.
The dataset includes user engagements with the tweets in terms of likes, replies, retweets, and quotes.
arXiv Detail & Related papers (2022-10-11T12:25:26Z) - VoynaSlov: A Data Set of Russian Social Media Activity during the 2022
Ukraine-Russia War [36.18151945028956]
We describe a new data set called VoynaSlov which contains 21M+ Russian-language social media activities.
We scraped the data from two major platforms that are widely used in Russia: Twitter and VKontakte (VK), a Russian social media platform based in Saint Petersburg commonly referred to as "Russian Facebook"
The main differences that distinguish our data from previously released data related to the ongoing war are its focus on Russian media and consideration of state-affiliation.
arXiv Detail & Related papers (2022-05-24T21:59:10Z) - Manipulating Twitter Through Deletions [64.33261764633504]
Research into influence campaigns on Twitter has mostly relied on identifying malicious activities from tweets obtained via public APIs.
Here, we provide the first exhaustive, large-scale analysis of anomalous deletion patterns involving more than a billion deletions by over 11 million accounts.
We find that a small fraction of accounts delete a large number of tweets daily.
First, limits on tweet volume are circumvented, allowing certain accounts to flood the network with over 26 thousand daily tweets.
Second, coordinated networks of accounts engage in repetitive likes and unlikes of content that is eventually deleted, which can manipulate ranking algorithms.
arXiv Detail & Related papers (2022-03-25T20:07:08Z) - Tweets in Time of Conflict: A Public Dataset Tracking the Twitter
Discourse on the War Between Ukraine and Russia [14.000779544058144]
On February 24, 2022, Russia invaded Ukraine. In the days that followed, reports kept flooding in from layman to news anchors of a conflict quickly escalating into war.
Russia faced immediate backlash and condemnation from the world at large.
While the war continues to contribute to an ongoing humanitarian and refugee crisis in Ukraine, a second battlefield has emerged in the online space.
arXiv Detail & Related papers (2022-03-14T20:52:02Z) - A Weibo Dataset for the 2022 Russo-Ukrainian Crisis [59.258530429699924]
We present the Russia-Ukraine Crisis Weibo dataset, with over 3.5M user posts and comments in the first release.
Our data is available at https://github.com/yrf1/Russia-Ukraine_weibo_dataset.
arXiv Detail & Related papers (2022-03-09T19:06:04Z) - BLM-17m: A Large-Scale Dataset for Black Lives Matter Topic Detection on
Twitter [25.881740515679393]
We propose a labeled dataset for topic detection that contains 17 million tweets.
These Tweets are collected from 25 May 2020 to 21 August 2020 that covers 89 days from start of this incident.
arXiv Detail & Related papers (2021-05-04T07:27:42Z) - Russian trolls speaking Russian: Regional Twitter operations and MH17 [68.8204255655161]
In 2018, Twitter released data on accounts identified as Russian trolls.
We analyze the Russian-language operations of these trolls.
We find that trolls' information campaign on the MH17 crash was the largest in terms of tweet count.
arXiv Detail & Related papers (2020-05-13T19:48:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.