Detecting Human Rights Violations on Social Media during Russia-Ukraine
War
- URL: http://arxiv.org/abs/2306.05370v1
- Date: Tue, 6 Jun 2023 12:59:03 GMT
- Title: Detecting Human Rights Violations on Social Media during Russia-Ukraine
War
- Authors: Poli Nemkova, Solomon Ubani, Suleyman Olcay Polat, Nayeon Kim, Rodney
D. Nielsen
- Abstract summary: The present-day Russia-Ukraine military conflict has exposed the pivotal role of social media in enabling the transparent and unbridled sharing of information.
Social media platforms have the potential to serve as effective instruments for monitoring and documenting Human Rights Violations (HRV)
Our research focuses on the analysis of data from Telegram, the leading social media platform for reading independent news in post-Soviet regions.
- Score: 1.2599533416395763
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The present-day Russia-Ukraine military conflict has exposed the pivotal role
of social media in enabling the transparent and unbridled sharing of
information directly from the frontlines. In conflict zones where freedom of
expression is constrained and information warfare is pervasive, social media
has emerged as an indispensable lifeline. Anonymous social media platforms, as
publicly available sources for disseminating war-related information, have the
potential to serve as effective instruments for monitoring and documenting
Human Rights Violations (HRV). Our research focuses on the analysis of data
from Telegram, the leading social media platform for reading independent news
in post-Soviet regions. We gathered a dataset of posts sampled from 95 public
Telegram channels that cover politics and war news, which we have utilized to
identify potential occurrences of HRV. Employing a mBERT-based text classifier,
we have conducted an analysis to detect any mentions of HRV in the Telegram
data. Our final approach yielded an $F_2$ score of 0.71 for HRV detection,
representing an improvement of 0.38 over the multilingual BERT base model. We
release two datasets that contains Telegram posts: (1) large corpus with over
2.3 millions posts and (2) annotated at the sentence-level dataset to indicate
HRVs. The Telegram posts are in the context of the Russia-Ukraine war. We posit
that our findings hold significant implications for NGOs, governments, and
researchers by providing a means to detect and document possible human rights
violations.
Related papers
- CrediRAG: Network-Augmented Credibility-Based Retrieval for Misinformation Detection in Reddit [18.974778179375463]
We present CrediRAG, the first fake news detection model that combines language models with access to a rich external political knowledge base.
CrediRAG uses a news retriever to initially assign a misinformation score to each post based on the source credibility of similar news articles to the post title content.
We achieve 11% increase in the F1-score in detecting misinformative posts over state-of-the-art methods.
arXiv Detail & Related papers (2024-10-15T20:58:42Z) - PropaInsight: Toward Deeper Understanding of Propaganda in Terms of Techniques, Appeals, and Intent [71.20471076045916]
Propaganda plays a critical role in shaping public opinion and fueling disinformation.
Propainsight systematically dissects propaganda into techniques, arousal appeals, and underlying intent.
Propagaze combines human-annotated data with high-quality synthetic data.
arXiv Detail & Related papers (2024-09-19T06:28:18Z) - Characterizing and Detecting Propaganda-Spreading Accounts on Telegram [7.759087666892532]
Information-based attacks on social media, such as disinformation campaigns and propaganda, are emerging cybersecurity threats.
We propose a novel mechanism for detecting propaganda that capitalizes on the relationship between legitimate user messages and propaganda replies.
Our method is faster, cheaper, and has a detection rate (97.6%) 11.6 percentage points higher than human moderators after seeing only one message from an account.
arXiv Detail & Related papers (2024-06-12T11:07:27Z) - A New Task and Dataset on Detecting Attacks on Human Rights Defenders [68.45906430323156]
We propose a new dataset for detecting Attacks on Human Rights Defenders (HRDsAttack) consisting of crowdsourced annotations on 500 online news articles.
The annotations include fine-grained information about the type and location of the attacks, as well as information about the victim(s)
We demonstrate the usefulness of the dataset by using it to train and evaluate baseline models on several sub-tasks to predict the annotated characteristics.
arXiv Detail & Related papers (2023-06-30T14:20:06Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - Automated multilingual detection of Pro-Kremlin propaganda in newspapers
and Telegram posts [5.886782001771578]
The full-scale conflict between the Russian Federation and Ukraine generated an unprecedented amount of news articles and social media data.
This study analyses how the media affected and mirrored public opinion during the first month of the war using news articles and Telegram news channels in Ukrainian, Russian, Romanian and English.
We propose and compare two methods of multilingual automated pro-Kremlin propaganda identification, based on Transformers and linguistic features.
arXiv Detail & Related papers (2023-01-25T14:25:37Z) - Nothing Stands Alone: Relational Fake News Detection with Hypergraph
Neural Networks [49.29141811578359]
We propose to leverage a hypergraph to represent group-wise interaction among news, while focusing on important news relations with its dual-level attention mechanism.
Our approach yields remarkable performance and maintains the high performance even with a small subset of labeled news data.
arXiv Detail & Related papers (2022-12-24T00:19:32Z) - VoynaSlov: A Data Set of Russian Social Media Activity during the 2022
Ukraine-Russia War [36.18151945028956]
We describe a new data set called VoynaSlov which contains 21M+ Russian-language social media activities.
We scraped the data from two major platforms that are widely used in Russia: Twitter and VKontakte (VK), a Russian social media platform based in Saint Petersburg commonly referred to as "Russian Facebook"
The main differences that distinguish our data from previously released data related to the ongoing war are its focus on Russian media and consideration of state-affiliation.
arXiv Detail & Related papers (2022-05-24T21:59:10Z) - Twitter Dataset on the Russo-Ukrainian War [68.713984286035]
We have initiated an ongoing dataset acquisition from Twitter API.
The dataset has reached the amount of 57.3 million tweets, originating from 7.7 million users.
We apply an initial volume and sentiment analysis, while the dataset can be used to further exploratory investigation towards topic analysis, hate speech, propaganda recognition, or even show potential malicious entities like botnets.
arXiv Detail & Related papers (2022-04-07T12:33:06Z) - FinnSentiment -- A Finnish Social Media Corpus for Sentiment Polarity
Annotation [1.5039745292757671]
There is no large-scale social media data set with sentiment polarity annotations for Finnish.
We introduce a 27,000 sentence data set annotated independently with sentiment polarity by three native annotators.
We analyse their inter-annotator agreement and provide two baselines to validate the usefulness of the data set.
arXiv Detail & Related papers (2020-12-04T14:17:46Z) - Leveraging Multi-Source Weak Social Supervision for Early Detection of
Fake News [67.53424807783414]
Social media has greatly enabled people to participate in online activities at an unprecedented rate.
This unrestricted access also exacerbates the spread of misinformation and fake news online which might cause confusion and chaos unless being detected early for its mitigation.
We jointly leverage the limited amount of clean data along with weak signals from social engagements to train deep neural networks in a meta-learning framework to estimate the quality of different weak instances.
Experiments on realworld datasets demonstrate that the proposed framework outperforms state-of-the-art baselines for early detection of fake news without using any user engagements at prediction time.
arXiv Detail & Related papers (2020-04-03T18:26:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.