TweetBLM: A Hate Speech Dataset and Analysis of Black Lives
Matter-related Microblogs on Twitter
- URL: http://arxiv.org/abs/2108.12521v1
- Date: Fri, 27 Aug 2021 22:47:02 GMT
- Title: TweetBLM: A Hate Speech Dataset and Analysis of Black Lives
Matter-related Microblogs on Twitter
- Authors: Sumit Kumar, Raj Ratn Pranesh
- Abstract summary: We have proposed a Black Lives Matter related tweet hate speech dataset TweetBLM.
Our dataset comprises 9165 manually annotated tweets that target the Black Lives Matter movement.
- Score: 2.741266294612776
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the past few years, there has been a significant rise in toxic and hateful
content on various social media platforms. Recently Black Lives Matter movement
came into the picture, causing an avalanche of user generated responses on the
internet. In this paper, we have proposed a Black Lives Matter related tweet
hate speech dataset TweetBLM. Our dataset comprises 9165 manually annotated
tweets that target the Black Lives Matter movement. We annotated the tweets
into two classes, i.e., HATE and NONHATE based on their content related to
racism erupted from the movement for the black community. In this work, we also
generated useful statistical insights on our dataset and performed a systematic
analysis of various machine learning models such as Random Forest, CNN, LSTM,
BiLSTM, Fasttext, BERTbase, and BERTlarge for the classification task on our
dataset. Through our work, we aim at contributing to the substantial efforts of
the research community for the identification and mitigation of hate speech on
the internet. The dataset is publicly available.
Related papers
- What are People Talking about in #BlackLivesMatter and #StopAsianHate?
Exploring and Categorizing Twitter Topics Emerging in Online Social Movements
through the Latent Dirichlet Allocation Model [27.53788299995914]
Black Lives Matter (BLM) and Stop Asian Hate (SAH) are two successful social movements that have spread on Twitter.
This study adopts a mixed-methods approach to comprehensively analyze BLM and SAH Twitter topics.
We collected more than one million tweets with the #blacklivesmatter and #stopasianhate hashtags and compared their topics.
arXiv Detail & Related papers (2022-05-29T17:29:40Z) - DisinfoMeme: A Multimodal Dataset for Detecting Meme Intentionally
Spreading Out Disinformation [72.18912216025029]
We present DisinfoMeme to help detect disinformation memes.
The dataset contains memes mined from Reddit covering three current topics: the COVID-19 pandemic, the Black Lives Matter movement, and veganism/vegetarianism.
arXiv Detail & Related papers (2022-05-25T09:54:59Z) - What Truly Matters? Using Linguistic Cues for Analyzing the
#BlackLivesMatter Movement and its Counter Protests: 2013 to 2020 [31.670509436139735]
There has been a significant increase in digital activism addressing police-brutality related and racially-motivated incidents in the U.S.
We use social media as an authoritative tool to examine and analyze the linguistic cues and thematic relationships in these three mediums.
Our results show that excessive use of racially-related or prejudicial hashtags were used by the counter protests which portray potential discriminatory tendencies.
arXiv Detail & Related papers (2021-09-20T18:34:30Z) - Out of the Shadows: Analyzing Anonymous' Twitter Resurgence during the
2020 Black Lives Matter Protests [6.510061176722249]
Recently, there had been little notable activity from the once prominent hacktivist group, Anonymous.
The group, responsible for activist-based cyber attacks on major businesses and governments, appeared to have fragmented after key members were arrested in 2013.
In response to the major Black Lives Matter protests, however, reports indicated that the group was back.
arXiv Detail & Related papers (2021-07-22T10:18:32Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - hBert + BiasCorp -- Fighting Racism on the Web [58.768804813646334]
We are releasing BiasCorp, a dataset containing 139,090 comments and news segment from three specific sources - Fox News, BreitbartNews and YouTube.
In this work, we present hBERT, where we modify certain layers of the pretrained BERT model with the new Hopfield Layer.
We are also releasing a JavaScript library and a Chrome Extension Application, to help developers make use of our trained model in web applications.
arXiv Detail & Related papers (2021-04-06T02:17:20Z) - Hate is the New Infodemic: A Topic-aware Modeling of Hate Speech
Diffusion on Twitter [24.94135874070525]
We focus on exploring user behaviour, which triggers the genesis of hate speech on Twitter.
We crawl a large-scale dataset of tweets, retweets, user activity history, and follower networks.
We characterize different signals of information that govern these dynamics.
arXiv Detail & Related papers (2020-10-09T05:43:08Z) - Twitter Corpus of the #BlackLivesMatter Movement And Counter Protests:
2013 to 2021 [3.026131612560646]
Black Lives Matter (BLM) is a decentralized social movement protesting violence against Black individuals and communities.
The #BlackLivesMatter social media hashtag has come to represent the grassroots movement.
Similar hashtags counter protesting the BLM movement, such as #AllLivesMatter, and #BlueLivesMatter.
arXiv Detail & Related papers (2020-09-01T17:37:39Z) - Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media
during the COVID-19 Crisis [51.39895377836919]
COVID-19 has sparked racism and hate on social media targeted towards Asian communities.
We study the evolution and spread of anti-Asian hate speech through the lens of Twitter.
We create COVID-HATE, the largest dataset of anti-Asian hate and counterspeech spanning 14 months.
arXiv Detail & Related papers (2020-05-25T21:58:09Z) - Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms.
We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features.
We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z) - #MeToo on Campus: Studying College Sexual Assault at Scale Using Data
Reported on Social Media [71.74529365205053]
We analyze the influence of the # trend on a pool of college followers.
The results show that the majority of topics embedded in those # tweets detail sexual harassment stories.
There exists a significant correlation between the prevalence of this trend and official reports on several major geographical regions.
arXiv Detail & Related papers (2020-01-16T18:05:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.