Measuring and Characterizing Hate Speech on News Websites
- URL: http://arxiv.org/abs/2005.07926v1
- Date: Sat, 16 May 2020 09:59:01 GMT
- Title: Measuring and Characterizing Hate Speech on News Websites
- Authors: Savvas Zannettou, Mai ElSherief, Elizabeth Belding, Shirin Nilizadeh,
Gianluca Stringhini
- Abstract summary: We analyze 125M comments posted on 412K news articles over the course of 19 months.
We find statistically significant increases in hateful commenting activity around real-world divisive events like the "Unite the Right" rally in Charlottesville.
We find that articles that attract a substantial number of hateful comments have different linguistic characteristics when compared to articles that do not attract hateful comments.
- Score: 13.289076063197466
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Web has become the main source for news acquisition. At the same time,
news discussion has become more social: users can post comments on news
articles or discuss news articles on other platforms like Reddit. These
features empower and enable discussions among the users; however, they also act
as the medium for the dissemination of toxic discourse and hate speech. The
research community lacks a general understanding on what type of content
attracts hateful discourse and the possible effects of social networks on the
commenting activity on news articles. In this work, we perform a large-scale
quantitative analysis of 125M comments posted on 412K news articles over the
course of 19 months. We analyze the content of the collected articles and their
comments using temporal analysis, user-based analysis, and linguistic analysis,
to shed light on what elements attract hateful comments on news articles. We
also investigate commenting activity when an article is posted on either
4chan's Politically Incorrect board (/pol/) or six selected subreddits. We find
statistically significant increases in hateful commenting activity around
real-world divisive events like the "Unite the Right" rally in Charlottesville
and political events like the second and third 2016 US presidential debates.
Also, we find that articles that attract a substantial number of hateful
comments have different linguistic characteristics when compared to articles
that do not attract hateful comments. Furthermore, we observe that the post of
a news articles on either /pol/ or the six subreddits is correlated with an
increase of (hateful) commenting activity on the news articles.
Related papers
- Exploring the topics, sentiments and hate speech in the Spanish information environment [0.0]
This study examines the topics, sentiments, and hate prevalence in 337,807 response messages (website comments and tweets) to news from five Spanish media outlets.
Content is mainly negative (62.7%) and neutral (28.57%), with low positivity (8.73%)
Despite low levels of hate speech (3.98%), the study confirms high toxicity in online responses to social and political topics.
arXiv Detail & Related papers (2024-09-19T11:19:44Z) - Toxic Synergy Between Hate Speech and Fake News Exposure [2.190432422548697]
We study the correlation between exposure to news from low-credibility sources through following connections and the use of hate speech on Twitter.
We find that hate speakers are exposed to lower percentages of posts linking to credible news sources.
While hate speech is associated with low-credibility news from partisan sources, we find that those sources tend to skew to the political left for antisemitic content.
arXiv Detail & Related papers (2024-04-11T20:17:35Z) - A Study of Partisan News Sharing in the Russian invasion of Ukraine [31.211851388657152]
Since the Russian invasion of Ukraine, a large volume of biased and partisan news has been spread via social media platforms.
We aim to characterize the role of such sharing in influencing users' communications.
We build a predictive model to identify users likely to spread partisan news.
arXiv Detail & Related papers (2023-11-26T13:25:11Z) - Fake News and Hate Speech: Language in Common [73.4764550713355]
We compute a novel index, the ingroup vs outgroup index, in three different datasets and show that both phenomena share an "us vs them" narrative.
arXiv Detail & Related papers (2022-12-05T15:35:10Z) - "I Can't Keep It Up." A Dataset from the Defunct Voat.co News Aggregator [0.0]
Voat.co was a news aggregator website that shut down on December 25, 2020.
This paper presents a dataset with over 2.3M submissions and 16.2M comments posted from 113K users in 7.1K subverses.
arXiv Detail & Related papers (2022-01-15T23:25:53Z) - Comparing the Language of QAnon-related content on Parler, Gab, and
Twitter [68.8204255655161]
Parler, a "free speech" platform popular with conservatives, was taken offline in January 2021 due to the lack of moderation of hateful and QAnon- and other conspiracy-related content.
We compare posts with the hashtag #QAnon on Parler over a month-long period with posts on Twitter and Gab.
Gab has the highest proportion of #QAnon posts with hate terms, and Parler and Twitter are similar in this respect.
On all three platforms, posts mentioning female political figures, Democrats, or Donald Trump have more anti-social language than posts mentioning male politicians, Republicans, or
arXiv Detail & Related papers (2021-11-22T11:19:15Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - The Rise and Fall of Fake News sites: A Traffic Analysis [62.51737815926007]
We investigate the online presence of fake news websites and characterize their behavior in comparison to real news websites.
Based on our findings, we build a content-agnostic ML for automatic detection of fake news websites.
arXiv Detail & Related papers (2021-03-16T18:10:22Z) - A Multi-Platform Analysis of Political News Discussion and Sharing on
Web Communities [13.364612995946876]
We compile a list of 1,073 news websites and extract posts from four Web communities that contain URLs from these sources.
This yields a dataset of 38M posts containing 15M news URLs, spanning almost three years.
We study the data along several axes, assessing the trustworthiness of shared news, designing a method to group news articles into stories, analyzing these stories are discussed and measuring the influence various Web communities have in that.
arXiv Detail & Related papers (2021-03-05T12:27:28Z) - Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media
during the COVID-19 Crisis [51.39895377836919]
COVID-19 has sparked racism and hate on social media targeted towards Asian communities.
We study the evolution and spread of anti-Asian hate speech through the lens of Twitter.
We create COVID-HATE, the largest dataset of anti-Asian hate and counterspeech spanning 14 months.
arXiv Detail & Related papers (2020-05-25T21:58:09Z) - Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms.
We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features.
We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.