Multilingual Disinformation Detection for Digital Advertising
- URL: http://arxiv.org/abs/2207.10649v1
- Date: Mon, 4 Jul 2022 10:29:20 GMT
- Title: Multilingual Disinformation Detection for Digital Advertising
- Authors: Zofia Trstanova, Nadir El Manouzi, Maryline Chen, Andre L. V. da
Cunha, Sergei Ivanov
- Abstract summary: We make the first step towards quickly detecting and red-flaging websites that potentially manipulate the public with disinformation.
We build a machine learning model based on multilingual text embeddings that first determines whether the page mentions a topic of interest, then estimates the likelihood of the content being malicious.
Our system empowers internal teams to proactively blacklist unsafe content, thus protecting the reputation of the advertisement provider.
- Score: 0.9684919127633844
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In today's world, the presence of online disinformation and propaganda is
more widespread than ever. Independent publishers are funded mostly via digital
advertising, which is unfortunately also the case for those publishing
disinformation content. The question of how to remove such publishers from
advertising inventory has long been ignored, despite the negative impact on the
open internet. In this work, we make the first step towards quickly detecting
and red-flagging websites that potentially manipulate the public with
disinformation. We build a machine learning model based on multilingual text
embeddings that first determines whether the page mentions a topic of interest,
then estimates the likelihood of the content being malicious, creating a
shortlist of publishers that will be reviewed by human experts. Our system
empowers internal teams to proactively, rather than defensively, blacklist
unsafe content, thus protecting the reputation of the advertisement provider.
Related papers
- Before & After: The Effect of EU's 2022 Code of Practice on Disinformation [2.1456348289599134]
We show that ad networks have withdrawn mostly from unpopular misinformation websites with very few visitors.
We show that ad networks continue to place advertisements of legitimate companies next to misinformation content.
In fact, major ad networks place ads in almost 400 misinformation websites of our dataset.
arXiv Detail & Related papers (2024-10-15T07:48:43Z) - PropaInsight: Toward Deeper Understanding of Propaganda in Terms of Techniques, Appeals, and Intent [71.20471076045916]
Propaganda plays a critical role in shaping public opinion and fueling disinformation.
Propainsight systematically dissects propaganda into techniques, arousal appeals, and underlying intent.
Propagaze combines human-annotated data with high-quality synthetic data.
arXiv Detail & Related papers (2024-09-19T06:28:18Z) - Long-Term Ad Memorability: Understanding & Generating Memorable Ads [54.23854539909078]
There has been no large-scale study on the memorability of ads.
We release the first memorability dataset, LAMBDA, consisting of 1749 participants and 2205 ads covering 276 brands.
arXiv Detail & Related papers (2023-09-01T10:27:04Z) - Discrimination through Image Selection by Job Advertisers on Facebook [79.21648699199648]
We propose and investigate the prevalence of a new means for discrimination in job advertising.
It combines both targeting and delivery -- through the disproportionate representation or exclusion of people of certain demographics in job ad images.
We use the Facebook Ad Library to demonstrate the prevalence of this practice.
arXiv Detail & Related papers (2023-06-13T03:43:58Z) - Problematic Advertising and its Disparate Exposure on Facebook [15.667983888666312]
We study Facebook and investigate key gaps in our understanding of problematic online advertising.
We find that older people and minority groups are especially likely to be shown such ads.
Given that 22% of problematic ads had no specific targeting from advertisers, we infer that ad delivery algorithms played a significant role in the biased distribution of these ads.
arXiv Detail & Related papers (2023-06-09T17:23:59Z) - Unveiling the Hidden Agenda: Biases in News Reporting and Consumption [59.55900146668931]
We build a six-year dataset on the Italian vaccine debate and adopt a Bayesian latent space model to identify narrative and selection biases.
We found a nonlinear relationship between biases and engagement, with higher engagement for extreme positions.
Analysis of news consumption on Twitter reveals common audiences among news outlets with similar ideological positions.
arXiv Detail & Related papers (2023-01-14T18:58:42Z) - Deep Breath: A Machine Learning Browser Extension to Tackle Online
Misinformation [0.0]
This paper proposes a novel system for detecting, processing, and warning users about misleading content online.
By training a machine learning model on an existing dataset of 32,000 clickbait news article headlines, the model predicts how sensationalist a headline is.
It interfaces with a web browser extension which constructs a unique content warning notification based on existing design principles.
arXiv Detail & Related papers (2023-01-09T12:43:58Z) - AI-Driven Contextual Advertising: A Technology Report and Implication
Analysis [0.0]
Programmatic advertising consists in automated auctioning of digital ad space.
The interest in contextual advertising is in part a counterreaction to the current dependency on personal data.
Developments in Artificial Intelligence (AI) allow for a deeper semantic understanding of context.
arXiv Detail & Related papers (2022-05-02T13:44:58Z) - Online Advertising Revenue Forecasting: An Interpretable Deep Learning
Approach [0.0]
We propose a novel attention-based architecture to predict publishers' advertising revenues.
Our results outperform several benchmark deep-learning time-series forecast models over multiple time horizons.
arXiv Detail & Related papers (2021-11-16T23:55:02Z) - Cross-Domain Learning for Classifying Propaganda in Online Contents [67.10699378370752]
We present an approach to leverage cross-domain learning, based on labeled documents and sentences from news and tweets, as well as political speeches with a clear difference in their degrees of being propagandistic.
Our experiments demonstrate the usefulness of this approach, and identify difficulties and limitations in various configurations of sources and targets for the transfer step.
arXiv Detail & Related papers (2020-11-13T10:19:13Z) - Political audience diversity and news reliability in algorithmic ranking [54.23273310155137]
We propose using the political diversity of a website's audience as a quality signal.
Using news source reliability ratings from domain experts and web browsing data from a diverse sample of 6,890 U.S. citizens, we first show that websites with more extreme and less politically diverse audiences have lower journalistic standards.
arXiv Detail & Related papers (2020-07-16T02:13:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.