Fake Advertisements Detection Using Automated Multimodal Learning: A Case Study for Vietnamese Real Estate Data
- URL: http://arxiv.org/abs/2501.10848v1
- Date: Sat, 18 Jan 2025 18:48:06 GMT
- Title: Fake Advertisements Detection Using Automated Multimodal Learning: A Case Study for Vietnamese Real Estate Data
- Authors: Duy Nguyen, Trung T. Nguyen, Cuong V. Nguyen,
- Abstract summary: FADAML is a novel end-to-end machine learning system to detect and filter out fake online advertisements.
Our system combines techniques in multimodal machine learning and automated machine learning to achieve a high detection rate.
- Score: 4.506099292980221
- License:
- Abstract: The popularity of e-commerce has given rise to fake advertisements that can expose users to financial and data risks while damaging the reputation of these e-commerce platforms. For these reasons, detecting and removing such fake advertisements are important for the success of e-commerce websites. In this paper, we propose FADAML, a novel end-to-end machine learning system to detect and filter out fake online advertisements. Our system combines techniques in multimodal machine learning and automated machine learning to achieve a high detection rate. As a case study, we apply FADAML to detect fake advertisements on popular Vietnamese real estate websites. Our experiments show that we can achieve 91.5% detection accuracy, which significantly outperforms three different state-of-the-art fake news detection systems.
Related papers
- Proactive Detection and Calibration of Seasonal Advertisements with Multimodal Large Language Models [5.425511887990726]
We present a research problem that is of interest for the ads ranking and recommendation community.
Our paper provides detailed guidelines from various angles of this problem tested in, and motivated by a large-scale industrial ads ranking system.
We present a conclusive solution we took during this research exploration: to detect seasonality, we leveraged Multimodal LLMs.
arXiv Detail & Related papers (2024-10-16T18:14:05Z) - FNDaaS: Content-agnostic Detection of Fake News sites [3.936965297430477]
We propose FND, the first automatic, content-agnostic fake news detection method.
It considers new and unstudied features such as network and structural characteristics per news website.
It can achieve an AUC score of up to 0.967 on past sites, and up to 77-92% accuracy on newly-flagged ones.
arXiv Detail & Related papers (2022-12-13T11:17:32Z) - Multiverse: Multilingual Evidence for Fake News Detection [71.51905606492376]
Multiverse is a new feature based on multilingual evidence that can be used for fake news detection.
The hypothesis of the usage of cross-lingual evidence as a feature for fake news detection is confirmed.
arXiv Detail & Related papers (2022-11-25T18:24:17Z) - UrduFake@FIRE2020: Shared Track on Fake News Identification in Urdu [62.6928395368204]
This paper gives the overview of the first shared task at FIRE 2020 on fake news detection in the Urdu language.
The goal is to identify fake news using a dataset composed of 900 annotated news articles for training and 400 news articles for testing.
The dataset contains news in five domains: (i) Health, (ii) Sports, (iii) Showbiz, (iv) Technology, and (v) Business.
arXiv Detail & Related papers (2022-07-25T03:46:51Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - Applying Multi-armed Bandit Algorithms to Computational Advertising [0.0]
We study the performance of various online learning algorithms to identify and display the best ads/offers with the highest conversion rates to web users.
We formulate our ad-selection problem as a Multi-Armed Bandit problem which is a classical paradigm in Machine Learning.
This article highlights some of our findings in the area of computational advertising from 2011 to 2015.
arXiv Detail & Related papers (2020-11-22T03:23:13Z) - A novel auction system for selecting advertisements in Real-Time bidding [68.8204255655161]
Real-Time Bidding is a new Internet advertising system that has become very popular in recent years.
We propose an alternative betting system with a new approach that not only considers the economic aspect but also other relevant factors for the functioning of the advertising system.
arXiv Detail & Related papers (2020-10-22T18:36:41Z) - Predicting conversions in display advertising based on URL embeddings [16.63178490961762]
We introduce and examine different models for estimating the probability of a user converting, given their history of visited URLs.
Inspired by natural language processing, we introduce three URL embedding models to compute semantically meaningful URL representations.
arXiv Detail & Related papers (2020-08-27T09:14:28Z) - A Deep Learning Approach for Automatic Detection of Fake News [47.00462375817434]
We propose two models based on deep learning for solving fake news detection problem in online news contents of multiple domains.
We evaluate our techniques on the two recently released datasets, namely FakeNews AMT and Celebrity for fake news detection.
arXiv Detail & Related papers (2020-05-11T09:07:46Z) - SpotTheFake: An Initial Report on a New CNN-Enhanced Platform for
Counterfeit Goods Detection [68.8204255655161]
The counterfeit goods trade represents nowadays more than 3.3% of the whole world trade.
This paper presents the design and early stage development of a novel counterfeit goods detection platform.
arXiv Detail & Related papers (2020-02-17T01:51:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.