I call BS: Fraud Detection in Crowdfunding Campaigns
- URL: http://arxiv.org/abs/2006.16849v1
- Date: Tue, 30 Jun 2020 14:38:21 GMT
- Title: I call BS: Fraud Detection in Crowdfunding Campaigns
- Authors: Beatrice Perez, Sara R. Machado, Jerone T. A. Andrews, Nicolas
Kourtellis
- Abstract summary: Donations to charity-based crowdfunding environments have been on the rise in the last few years.
We analyze data collected from different crowdfunding platforms, and annotate 700 campaigns as fraud or not.
It is possible to automatically classify such fraudulent behavior with up to 90.14% accuracy and 96.01% AUC.
- Score: 3.785123406103386
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Donations to charity-based crowdfunding environments have been on the rise in
the last few years. Unsurprisingly, deception and fraud in such platforms have
also increased, but have not been thoroughly studied to understand what
characteristics can expose such behavior and allow its automatic detection and
blocking. Indeed, crowdfunding platforms are the only ones typically performing
oversight for the campaigns launched in each service. However, they are not
properly incentivized to combat fraud among users and the campaigns they
launch: on the one hand, a platform's revenue is directly proportional to the
number of transactions performed (since the platform charges a fixed amount per
donation); on the other hand, if a platform is transparent with respect to how
much fraud it has, it may discourage potential donors from participating.
In this paper, we take the first step in studying fraud in crowdfunding
campaigns. We analyze data collected from different crowdfunding platforms, and
annotate 700 campaigns as fraud or not. We compute various textual and
image-based features and study their distributions and how they associate with
campaign fraud. Using these attributes, we build machine learning classifiers,
and show that it is possible to automatically classify such fraudulent behavior
with up to 90.14% accuracy and 96.01% AUC, only using features available from
the campaign's description at the moment of publication (i.e., with no user or
money activity), making our method applicable for real-time operation on a user
browser.
Related papers
- On the Use of Proxies in Political Ad Targeting [49.61009579554272]
We show that major political advertisers circumvented mitigations by targeting proxy attributes.
Our findings have crucial implications for the ongoing discussion on the regulation of political advertising.
arXiv Detail & Related papers (2024-10-18T17:15:13Z) - Unraveling the Web of Disinformation: Exploring the Larger Context of State-Sponsored Influence Campaigns on Twitter [16.64763746842362]
We study 19 state-sponsored disinformation campaigns that took place on Twitter, originating from various countries.
We build a machine learning-based classifier that can correctly identify up to 94% of accounts from unseen campaigns.
We also run our system in the wild and find more accounts that could potentially belong to state-backed operations.
arXiv Detail & Related papers (2024-07-25T15:03:33Z) - Using Artificial Intelligence to Unlock Crowdfunding Success for Small Businesses [8.226509113718125]
We utilize the latest advancements in AI technology to identify crucial factors that influence the success of crowdfunding campaigns.
Our best-performing machine learning model accurately predicts the fundraising outcomes of 81.0% of campaigns.
We demonstrate that by augmenting just three aspects of the narrative using a large language model, a campaign becomes more preferable to 83% human evaluators.
arXiv Detail & Related papers (2024-04-24T20:53:10Z) - A Latent Dirichlet Allocation (LDA) Semantic Text Analytics Approach to
Explore Topical Features in Charity Crowdfunding Campaigns [0.6298586521165193]
This study introduces an inventive text analytics framework, utilizing Latent Dirichlet Allocation (LDA) to extract latent themes from textual descriptions of charity campaigns.
The study has explored four different themes, two each in campaign and incentive descriptions.
The study was successful in using Random Forest to predict success of the campaign using both thematic and numerical parameters.
arXiv Detail & Related papers (2024-01-03T09:17:46Z) - Cybercrime Bitcoin Revenue Estimations: Quantifying the Impact of Methodology and Coverage [5.732759656069282]
We perform the first systematic analysis on the estimation of cybercrime bitcoin revenue.
In contrast to what is widely believed, we show that the revenue is not always underestimated.
We quantify, for the first time, the impact of the (lack of) coverage on the estimation.
arXiv Detail & Related papers (2023-09-07T09:35:23Z) - Discrimination through Image Selection by Job Advertisers on Facebook [79.21648699199648]
We propose and investigate the prevalence of a new means for discrimination in job advertising.
It combines both targeting and delivery -- through the disproportionate representation or exclusion of people of certain demographics in job ad images.
We use the Facebook Ad Library to demonstrate the prevalence of this practice.
arXiv Detail & Related papers (2023-06-13T03:43:58Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - How COVID-19 Have Changed Crowdfunding: Evidence From GoFundMe [77.34726150561087]
This study uses a unique data set of all the campaigns published over the past two years on GoFundMe.
We study a corpus of crowdfunded projects, analyzing cover images and other variables commonly present on crowdfunding sites.
arXiv Detail & Related papers (2021-06-18T08:03:58Z) - Keystroke Biometrics in Response to Fake News Propagation in a Global
Pandemic [77.79066811371978]
This work proposes and analyzes the use of keystroke biometrics for content de-anonymization.
Fake news have become a powerful tool to manipulate public opinion, especially during major events.
arXiv Detail & Related papers (2020-05-15T17:56:11Z) - Adversarial Attacks on Linear Contextual Bandits [87.08004581867537]
Malicious agents may have incentives to attack the bandit algorithm to induce it to perform a desired behavior.
We show that a malicious agent can force a linear contextual bandit algorithm to pull any desired arm $T - o(T)$ times over a horizon of $T$ steps.
We also investigate the case when a malicious agent is interested in affecting the behavior of the bandit algorithm in a single context.
arXiv Detail & Related papers (2020-02-10T15:04:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.