Merchant Category Identification Using Credit Card Transactions
- URL: http://arxiv.org/abs/2011.02602v1
- Date: Thu, 5 Nov 2020 01:21:30 GMT
- Title: Merchant Category Identification Using Credit Card Transactions
- Authors: Chin-Chia Michael Yeh, Zhongfang Zhuang, Yan Zheng, Liang Wang,
Junpeng Wang, Wei Zhang
- Abstract summary: We design two encoders, where one is responsible for encoding temporal information and the other is responsible for affinity information.
Experiments on real-world credit card transaction data between 71,668 merchants and 433,772,755 customers have demonstrated the effectiveness and efficiency of the proposed model.
- Score: 20.15215476646073
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Digital payment volume has proliferated in recent years with the rapid growth
of small businesses and online shops. When processing these digital
transactions, recognizing each merchant's real identity (i.e., business type)
is vital to ensure the integrity of payment processing systems. Conventionally,
this problem is formulated as a time series classification problem solely using
the merchant transaction history. However, with the large scale of the data,
and changing behaviors of merchants and consumers over time, it is extremely
challenging to achieve satisfying performance from off-the-shelf classification
methods. In this work, we approach this problem from a multi-modal learning
perspective, where we use not only the merchant time series data but also the
information of merchant-merchant relationship (i.e., affinity) to verify the
self-reported business type (i.e., merchant category) of a given merchant.
Specifically, we design two individual encoders, where one is responsible for
encoding temporal information and the other is responsible for affinity
information, and a mechanism to fuse the outputs of the two encoders to
accomplish the identification task. Our experiments on real-world credit card
transaction data between 71,668 merchants and 433,772,755 customers have
demonstrated the effectiveness and efficiency of the proposed model.
Related papers
- Credit Card Fraud Detection Using Enhanced Random Forest Classifier for
Imbalanced Data [0.8223798883838329]
This paper implements the random forest (RF) algorithm to solve the issue in the hand.
A dataset of credit card transactions was used in this study.
arXiv Detail & Related papers (2023-03-11T22:59:37Z) - Challenges and Complexities in Machine Learning based Credit Card Fraud
Detection [0.0]
Volume of transactions, uniqueness of frauds and ingenuity of the fraudster are main challenges in detecting frauds.
The advent of machine learning, artificial intelligence and big data has opened up new tools in the fight against frauds.
However, the developments in fraud detection algorithms has been challenging and slow due to the massively unbalanced nature of fraud data.
arXiv Detail & Related papers (2022-08-20T07:53:51Z) - Evaluating resampling methods on a real-life highly imbalanced online
credit card payments dataset [0.0]
This paper evaluates numerous state-of-the-art resampling methods on a large real-life online credit card payments dataset.
We show they are inefficient because methods are intractable or because metrics do not exhibit substantial improvements.
arXiv Detail & Related papers (2022-06-27T09:57:08Z) - Macroscopic properties of buyer-seller networks in online marketplaces [55.41644538483948]
We analyze two datasets containing 245M transactions that took place on online marketplaces between 2010 and 2021.
We show that transactions in online marketplaces exhibit strikingly similar patterns despite significant differences in language, lifetimes, products, regulation, and technology.
arXiv Detail & Related papers (2021-12-16T18:00:47Z) - Characterization of Frequent Online Shoppers using Statistical Learning
with Sparsity [54.26540039514418]
This work reports a method to learn the shopping preferences of frequent shoppers to an online gift store by combining ideas from retail analytics and statistical learning with sparsity.
arXiv Detail & Related papers (2021-11-11T05:36:39Z) - Feature-Level Fusion of Super-App and Telecommunication Alternative Data
Sources for Credit Card Fraud Detection [106.33204064461802]
We review the effectiveness of a feature-level fusion of super-app customer information, mobile phone line data, and traditional credit risk variables for the early detection of identity theft credit card fraud.
We evaluate our approach over approximately 90,000 users from a credit lender's digital platform database.
arXiv Detail & Related papers (2021-11-05T19:10:35Z) - E-Commerce Dispute Resolution Prediction [69.84319333335935]
We take a first step towards automatically assisting human agents in dispute resolution at scale.
We construct a large dataset of disputes from the eBay online marketplace, and identify several interesting behavioral and linguistic patterns.
We then train classifiers to predict dispute outcomes with high accuracy.
arXiv Detail & Related papers (2021-10-13T09:45:06Z) - Supporting Financial Inclusion with Graph Machine Learning and Super-App
Alternative Data [63.942632088208505]
Super-Apps have changed the way we think about the interactions between users and commerce.
This paper investigates how different interactions between users within a Super-App provide a new source of information to predict borrower behavior.
arXiv Detail & Related papers (2021-02-19T15:13:06Z) - Credit card fraud detection using machine learning: A survey [0.5134435281973136]
We study data-driven credit card fraud detection particularities and several machine learning methods to address each of its intricate challenges.
In particular, we first characterize a typical credit card detection task: the dataset and its attributes, the metric choice along with some methods to handle such unbalanced datasets.
arXiv Detail & Related papers (2020-10-13T15:35:32Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.