Knowledge Sharing via Domain Adaptation in Customs Fraud Detection
- URL: http://arxiv.org/abs/2201.06759v1
- Date: Tue, 18 Jan 2022 06:17:03 GMT
- Title: Knowledge Sharing via Domain Adaptation in Customs Fraud Detection
- Authors: Sungwon Park and Sundong Kim and Meeyoung Cha
- Abstract summary: This paper proposes DAS, a memory bank platform to facilitate knowledge sharing across multi-national customs administrations.
Data encompassing over 8 million import declarations have been used to test the feasibility of this new system.
- Score: 14.933341652591224
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Knowledge of the changing traffic is critical in risk management. Customs
offices worldwide have traditionally relied on local resources to accumulate
knowledge and detect tax fraud. This naturally poses countries with weak
infrastructure to become tax havens of potentially illicit trades. The current
paper proposes DAS, a memory bank platform to facilitate knowledge sharing
across multi-national customs administrations to support each other. We propose
a domain adaptation method to share transferable knowledge of frauds as
prototypes while safeguarding the local trade information. Data encompassing
over 8 million import declarations have been used to test the feasibility of
this new system, which shows that participating countries may benefit up to
2-11 times in fraud detection with the help of shared knowledge. We discuss
implications for substantial tax revenue potential and strengthened policy
against illicit trades.
Related papers
- Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models [53.547190001324665]
We propose REKI to acquire two types of external knowledge about users and items from large language models (LLMs)
We develop individual knowledge extraction and collective knowledge extraction tailored for different scales of scenarios, effectively reducing offline resource consumption.
Experiments demonstrate that REKI outperforms state-of-the-art baselines and is compatible with lots of recommendation algorithms and tasks.
arXiv Detail & Related papers (2024-08-20T03:45:24Z) - How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models [95.44559524735308]
Large language or multimodal model based verification has been proposed to scale up online policing mechanisms for mitigating spread of false and harmful content.
We test the limits of improving foundation model performance without continual updating through an initial study of knowledge transfer.
Our results on two recent multi-modal fact-checking benchmarks, Mocheg and Fakeddit, indicate that knowledge transfer strategies can improve Fakeddit performance over the state-of-the-art by up to 1.7% and Mocheg performance by up to 2.9%.
arXiv Detail & Related papers (2024-06-29T08:39:07Z) - Creating Geospatial Trajectories from Human Trafficking Text Corpora [0.0]
We propose a Narrative to Trajectory (N2T) information extraction system.
N2T analyzes reported narratives, extracts relevant information through the use of Natural Language Processing (NLP) techniques, and applies geospatial augmentation.
We evaluate N2T on human trafficking text corpora and demonstrate that our approach of utilizing data preprocessing and augmenting database techniques with NLP libraries outperforms existing geolocation detection methods.
arXiv Detail & Related papers (2024-05-09T22:24:09Z) - Language Models Can Reduce Asymmetry in Information Markets [100.38786498942702]
We introduce an open-source simulated digital marketplace where intelligent agents, powered by language models, buy and sell information on behalf of external participants.
The central mechanism enabling this marketplace is the agents' dual capabilities: they have the capacity to assess the quality of privileged information but also come equipped with the ability to forget.
To perform well, agents must make rational decisions, strategically explore the marketplace through generated sub-queries, and synthesize answers from purchased information.
arXiv Detail & Related papers (2024-03-21T14:48:37Z) - InfuserKI: Enhancing Large Language Models with Knowledge Graphs via
Infuser-Guided Knowledge Integration [61.554209059971576]
Large Language Models (LLMs) have shown remarkable open-generation capabilities across diverse domains.
Injecting new knowledge poses the risk of forgetting previously acquired knowledge.
We propose a novel Infuser-Guided Knowledge Integration framework.
arXiv Detail & Related papers (2024-02-18T03:36:26Z) - Private Knowledge Sharing in Distributed Learning: A Survey [50.51431815732716]
The rise of Artificial Intelligence has revolutionized numerous industries and transformed the way society operates.
It is crucial to utilize information in learning processes that are either distributed or owned by different entities.
Modern data-driven services have been developed to integrate distributed knowledge entities into their outcomes.
arXiv Detail & Related papers (2024-02-08T07:18:23Z) - GraphFC: Customs Fraud Detection with Label Scarcity [21.7060251265426]
With limited manpower, the custom offices can only undertake manual inspection of a limited number of declarations.
Current approaches for customs fraud detection are not well suited and designed for this real-world setting.
In this work, we propose a model-agnostic, domain-specific, semi-supervised graph neural network based customs fraud detection algorithm.
arXiv Detail & Related papers (2023-05-19T01:47:12Z) - Representation Learning on Graphs to Identifying Circular Trading in
Goods and Services Tax [1.5608023535768845]
Circular trading is a form of tax evasion where a group of fraudulent taxpayers (traders) aims to mask illegal transactions.
This work uses big data analytics and graph representation learning techniques to propose a framework to identify communities of circular traders.
arXiv Detail & Related papers (2022-08-16T10:46:21Z) - Customs Fraud Detection in the Presence of Concept Drift [2.257416403770908]
ADAPT is an adaptive selection method that controls the balance between exploitation and exploration strategies.
We find the system with ADAPT can gradually adapt to the dataset and find the appropriate amount of exploration ratio with high performance.
arXiv Detail & Related papers (2021-09-29T02:52:19Z) - Characterization of the Firm-Firm Public Procurement Co-Bidding Network
from the State of Cear\'a (Brazil) Municipalities [58.720142291102135]
We study the co-biding relationships between firms that participate in public tenders issued by the $184$ municipalities of the State of Cear'a (Brazil) between 2015 and 2019.
We identify $22$ groups/communities of firms with similar patterns of procurement activity, defined by their geographic and activity.
arXiv Detail & Related papers (2021-04-17T13:58:30Z) - Active Learning for Human-in-the-Loop Customs Inspection [12.66621970520437]
We study the human-in-the-loop customs inspection scenario, where an AI-assisted algorithm supports customs officers by recommending a set of imported goods to be inspected.
We show that a hybrid strategy of selecting likely fraudulent and uncertain items will eventually outperform the exploitation-only strategy.
arXiv Detail & Related papers (2020-10-27T13:31:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.