RiskSEA : A Scalable Graph Embedding for Detecting On-chain Fraudulent Activities on the Ethereum Blockchain
- URL: http://arxiv.org/abs/2410.02160v1
- Date: Thu, 3 Oct 2024 02:54:19 GMT
- Title: RiskSEA : A Scalable Graph Embedding for Detecting On-chain Fraudulent Activities on the Ethereum Blockchain
- Authors: Ayush Agarwal, Lv Lu, Arjun Maheswaran, Varsha Mahadevan, Bhaskar Krishnamachari,
- Abstract summary: RiskSEA is a scalable risk scoring system capable of handling the dynamic nature of large-scale blockchain transaction graphs.
We present two novel approaches for generating node2vec embeddings and effectively scaling it to the entire set of blockchain addresses.
Our experiments show that combining both behavioral and node2vec features boosts the classification performance significantly.
- Score: 7.954598640724204
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Like any other useful technology, cryptocurrencies are sometimes used for criminal activities. While transactions are recorded on the blockchain, there exists a need for a more rapid and scalable method to detect addresses associated with fraudulent activities. We present RiskSEA, a scalable risk scoring system capable of effectively handling the dynamic nature of large-scale blockchain transaction graphs. The risk scoring system, which we implement for Ethereum, consists of 1. a scalable approach to generating node2vec embedding for entire set of addresses to capture the graph topology 2. transaction-based features to capture the transactional behavioral pattern of an address 3. a classifier model to generate risk score for addresses that combines the node2vec embedding and behavioral features. Efficiently generating node2vec embedding for large scale and dynamically evolving blockchain transaction graphs is challenging, we present two novel approaches for generating node2vec embeddings and effectively scaling it to the entire set of blockchain addresses: 1. node2vec embedding propagation and 2. dynamic node2vec embedding. We present a comprehensive analysis of the proposed approaches. Our experiments show that combining both behavioral and node2vec features boosts the classification performance significantly, and that the dynamic node2vec embeddings perform better than the node2vec propagated embeddings.
Related papers
- Cluster-wise Graph Transformer with Dual-granularity Kernelized Attention [27.29964380651613]
We envision the graph as a network of interconnected node sets without compressing each cluster into a single embedding.
To enable effective information transfer among these node sets, we propose the Node-to-Cluster Attention (N2C-Attn) mechanism.
We show how N2C-Attn combines bi-level feature maps of queries and keys, demonstrating its capability to merge dual-granularity information.
arXiv Detail & Related papers (2024-10-09T10:30:01Z) - Data Availability and Decentralization: New Techniques for zk-Rollups in Layer 2 Blockchain Networks [14.27943855519429]
This paper introduces new techniques to address the data availability and decentralization challenges in Layer 2 networks.
We introduce the concept of proof of download'', which ensures that Layer 2 nodes cannot aggregate transactions without downloading historical data.
For decentralization, we introduce a new role separation for Layer 2, allowing nodes with limited hardware to participate.
arXiv Detail & Related papers (2024-03-16T06:34:51Z) - EntailE: Introducing Textual Entailment in Commonsense Knowledge Graph
Completion [54.12709176438264]
Commonsense knowledge graphs (CSKGs) utilize free-form text to represent named entities, short phrases, and events as their nodes.
Current methods leverage semantic similarities to increase the graph density, but the semantic plausibility of the nodes and their relations are under-explored.
We propose to adopt textual entailment to find implicit entailment relations between CSKG nodes, to effectively densify the subgraph connecting nodes within the same conceptual class.
arXiv Detail & Related papers (2024-02-15T02:27:23Z) - NodeFormer: A Scalable Graph Structure Learning Transformer for Node
Classification [70.51126383984555]
We introduce a novel all-pair message passing scheme for efficiently propagating node signals between arbitrary nodes.
The efficient computation is enabled by a kernerlized Gumbel-Softmax operator.
Experiments demonstrate the promising efficacy of the method in various tasks including node classification on graphs.
arXiv Detail & Related papers (2023-06-14T09:21:15Z) - Highly Available Blockchain Nodes With N-Version Design [13.131269677617286]
This paper introduces the concept of N-version blockchain nodes.
N-version nodes rely on simultaneous execution of different implementations of the same blockchain protocol.
We show that N-ETH is able to mitigate the effects of unstable execution environments and significantly enhance availability under environment faults.
arXiv Detail & Related papers (2023-03-25T11:16:17Z) - Graph-adaptive Rectified Linear Unit for Graph Neural Networks [64.92221119723048]
Graph Neural Networks (GNNs) have achieved remarkable success by extending traditional convolution to learning on non-Euclidean data.
We propose Graph-adaptive Rectified Linear Unit (GReLU) which is a new parametric activation function incorporating the neighborhood information in a novel and efficient way.
We conduct comprehensive experiments to show that our plug-and-play GReLU method is efficient and effective given different GNN backbones and various downstream tasks.
arXiv Detail & Related papers (2022-02-13T10:54:59Z) - Self-supervised Incremental Deep Graph Learning for Ethereum Phishing
Scam Detection [15.350215512903361]
Graph neural network (GNN) has shown promising performance in various node classification tasks.
For transaction data, which could be naturally abstracted to a real-world complex graph, the scarcity of labels and the huge volume of transaction data make it difficult to take advantage of GNN methods.
We propose a Self-supervised Incremental Graph learning model (SIEGE) for the phishing scam detection problem.
arXiv Detail & Related papers (2021-06-18T15:06:26Z) - Unveiling Anomalous Edges and Nominal Connectivity of Attributed
Networks [53.56901624204265]
The present work deals with uncovering anomalous edges in attributed graphs using two distinct formulations with complementary strengths.
The first relies on decomposing the graph data matrix into low rank plus sparse components to improve markedly performance.
The second broadens the scope of the first by performing robust recovery of the unperturbed graph, which enhances the anomaly identification performance.
arXiv Detail & Related papers (2021-04-17T20:00:40Z) - Identity Inference on Blockchain using Graph Neural Network [5.5927440285709835]
Identity inference, which aims to make a preliminary inference about account identity, plays a significant role in blockchain security.
We present a novel approach to analyze user's behavior from the perspective of the transaction subgraph, which naturally transforms the identity inference task into a graph classification pattern.
We also propose a generic end-to-end graph neural network model, named $textI2 textBGNN$, which can accept subgraph as input and learn a function mapping the transaction subgraph pattern to account identity.
arXiv Detail & Related papers (2021-04-14T00:15:38Z) - Node2Seq: Towards Trainable Convolutions in Graph Neural Networks [59.378148590027735]
We propose a graph network layer, known as Node2Seq, to learn node embeddings with explicitly trainable weights for different neighboring nodes.
For a target node, our method sorts its neighboring nodes via attention mechanism and then employs 1D convolutional neural networks (CNNs) to enable explicit weights for information aggregation.
In addition, we propose to incorporate non-local information for feature learning in an adaptive manner based on the attention scores.
arXiv Detail & Related papers (2021-01-06T03:05:37Z) - Self-supervised Graph Learning for Recommendation [69.98671289138694]
We explore self-supervised learning on user-item graph for recommendation.
An auxiliary self-supervised task reinforces node representation learning via self-discrimination.
Empirical studies on three benchmark datasets demonstrate the effectiveness of SGL.
arXiv Detail & Related papers (2020-10-21T06:35:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.