RDF-star2Vec: RDF-star Graph Embeddings for Data Mining
- URL: http://arxiv.org/abs/2312.15626v1
- Date: Mon, 25 Dec 2023 06:32:14 GMT
- Title: RDF-star2Vec: RDF-star Graph Embeddings for Data Mining
- Authors: Shusaku Egami, Takanori Ugai, Masateru Oota, Kyoumoto Matsushita,
Takahiro Kawamura, Kouji Kozaki, Ken Fukuda
- Abstract summary: This study introduces RDF-star2Vec, a novel Knowledge Graph embedding model for RDF-star graphs.
We provide a dataset and a benchmarking framework for data mining tasks focused on complex RDF-star graphs.
- Score: 1.6492989697868894
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Knowledge Graphs (KGs) such as Resource Description Framework (RDF) data
represent relationships between various entities through the structure of
triples (<subject, predicate, object>). Knowledge graph embedding (KGE) is
crucial in machine learning applications, specifically in node classification
and link prediction tasks. KGE remains a vital research topic within the
semantic web community. RDF-star introduces the concept of a quoted triple
(QT), a specific form of triple employed either as the subject or object within
another triple. Moreover, RDF-star permits a QT to act as compositional
entities within another QT, thereby enabling the representation of recursive,
hyper-relational KGs with nested structures. However, existing KGE models fail
to adequately learn the semantics of QTs and entities, primarily because they
do not account for RDF-star graphs containing multi-leveled nested QTs and
QT-QT relationships. This study introduces RDF-star2Vec, a novel KGE model
specifically designed for RDF-star graphs. RDF-star2Vec introduces graph walk
techniques that enable probabilistic transitions between a QT and its
compositional entities. Feature vectors for QTs, entities, and relations are
derived from generated sequences through the structured skip-gram model.
Additionally, we provide a dataset and a benchmarking framework for data mining
tasks focused on complex RDF-star graphs. Evaluative experiments demonstrated
that RDF-star2Vec yielded superior performance compared to recent extensions of
RDF2Vec in various tasks including classification, clustering, entity
relatedness, and QT similarity.
Related papers
- A Query-Aware Multi-Path Knowledge Graph Fusion Approach for Enhancing Retrieval-Augmented Generation in Large Language Models [3.0748861313823]
QMKGF is a Query-Aware Multi-Path Knowledge Graph Fusion Approach for Enhancing Retrieval Augmented Generation.<n>We design prompt templates and employ general-purpose LLMs to extract entities and relations.<n>We introduce a multi-path subgraph construction strategy that incorporates one-hop relations, multi-hop relations, and importance-based relations.
arXiv Detail & Related papers (2025-07-07T02:22:54Z) - Relational Deep Learning: Challenges, Foundations and Next-Generation Architectures [50.46688111973999]
Graph machine learning has led to a significant increase in the capabilities of models that learn on arbitrary graph-structured data.<n>We present a new blueprint that enables end-to-end representation of'relational entity graphs' without traditional engineering feature.<n>We discuss key challenges including large-scale multi-table integration and the complexities of modeling temporal dynamics and heterogeneous data.
arXiv Detail & Related papers (2025-06-19T23:51:38Z) - Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval [22.33550491040999]
RAG grounds large language models in external evidence, yet it still falters when answers must be pieced together across semantically distant documents.<n>We build two plug-and-play retrievers: StatementGraphRAG and TopicGraphRAG.<n>Our methods outperform naive chunk-based RAG achieving an average relative improvement of 23.1% in retrieval recall and correctness.
arXiv Detail & Related papers (2025-06-09T17:58:35Z) - Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models [63.64507678113921]
We introduce Graph-KV, which governs interaction through structural inductive biases.<n>In this framework, 'target' segments selectively attend only to the KV-cache of their designated'source' segments.<n>We evaluate Graph-KV across three scenarios: (1) seven RAG benchmarks spanning direct inference, multi-hop reasoning, and long-document understanding; (2) Arxiv-QA, a novel academic paper QA task with full-text scientific papers structured as citation ego-graphs; and (3) paper topic classification within a citation network.
arXiv Detail & Related papers (2025-06-09T00:30:08Z) - KG-QAGen: A Knowledge-Graph-Based Framework for Systematic Question Generation and Long-Context LLM Evaluation [3.618621510356872]
KG-QAGen is a framework that extracts QA pairs at multiple complexity levels.<n>We construct a dataset of 20,139 QA pairs and open-source a part of it.<n>We evaluate 13 proprietary and open-source LLMs and observe that even the best-performing models are struggling with set-based comparisons.
arXiv Detail & Related papers (2025-05-18T16:46:39Z) - Transformers for Complex Query Answering over Knowledge Hypergraphs [48.55646194244594]
Triple KGs, as the classic KGs composed of entities and relations of arity 2, have limited representation of real-world facts.
We propose a two-stage transformer model, the Logical Knowledge Hypergraph Transformer (LKHGT), which consists of a Projection for atomic projection and a Logical for complex logical operations.
Experimental results on CQA datasets show that LKHGT is a state-of-the-art CQA method over KHG and is able to generalize to out-of-distribution query types.
arXiv Detail & Related papers (2025-04-23T09:07:21Z) - Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks [50.42343781348247]
We develop a graph Poisson factor analysis (GPFA) which provides analytic conditional posteriors to improve the inference accuracy.
We also extend GPFA to a multi-stochastic-layer version named graph Poisson gamma belief network (GPGBN) to capture the hierarchical document relationships at multiple semantic levels.
Our models can extract high-quality hierarchical latent document representations and achieve promising performance on various graph analytic tasks.
arXiv Detail & Related papers (2024-10-13T02:22:14Z) - A Pure Transformer Pretraining Framework on Text-attributed Graphs [50.833130854272774]
We introduce a feature-centric pretraining perspective by treating graph structure as a prior.
Our framework, Graph Sequence Pretraining with Transformer (GSPT), samples node contexts through random walks.
GSPT can be easily adapted to both node classification and link prediction, demonstrating promising empirical success on various datasets.
arXiv Detail & Related papers (2024-06-19T22:30:08Z) - Unleashing the Potential of Text-attributed Graphs: Automatic Relation Decomposition via Large Language Models [31.443478448031886]
RoSE (Relation-oriented Semantic Edge-decomposition) is a novel framework that decomposes the graph structure by analyzing raw text attributes.
Our framework significantly enhances node classification performance across various datasets, with improvements of up to 16% on the Wisconsin dataset.
arXiv Detail & Related papers (2024-05-28T20:54:47Z) - Graph Attention with Hierarchies for Multi-hop Question Answering [19.398300844233837]
We present two extensions to the SOTA Graph Neural Network (GNN) based model for HotpotQA.
Experiments on HotpotQA demonstrate the efficiency of the proposed modifications.
arXiv Detail & Related papers (2023-01-27T15:49:50Z) - GrannGAN: Graph annotation generative adversarial networks [72.66289932625742]
We consider the problem of modelling high-dimensional distributions and generating new examples of data with complex relational feature structure coherent with a graph skeleton.
The model we propose tackles the problem of generating the data features constrained by the specific graph structure of each data point by splitting the task into two phases.
In the first it models the distribution of features associated with the nodes of the given graph, in the second it complements the edge features conditionally on the node features.
arXiv Detail & Related papers (2022-12-01T11:49:07Z) - Relational Graph Convolutional Neural Networks for Multihop Reasoning: A
Comparative Study [22.398477810999818]
Multihop Question Answering is a complex task that requires steps of reasoning to find the correct answer.
In this paper we explore a number of RGCN-based Multihop QA models, graph relations, and node embeddings, and empirically explore the influence each on Multihop QA performance on the WikiHop dataset.
arXiv Detail & Related papers (2022-10-12T17:13:30Z) - ExpressivE: A Spatio-Functional Embedding For Knowledge Graph Completion [78.8942067357231]
ExpressivE embeds pairs of entities as points and relations as hyper-parallelograms in the virtual triple space.
We show that ExpressivE is competitive with state-of-the-art KGEs and even significantly outperforms them on W18RR.
arXiv Detail & Related papers (2022-06-08T23:34:39Z) - Skip Vectors for RDF Data: Extraction Based on the Complexity of Feature
Patterns [0.0]
The Resource Description Framework (RDF) is a framework for describing metadata, such as attributes and relationships of resources on the Web.
We propose a novel feature vector (called a Skip vector) that represents some features of each resource in an RDF graph by extracting various combinations of neighboring edges and nodes.
The classification tasks can be performed by applying the low-dimensional Skip vector of each resource to conventional machine learning algorithms, such as SVMs, the k-nearest neighbors method, neural networks, random forests, and AdaBoost.
arXiv Detail & Related papers (2022-01-06T10:07:49Z) - Graph Neural Networks with Composite Kernels [60.81504431653264]
We re-interpret node aggregation from the perspective of kernel weighting.
We present a framework to consider feature similarity in an aggregation scheme.
We propose feature aggregation as the composition of the original neighbor-based kernel and a learnable kernel to encode feature similarities in a feature space.
arXiv Detail & Related papers (2020-05-16T04:44:29Z) - Embedding Graph Auto-Encoder for Graph Clustering [90.8576971748142]
Graph auto-encoder (GAE) models are based on semi-supervised graph convolution networks (GCN)
We design a specific GAE-based model for graph clustering to be consistent with the theory, namely Embedding Graph Auto-Encoder (EGAE)
EGAE consists of one encoder and dual decoders.
arXiv Detail & Related papers (2020-02-20T09:53:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.