Construction and Applications of Billion-Scale Pre-Trained Multimodal
Business Knowledge Graph
- URL: http://arxiv.org/abs/2209.15214v6
- Date: Sun, 19 Mar 2023 11:28:38 GMT
- Title: Construction and Applications of Billion-Scale Pre-Trained Multimodal
Business Knowledge Graph
- Authors: Shumin Deng, Chengming Wang, Zhoubo Li, Ningyu Zhang, Zelin Dai,
Hehong Chen, Feiyu Xiong, Ming Yan, Qiang Chen, Mosha Chen, Jiaoyan Chen,
Jeff Z. Pan, Bryan Hooi, Huajun Chen
- Abstract summary: We introduce the process of building an open business knowledge graph (OpenBG) derived from a well-known enterprise, Alibaba Group.
OpenBG is an open business KG of unprecedented scale: 2.6 billion triples with more than 88 million entities covering over 1 million core classes/concepts and 2,681 types of relations.
- Score: 64.42060648398743
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Business Knowledge Graphs (KGs) are important to many enterprises today,
providing factual knowledge and structured data that steer many products and
make them more intelligent. Despite their promising benefits, building a business
KG requires solving prohibitive issues of deficient structure and multiple
modalities. In this paper, we advance the understanding of the practical
challenges related to building KGs in non-trivial real-world systems. We
introduce the process of building an open business knowledge graph (OpenBG)
derived from a well-known enterprise, Alibaba Group. Specifically, we define a
core ontology to cover various abstract products and consumption demands, with
fine-grained taxonomy and multimodal facts in deployed applications. OpenBG is
an open business KG of unprecedented scale: 2.6 billion triples with more than
88 million entities covering over 1 million core classes/concepts and 2,681
types of relations. We release all the open resources (OpenBG benchmarks)
derived from it for the community and report experimental results of KG-centric
tasks. We also ran an online competition based on OpenBG benchmarks, which has
attracted thousands of teams. We further pre-train OpenBG and apply it to many
KG-enhanced downstream tasks in business scenarios, demonstrating the
effectiveness of billion-scale multimodal knowledge for e-commerce. All the
resources and code have been released at
https://github.com/OpenBGBenchmark/OpenBG.
Related papers
- Uncertainty Management in the Construction of Knowledge Graphs: a Survey [3.5639148953570845]
Knowledge Graphs (KGs) are a major asset for companies thanks to their great flexibility in data representation.
To build a KG it is a common practice to rely on automatic methods for extracting knowledge from various heterogeneous sources.
In a noisy and uncertain world, knowledge may not be reliable and conflicts between data sources may occur.
arXiv Detail & Related papers (2024-05-27T08:22:52Z)
- Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks [48.102084345907095]
Knowledge graph pre-training (KGP) aims to pre-train neural networks on large-scale knowledge graphs (KGs).
MuDoK is a plug-and-play prompt learning approach that can be adapted to different downstream task backbones.
Our framework brings significant performance gains, along with its generality, efficiency, and transferability.
arXiv Detail & Related papers (2024-05-21T08:22:14Z)
- Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering [90.30473970040362]
We propose a training-free method called Generate-on-Graph (GoG) that can generate new factual triples while exploring Knowledge Graphs (KGs).
Specifically, we propose a selecting-generating-answering framework, which not only treats the LLM as an agent to explore KGs, but also treats it as a KG to generate new facts based on the explored subgraph.
arXiv Detail & Related papers (2024-04-23T04:47:22Z)
- MyGO: Discrete Modality Information as Fine-Grained Tokens for Multi-modal Knowledge Graph Completion [51.80447197290866]
We introduce MyGO to process, fuse, and augment the fine-grained modality information from MMKGs.
MyGO tokenizes multi-modal raw data as fine-grained discrete tokens and learns entity representations with a cross-modal entity encoder.
Experiments on standard MMKGC benchmarks reveal that our method surpasses 20 of the latest models.
arXiv Detail & Related papers (2024-04-15T05:40:41Z)
- FedCQA: Answering Complex Queries on Multi-Source Knowledge Graphs via Federated Learning [55.02512821257247]
Complex logical query answering is a challenging task in knowledge graphs (KGs).
Recent approaches represent KG entities as embedding vectors and find answers to logical queries from the KGs.
It remains unknown how to answer queries on multi-source KGs.
arXiv Detail & Related papers (2024-02-22T14:57:44Z)
- Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey [61.8716670402084]
This survey focuses on KG-aware research in two principal aspects: KG-driven Multi-Modal (KG4MM) learning, and Multi-Modal Knowledge Graph (MM4KG).
Our review includes two primary task categories: KG-aware multi-modal learning tasks, and intrinsic MMKG tasks.
For most of these tasks, we provide definitions, evaluation benchmarks, and additionally outline essential insights for conducting relevant research.
arXiv Detail & Related papers (2024-02-08T04:04:36Z)
- Generations of Knowledge Graphs: The Crazy Ideas and the Business Impact [19.774378927811725]
We describe three generations of knowledge graphs: entity-based KGs, text-rich KGs, and dual neural KGs.
We use KGs as examples to demonstrate a recipe to evolve research ideas from innovations to production practice, and then to the next level of innovations.
arXiv Detail & Related papers (2023-08-27T22:35:27Z)
- Construction of Knowledge Graphs: State and Challenges [2.245333517888782]
We discuss the main graph models for knowledge graphs (KGs) and introduce the major requirement for future KG construction pipelines.
Next, we provide an overview of the necessary steps to build high-quality KGs, including cross-cutting topics such as metadata management.
We evaluate the state of the art of KG construction with respect to the introduced requirements for specific popular KGs as well as some recent tools and strategies for KG construction.
arXiv Detail & Related papers (2023-02-22T17:26:03Z)
- KGTK: A Toolkit for Large Knowledge Graph Manipulation and Analysis [9.141014703209494]
KGTK is a data science-centric toolkit designed to represent, create, transform, enhance and analyze KGs.
We illustrate the framework with real-world scenarios where we have used KGTK to integrate and manipulate large KGs, such as Wikidata, DBpedia and ConceptNet.
arXiv Detail & Related papers (2020-05-29T21:29:14Z)
This list is automatically generated from the titles and abstracts of the papers on this site.