Query-based Industrial Analytics over Knowledge Graphs with Ontology
Reshaping
- URL: http://arxiv.org/abs/2209.11089v1
- Date: Thu, 22 Sep 2022 15:20:58 GMT
- Title: Query-based Industrial Analytics over Knowledge Graphs with Ontology
Reshaping
- Authors: Zhuoxun Zheng, Baifan Zhou, Dongzhuoran Zhou, Gong Cheng, Ernesto
Jim\'enez-Ruiz, Ahmet Soylu, Evgeny Kharlamo
- Abstract summary: Poor design of high degree of mismatch between them and industrial data naturally lead to KGs of low quality.
We propose an approach to transform analytics into KGta to reflect the underlying data and thus help to maintain better KG schemas.
- Score: 6.047374579252933
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Industrial analytics that includes among others equipment diagnosis and
anomaly detection heavily relies on integration of heterogeneous production
data. Knowledge Graphs (KGs) as the data format and ontologies as the unified
data schemata are a prominent solution that offers high quality data
integration and a convenient and standardised way to exchange data and to layer
analytical applications over it. However, poor design of ontologies of high
degree of mismatch between them and industrial data naturally lead to KGs of
low quality that impede the adoption and scalability of industrial analytics.
Indeed, such KGs substantially increase the training time of writing queries
for users, consume high volume of storage for redundant information, and are
hard to maintain and update. To address this problem we propose an ontology
reshaping approach to transform ontologies into KG schemata that better reflect
the underlying data and thus help to construct better KGs. In this poster we
present a preliminary discussion of our on-going research, evaluate our
approach with a rich set of SPARQL queries on real-world industry data at Bosch
and discuss our findings.
Related papers
- Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency [59.6772484292295]
Knowledge graphs (KGs) generated by large language models (LLMs) are increasingly valuable for Retrieval-Augmented Generation (RAG) applications.
Existing KG extraction methods rely on prompt-based approaches, which are inefficient for processing large-scale corpora.
We propose SynthKG, a multi-step, document-level synthesis KG workflow based on LLMs.
We also design a novel graph-based retrieval framework for RAG.
arXiv Detail & Related papers (2024-10-22T00:47:54Z) - Enhancing Robustness of Graph Neural Networks through p-Laplacian [2.3942577670144423]
Graph Neural Networks (GNNs) have shown great promise in various applications.
adversarial attacks can happen over the data, whether during training (poisoning attack) or during testing (evasion attack)
This paper presents a computationally efficient framework, namely, pLapGNN, based on weighted p-Laplacian for making GNNs robust.
arXiv Detail & Related papers (2024-09-27T18:51:05Z) - Visual Data Diagnosis and Debiasing with Concept Graphs [50.84781894621378]
Deep learning models often pick up inherent biases in the data during the training process, leading to unreliable predictions.
We present CONBIAS, a novel framework for diagnosing and mitigating Concept co-occurrence Biases in visual datasets.
We show that by employing a novel clique-based concept balancing strategy, we can mitigate these imbalances, leading to enhanced performance on downstream tasks.
arXiv Detail & Related papers (2024-09-26T16:59:01Z) - Enhancing High-Energy Particle Physics Collision Analysis through Graph Data Attribution Techniques [0.0]
This paper uses a simulated particle collision dataset to integrate influence analysis inside the graph classification pipeline.
By using a Graph Neural Network for initial training, we applied a gradient-based data influence method to identify influential training samples.
By analyzing the discarded elements we can provide further insights about the event classification task.
arXiv Detail & Related papers (2024-07-20T12:40:03Z) - DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation [83.30006900263744]
Data analysis is a crucial analytical process to generate in-depth studies and conclusive insights.
We propose to automatically generate high-quality answer annotations leveraging the code-generation capabilities of LLMs.
Our DACO-RL algorithm is evaluated by human annotators to produce more helpful answers than SFT model in 57.72% cases.
arXiv Detail & Related papers (2024-03-04T22:47:58Z) - An Open-Source Knowledge Graph Ecosystem for the Life Sciences [5.665519167428707]
PheKnowLator is a semantic ecosystem for automating the construction of ontologically grounded knowledge graphs.
The ecosystem includes KG construction resources, analysis tools, and benchmarks.
PheKnowLator enables fully customizable KGs without compromising performance or usability.
arXiv Detail & Related papers (2023-07-11T18:55:09Z) - Analytical Engines With Context-Rich Processing: Towards Efficient
Next-Generation Analytics [12.317930859033149]
We envision an analytical engine co-optimized with components that enable context-rich analysis.
We aim for a holistic pipeline cost- and rule-based optimization across relational and model-based operators.
arXiv Detail & Related papers (2022-12-14T21:46:33Z) - Towards Ontology Reshaping for KG Generation with User-in-the-Loop:
Applied to Bosch Welding [18.83458273005337]
Knowledge graphs (KG) are used in a wide range of applications.
The automation of KG generation is very desired due to the data volume and variety in industries.
arXiv Detail & Related papers (2022-09-22T14:59:13Z) - Deep Learning based pipeline for anomaly detection and quality
enhancement in industrial binder jetting processes [68.8204255655161]
Anomaly detection describes methods of finding abnormal states, instances or data points that differ from a normal value space.
This paper contributes to a data-centric way of approaching artificial intelligence in industrial production.
arXiv Detail & Related papers (2022-09-21T08:14:34Z) - Mapping Patterns for Virtual Knowledge Graphs [71.61234136161742]
Virtual Knowledge Graphs (VKG) constitute one of the most promising paradigms for integrating and accessing legacy data sources.
We build on well-established methodologies and patterns studied in data management, data analysis, and conceptual modeling.
We validate our catalog on the considered VKG scenarios, showing it covers the vast majority of patterns present therein.
arXiv Detail & Related papers (2020-12-03T13:54:52Z) - Topology-based Clusterwise Regression for User Segmentation and Demand
Forecasting [63.78344280962136]
Using a public and a novel proprietary data set of commercial data, this research shows that the proposed system enables analysts to both cluster their user base and plan demand at a granular level.
This work seeks to introduce TDA-based clustering of time series and clusterwise regression with matrix factorization methods as viable tools for the practitioner.
arXiv Detail & Related papers (2020-09-08T12:10:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.