Digitize-PID: Automatic Digitization of Piping and Instrumentation
Diagrams
- URL: http://arxiv.org/abs/2109.03794v1
- Date: Wed, 8 Sep 2021 17:32:49 GMT
- Title: Digitize-PID: Automatic Digitization of Piping and Instrumentation
Diagrams
- Authors: Shubham Paliwal, Arushi Jain, Monika Sharma and Lovekesh Vig
- Abstract summary: Digitize-PID comprises of an end-to-end pipeline for detection of core components from P&IDs like pipes, symbols and textual information.
A novel and efficient kernel-based line detection and a two-step method for detection of complex symbols based on a fine-grained deep recognition technique is presented.
Results show that Digitize-PID outperforms the existing state-of-the-art for P&ID digitization.
- Score: 21.298283130966148
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Digitization of scanned Piping and Instrumentation diagrams(P&ID), widely
used in manufacturing or mechanical industries such as oil and gas over several
decades, has become a critical bottleneck in dynamic inventory management and
creation of smart P&IDs that are compatible with the latest CAD tools.
Historically, P&ID sheets have been manually generated at the design stage,
before being scanned and stored as PDFs. Current digitization initiatives
involve manual processing and are consequently very time consuming, labour
intensive and error-prone.Thanks to advances in image processing, machine and
deep learning techniques there are emerging works on P&ID digitization.
However, existing solutions face several challenges owing to the variation in
the scale, size and noise in the P&IDs, sheer complexity and crowdedness within
drawings, domain knowledge required to interpret the drawings. This motivates
our current solution called Digitize-PID which comprises of an end-to-end
pipeline for detection of core components from P&IDs like pipes, symbols and
textual information, followed by their association with each other and
eventually, the validation and correction of output data based on inherent
domain knowledge. A novel and efficient kernel-based line detection and a
two-step method for detection of complex symbols based on a fine-grained deep
recognition technique is presented in the paper. In addition, we have created
an annotated synthetic dataset, Dataset-P&ID, of 500 P&IDs by incorporating
different types of noise and complex symbols which is made available for public
use (currently there exists no public P&ID dataset). We evaluate our proposed
method on this synthetic dataset and a real-world anonymized private dataset of
12 P&ID sheets. Results show that Digitize-PID outperforms the existing
state-of-the-art for P&ID digitization.
Related papers
- Order-agnostic Identifier for Large Language Model-based Generative Recommendation [94.37662915542603]
Items are assigned identifiers for Large Language Models (LLMs) to encode user history and generate the next item.
Existing approaches leverage either token-sequence identifiers, representing items as discrete token sequences, or single-token identifiers, using ID or semantic embeddings.
We propose SETRec, which leverages semantic tokenizers to obtain order-agnostic multi-dimensional tokens.
arXiv Detail & Related papers (2025-02-15T15:25:38Z) - Explainable AI for Multivariate Time Series Pattern Exploration: Latent Space Visual Analytics with Temporal Fusion Transformer and Variational Autoencoders in Power Grid Event Diagnosis [1.170167705525779]
This paper proposes a novel visual analytics framework that integrates two generative AI models, Temporal Fusion Transformer (TFT) and Variational Autoencoders (VAEs)
It reduces complex patterns into lower-dimensional latent spaces and visualizes them in 2D using dimensionality reduction techniques such as PCA, t-SNE, and UMAP with DBSCAN.
The framework is demonstrated through a case study on power grid signal data, where it identifies multi-label grid event signatures, including faults and anomalies with diverse root causes.
arXiv Detail & Related papers (2024-12-20T17:41:11Z) - Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining [67.87810796668981]
Information-Sensitive Cropping (ISC) and Self-Refining Dual Learning (SRDL)
Iris achieves state-of-the-art performance across multiple benchmarks with only 850K GUI annotations.
These improvements translate to significant gains in both web and OS agent downstream tasks.
arXiv Detail & Related papers (2024-12-13T18:40:10Z) - Transforming Engineering Diagrams: A Novel Approach for P&ID Digitization using Transformers [2.5056643038238495]
We propose a novel approach that utilizes the Relationformer, a state-of-the-art deep learning architecture, to extract graphs from P&IDs.
We present PID2Graph, the first publicly accessible P&ID dataset featuring comprehensive labels for the graph structure.
For the real-world data, the Relationformer achieves convincing results, outperforming the modular digitization approach for edge detection by more than 25%.
arXiv Detail & Related papers (2024-11-21T08:27:18Z) - PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding [102.07914175196817]
PhotoMaker is an efficient personalized text-to-image generation method.
It encodes an arbitrary number of input ID images into a stack ID embedding for preserving ID information.
arXiv Detail & Related papers (2023-12-07T17:32:29Z) - The Wyner Variational Autoencoder for Unsupervised Multi-Layer Wireless
Fingerprinting [6.632671046812309]
We propose a multi-layer fingerprinting framework that jointly considers the multi-layer signatures for improved identification performance.
In contrast to previous works, by leveraging the recent multi-view machine learning paradigm, our method can cluster the device information shared among the multi-layer features without supervision.
Our empirical results show that the proposed method outperforms the state-of-the-art baselines in both supervised and unsupervised settings.
arXiv Detail & Related papers (2023-03-28T10:05:06Z) - Synthetic ID Card Image Generation for Improving Presentation Attack
Detection [12.232059909207578]
This work explores three methods for synthetically generating ID card images to increase the amount of data while training fraud-detection networks.
Our results indicate that databases can be supplemented with synthetic images without any loss in performance for the print/scan Presentation Attack Instrument Species (PAIS) and a loss in performance of 1% for the screen capture PAIS.
arXiv Detail & Related papers (2022-10-31T19:07:30Z) - Towards automatic generation of Piping and Instrumentation Diagrams
(P&IDs) with Artificial Intelligence [0.0]
We propose a novel, completely data-driven method for the prediction of control structures.
We cast the control structure prediction as a translation task where Process Flow Diagrams (PFDs) are translated to P&IDs.
The model achieved a top-5 accuracy of 74.8% on 10,000 generated P&IDs and 89.2% on 100,000 generated P&IDs.
arXiv Detail & Related papers (2022-10-26T10:03:15Z) - Automatic digital twin data model generation of building energy systems
from piping and instrumentation diagrams [58.720142291102135]
We present an approach to recognize symbols and connections of P&ID from buildings in a completely automated way.
We apply algorithms for symbol recognition, line recognition and derivation of connections to the data sets.
The approach can be used in further processes like control generation, (distributed) model predictive control or fault detection.
arXiv Detail & Related papers (2021-08-31T15:09:39Z) - Generalized Iris Presentation Attack Detection Algorithm under
Cross-Database Settings [63.90855798947425]
Presentation attacks pose major challenges to most of the biometric modalities.
We propose a generalized deep learning-based presentation attack detection network, MVANet.
It is inspired by the simplicity and success of hybrid algorithm or fusion of multiple detection networks.
arXiv Detail & Related papers (2020-10-25T22:42:27Z) - Data Mining with Big Data in Intrusion Detection Systems: A Systematic
Literature Review [68.15472610671748]
Cloud computing has become a powerful and indispensable technology for complex, high performance and scalable computation.
The rapid rate and volume of data creation has begun to pose significant challenges for data management and security.
The design and deployment of intrusion detection systems (IDS) in the big data setting has, therefore, become a topic of importance.
arXiv Detail & Related papers (2020-05-23T20:57:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.