Fugu-MT paper translation (abstract): SynthPID: P&ID digitization from Topology-Preserving Synthetic Data

Paper summary: SynthPID: P&ID digitization from Topology-Preserving Synthetic Data

arxiv url: http://arxiv.org/abs/2604.16513v1
Date: Wed, 15 Apr 2026 09:14:44 GMT
Status: translation complete
Last updated in system: 2026-04-21 21:52:52.04223
Title: SynthPID: P&ID digitization from Topology-Preserving Synthetic Data
Title (reference translation): SynthPID: P&ID Digitization from Topology-Preserving Synthetic Data
Authors: Suraj Prasad, Pinak Mahapatra
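Abstract summary: We introduce SynthPID, a corpus of 665 synthetic P&IDs seeded directly from real drawings. A model trained on SynthPID alone achieves 63.8 +/- 3.1% edge mAP on PID2Graph OPEN100.
Reference score (independently computed attention score): 0.0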
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Automating the digitization of Piping and Instrumentation Diagrams (P&IDs) into structured process graphs would unlock significant value in plant operations, yet progress is bottlenecked by a fundamental data problem: engineering drawings are proprietary, and the entire community shares a single public benchmark of just 12 annotated images. Prior attempts at synthetic augmentation have fallen short because template-based generators scatter symbols at random, producing graphs that bear little resemblance to real process plants and, accordingly, yield only approximately 33% edge detection accuracy under synth-only training. We argue the failure is structural rather than visual and address it by introducing SynthPID, a corpus of 665 synthetic P&IDs whose pipe topology is seeded directly from real drawings. Paired with a patch-based Relationformer adapted for high-resolution diagrams, a model trained on SynthPID alone achieves 63.8 +/- 3.1% edge mAP on PID2Graph OPEN100 without seeing a single real P&ID during training, closing within 8 pp of the real-data oracle. These gains hold up under a controlled comparison against the template-based regime, confirming that generation quality drives performance rather than model choice. A scaling study reveals that gains flatten beyond roughly 400 synthetic images, pointing to seed diversity as the binding constraint.
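Abstract (reference translation): Digitizing Piping and Instrumentation Diagrams (P&IDs) into structured process graphs would unlock significant value in plant operations, but progress is bottlenecked by a fundamental data problem: engineering drawings are proprietary, and the entire community shares a single public benchmark of only 12 annotated images. Template-based generators scatter symbols at random and produce graphs that bear little resemblance to real process plants; as a result, synth-only training yields only about 33% edge detection accuracy. We argue that the failure is structural rather than visual, and address it by introducing SynthPID, a corpus of 665 synthetic P&IDs whose pipe topology is seeded directly from real drawings. Paired with a patch-based Relationformer adapted for high-resolution diagrams, a model trained on SynthPID alone achieves 63.8 +/- 3.1% edge mAP on PID2Graph OPEN100 without seeing a single real P&ID during training, coming within 8 pp of the real-data oracle. These gains hold in a controlled comparison against the template-based regime, confirming that generation quality, rather than model choice, drives performance. A scaling study shows that gains flatten beyond roughly 400 synthetic images, indicating that seed diversity is the binding constraint.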

Related papers