Fugu-MT 論文翻訳(概要): Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures

論文の概要: Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures

arxiv url: http://arxiv.org/abs/2509.03857v1
Date: Thu, 04 Sep 2025 03:34:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-05 20:21:10.040851
Title: Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures
Title（参考訳）: 決定論的知識グラフ構造を用いた大規模生成AIの連続モニタリング
Authors: Kishor Datta Gupta, Mohd Ariful Haque, Hasmot Ali, Marufa Kamal, Syed Bahauddin Alam, Mohammad Ashiqur Rahman,
Abstract要約: そこで本研究では,決定論的・大規模言語モデル(LLM)生成知識グラフ(KG)を用いて,AIの信頼性を計測する手法を提案する。 i) 明示的な規則に基づく手法,辞書,構造化エンティティ関係抽出規則を用いて構築された決定論的KGと,(ii) ライブニュース記事などのリアルタイムデータストリームから動的にLLM生成したKGの2つのKGを構築する。 Instantiated Class Ratio(ICR)、Instantiated Property Ratio(IPR)、Class Instantiation(CI)など、いくつかの確立されたKG指標を用いる。
参考スコア（独自算出の注目度）: 2.7277205894982095
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generative AI (GEN AI) models have revolutionized diverse application domains but present substantial challenges due to reliability concerns, including hallucinations, semantic drift, and inherent biases. These models typically operate as black-boxes, complicating transparent and objective evaluation. Current evaluation methods primarily depend on subjective human assessment, limiting scalability, transparency, and effectiveness. This research proposes a systematic methodology using deterministic and Large Language Model (LLM)-generated Knowledge Graphs (KGs) to continuously monitor and evaluate GEN AI reliability. We construct two parallel KGs: (i) a deterministic KG built using explicit rule-based methods, predefined ontologies, domain-specific dictionaries, and structured entity-relation extraction rules, and (ii) an LLM-generated KG dynamically derived from real-time textual data streams such as live news articles. Utilizing real-time news streams ensures authenticity, mitigates biases from repetitive training, and prevents adaptive LLMs from bypassing predefined benchmarks through feedback memorization. To quantify structural deviations and semantic discrepancies, we employ several established KG metrics, including Instantiated Class Ratio (ICR), Instantiated Property Ratio (IPR), and Class Instantiation (CI). An automated real-time monitoring framework continuously computes deviations between deterministic and LLM-generated KGs. By establishing dynamic anomaly thresholds based on historical structural metric distributions, our method proactively identifies and flags significant deviations, thus promptly detecting semantic anomalies or hallucinations. This structured, metric-driven comparison between deterministic and dynamically generated KGs delivers a robust and scalable evaluation framework.
Abstract（参考訳）: ジェネレーティブAI(gen AI)モデルは、多様なアプリケーションドメインに革命をもたらしたが、幻覚、セマンティックドリフト、固有のバイアスなど、信頼性に関する懸念から、重大な課題を提起している。これらのモデルは一般にブラックボックスとして機能し、透明で客観的な評価を複雑にする。現在の評価方法は、主に主観的人間評価、スケーラビリティの制限、透明性、有効性に依存している。本研究では,決定論的・大規模言語モデル(LLM)が生成する知識グラフ(KG)を用いて,遺伝子AIの信頼性を継続的に監視・評価する手法を提案する。並列KGを2つ構築する。一明示的な規則に基づく方法、事前定義されたオントロジー、ドメイン固有の辞書、構造化された実体関係抽出規則を用いて構築された決定論的KG (II)ライブニュース記事などのリアルタイムテキストデータストリームからLLM生成KGを動的に生成する。リアルタイムニュースストリームを利用することで、信頼性が保証され、バイアスが反復的なトレーニングから軽減され、適応LDMが事前に定義されたベンチマークをフィードバックの記憶を通じてバイパスするのを防ぐ。 Instantiated Class Ratio(ICR)、Instantiated Property Ratio(IPR)、Class Instantiation(CI)など、いくつかの確立されたKG指標を用いる。自動リアルタイム監視フレームワークは、決定論的KGとLLM生成KGのずれを連続的に計算する。歴史的構造量分布に基づく動的異常しきい値を確立することにより,本手法は有意な偏差を積極的に識別し,フラグを立て,意味的異常や幻覚を迅速に検出する。決定論的および動的に生成されたKGのこの構造化された計量駆動比較は、堅牢でスケーラブルな評価フレームワークを提供する。

論文の概要: Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures

関連論文リスト