Fugu-MT 論文翻訳(概要): PrimeKG-CL: A Continual Graph Learning Benchmark on Evolving Biomedical Knowledge Graphs

論文の概要: PrimeKG-CL: A Continual Graph Learning Benchmark on Evolving Biomedical Knowledge Graphs

arxiv url: http://arxiv.org/abs/2605.10529v1
Date: Mon, 11 May 2026 13:14:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.83753
Title: PrimeKG-CL: A Continual Graph Learning Benchmark on Evolving Biomedical Knowledge Graphs
Title（参考訳）: PrimeKG-CL: バイオメディカル知識グラフの進化に関する継続的なグラフ学習ベンチマーク
Authors: Yousef A. Radwan, Yao Li, Qing Qing, Ziqi Xu, Xingtong Yu, Jiaxing Huang, Renqiang Luo, Xikun Zhang,
Abstract要約: 既存の連続グラフ学習は、ほとんど完全に合成ランダムな一般的なKGについて研究されている。 9つの信頼できるバイオメディカルデータベースから構築されたベンチマークを導入する。我々は、4つのKGEデコーダ、LKGE、CK-hugging agent、CMKLの6つのCL戦略を評価した。
参考スコア（独自算出の注目度）: 19.859134741964805
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Biomedical knowledge graphs underwrite drug repurposing and clinical decision support, yet the upstream ontologies they depend on update on independent cycles that add millions of edges and deprecate hundreds of thousands more between releases. Yet existing continual graph learning has been studied almost exclusively on synthetic random splits of static, generic KGs, a regime that cannot reproduce the asynchronous, structured evolution real biomedical KGs undergo. To this end, we introduce PrimeKG-CL, a CGL benchmark built from nine authoritative biomedical databases (129K+ nodes, 8.1M+ edges, 10 node types, 30 relation types) with two genuine temporal snapshots (June 2021, July 2023; 5.83M edges added, 889K removed, 7.21M persistent), 10 entity-type-grouped tasks, multimodal node features, and a per-task persistent/added/removed test stratification. On three tasks (biomedical relationship prediction, entity classification, KGQA), we evaluate six CL strategies across four KGE decoders, plus LKGE, an LLM-RAG agent, and CMKL. We find that decoder choice and continual learning strategy interact strongly: no single strategy performs best across all decoders, and mismatched combinations can significantly degrade performance. Moreover, only DistMult exhibits a clear separation between persistent and deprecated knowledge, indicating that standard metrics conflate retention of still-valid facts with failure to forget outdated ones; this effect is absent under RotatE. In addition, multimodal features improve entity-level tasks by up to 60%, and a recent CKGE framework (IncDE) failed to scale to our 5.67M-triple base task across five attempts up to 350GB RAM. Data, pipeline, baselines, and the stratified split are released openly. Dataset:huggingface.co/datasets/yradwan147/PrimeKGCL|Code:github.com/yradwan147/primekg-cl-neurips2026
Abstract（参考訳）: バイオメディカル・ナレッジグラフは、医薬品の買い戻しと臨床的な意思決定を支援するが、上流のオントロジーは、数百万のエッジを追加し、リリースの間に何十万ものものを非推奨にする独立したサイクルの更新に依存している。しかし、既存の連続グラフ学習は、静的な一般的なKGの合成ランダムスプリットにのみ研究されており、これは非同期で構造化された実際の生物医学的なKGを再現できない状態である。この目的のために,9つの信頼できるバイオメディカルデータベース(129K+ノード,8.1M+エッジ,10ノードタイプ,30の関係型)と2つの真の時間的スナップショット(2021年6月,2023年7月,5.83Mエッジの追加,889K削除,7.21M永続性),エンティティタイプグループタスク10,マルチモーダルノード機能,タスク毎の永続/追加/削除テスト層によるCGLベンチマークであるPrimeKG-CLを紹介した。 3つのタスク(生物関係予測,エンティティ分類,KGQA)において,4つのKGEデコーダ,LKGE,LLM-RAGエージェント,CMKLの6つのCL戦略を評価する。デコーダの選択と継続学習戦略は、すべてのデコーダで最善を尽くす戦略はなく、ミスマッチした組み合わせはパフォーマンスを著しく低下させる可能性がある。さらに、DistMultだけが永続的な知識と非推奨な知識の明確な分離を示しており、標準メトリクスが、古い事実を忘れることに失敗して、まだ有効な事実の保持を強調していることを示している。さらに、マルチモーダル機能はエンティティレベルのタスクを最大60%改善し、最近のCKGEフレームワーク(IncDE)は、最大350GB RAMまでの5回までの5.67Mトリプルベースタスクにスケールできなかった。データ、パイプライン、ベースライン、階層化された分割は、オープンにリリースされる。 Dataset:huggingface.co/datasets/yradwan147/PrimeKGCL|Code:github.com/yradwan147/primekg-cl-neurips2026

論文の概要: PrimeKG-CL: A Continual Graph Learning Benchmark on Evolving Biomedical Knowledge Graphs

関連論文リスト