Fugu-MT 論文翻訳(概要): COREKG: Coreset-Guided Personalized Summarization of Knowledge Graphs

論文の概要: COREKG: Coreset-Guided Personalized Summarization of Knowledge Graphs

arxiv url: http://arxiv.org/abs/2605.14900v1
Date: Thu, 14 May 2026 14:40:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.882985
Title: COREKG: Coreset-Guided Personalized Summarization of Knowledge Graphs
Title（参考訳）: COREKG:知識グラフのコアセットガイドによるパーソナライズ
Authors: Sohel Aman Khan, Raghava Mutharaju, Supratim Shit,
Abstract要約: 感度重要度サンプリングを用いて三重項の関連部分集合をサンプリングする手法を提案する。クエリの振る舞いに基づいて,ユーザ毎の要約を構築することで,パーソナライズされた知識グラフの要約に着目する。 Freebase, WikiData, DBpedia による評価の結果, COREKG は最先端の手法よりも高い問合せ精度と構造的カバレッジを提供することがわかった。
参考スコア（独自算出の注目度）: 2.8292841621378844
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Knowledge Graphs (KGs) are extensively used across different domains and in several applications. Often, these KGs are very large in size. Such KGs become unwieldy for tasks such as question answering and visualization. Summarization of KGs offers a viable alternative in such cases. Furthermore, personalized KG summarization is crucial in the current data-driven world as it captures the specific requirements of users based on their query patterns. Since it only maintains relevant information, the personalized summaries of KG are small, resulting in significantly smaller storage requirements and query runtime. In this work, we adapt the coreset theory to create personalized KG summaries. For a given dataset and a user-specific query workload, we present an approach that samples a relevant subset of triples using sensitivity-based importance sampling. We ensure that the subset approximates the characteristics of the full dataset with bounded approximation error. We define sensitivity scores that measure the importance of a triple with respect to a user's query workload, which are then used by our coreset construction algorithm. We explicitly focus on personalized knowledge graph summarization by constructing summaries independently for each user based on their query behaviour. Our evaluation on Freebase, WikiData, and DBpedia shows that COREKG delivers higher query-answering accuracy and structural coverage than the state-of-the-art methods, such as GLIMPSE, PPR, iSummary, PEGASUS and APEX$^2$ while requiring only a tiny fraction of the original graph.
Abstract（参考訳）: 知識グラフ(KG)は様々なドメインやいくつかのアプリケーションで広く使われている。これらのKGは、しばしば非常に大きなサイズである。このようなKGは質問応答や可視化といったタスクでは扱いにくいものになっている。 KGs の要約はそのような場合に実行可能な代替手段を提供する。さらに、パーソナライズされたKG要約は、現在のデータ駆動の世界において重要であり、クエリパターンに基づいてユーザの特定の要求をキャプチャする。関連する情報のみを保持するため、KGのパーソナライズされた要約は小さくなり、ストレージ要件とクエリランタイムが大幅に小さくなる。本研究では,コアセット理論を適用して,個別化されたKG要約を生成する。与えられたデータセットとユーザ固有のクエリワークロードに対して、感度に基づく重要度サンプリングを用いてトリプルの関連するサブセットをサンプリングするアプローチを提案する。我々は、その部分集合が有界近似誤差で全データセットの特性に近似することを保証する。我々は,ユーザのクエリ処理量に対して三重項の重要性を測定する感度スコアを定義し,コアセット構築アルゴリズムで使用する。クエリの振る舞いに基づいて,ユーザ毎に個別に要約を構築することで,パーソナライズされた知識グラフの要約に着目する。 Freebase, WikiData, DBpedia での評価では, COREKG は GLIMPSE, PPR, iSummary, PEGASUS, APEX$^2$ といった最先端の手法よりもクエリ答えの精度と構造的カバレッジを向上し, 元のグラフのごく一部しか必要としない。

論文の概要: COREKG: Coreset-Guided Personalized Summarization of Knowledge Graphs

関連論文リスト