Fugu-MT 論文翻訳(概要): Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules

論文の概要: Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules

arxiv url: http://arxiv.org/abs/2508.10971v1
Date: Thu, 14 Aug 2025 16:41:47 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-18 14:51:23.623655
Title: Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules
Title（参考訳）: Rule2Text:知識グラフルールの自然言語記述の生成と評価のためのフレームワーク
Authors: Nasim Shirvani-Mahdavi, Chengkai Li,
Abstract要約: Rule2Textは、大規模言語モデルを利用して、マイニングされた論理ルールの自然言語説明を生成するフレームワークである。その結果,細調整後の説明品質が著しく向上し,特にドメイン固有のデータセットが向上した。
参考スコア（独自算出の注目度）: 0.998900149624725
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Knowledge graphs (KGs) can be enhanced through rule mining; however, the resulting logical rules are often difficult for humans to interpret due to their inherent complexity and the idiosyncratic labeling conventions of individual KGs. This work presents Rule2Text, a comprehensive framework that leverages large language models (LLMs) to generate natural language explanations for mined logical rules, thereby improving KG accessibility and usability. We conduct extensive experiments using multiple datasets, including Freebase variants (FB-CVT-REV, FB+CVT-REV, and FB15k-237) as well as the ogbl-biokg dataset, with rules mined using AMIE 3.5.1. We systematically evaluate several LLMs across a comprehensive range of prompting strategies, including zero-shot, few-shot, variable type incorporation, and Chain-of-Thought reasoning. To systematically assess models' performance, we conduct a human evaluation of generated explanations on correctness and clarity. To address evaluation scalability, we develop and validate an LLM-as-a-judge framework that demonstrates strong agreement with human evaluators. Leveraging the best-performing model (Gemini 2.0 Flash), LLM judge, and human-in-the-loop feedback, we construct high-quality ground truth datasets, which we use to fine-tune the open-source Zephyr model. Our results demonstrate significant improvements in explanation quality after fine-tuning, with particularly strong gains in the domain-specific dataset. Additionally, we integrate a type inference module to support KGs lacking explicit type information. All code and data are publicly available at https://github.com/idirlab/KGRule2NL.
Abstract（参考訳）: 知識グラフ(KG)は、ルールマイニングによって拡張することができるが、結果として生じる論理ルールは、その固有の複雑さと個々のKGの慣用的なラベル付け規則のために、人間が解釈することがしばしば困難である。この研究は、大規模言語モデル(LLM)を活用して、採掘された論理ルールの自然言語説明を生成する包括的なフレームワークであるRule2Textを提示し、KGアクセシビリティとユーザビリティを改善した。我々は、フリーベース変種(FB-CVT-REV、FB+CVT-REV、FB15k-237)やogbl-biokgデータセットなど、複数のデータセットを用いて広範な実験を行い、AMIE 3.5.1を用いてルールをマイニングした。ゼロショット, 少数ショット, 可変型組込み, チェーン・オブ・ソート推論など, 包括的なプロンプト戦略を網羅して, いくつかのLCMを体系的に評価した。モデルの性能を体系的に評価するために,生成した説明の正確さと明快さに関する人間による評価を行う。評価のスケーラビリティに対処するために,人間評価者との強い合意を示すLCM-as-a-judgeフレームワークを開発し,検証する。最高のパフォーマンスモデル(Gemini 2.0 Flash)、LCMの判断、ループ内のフィードバックを活用することで、高品質な地上真実データセットを構築し、オープンソースのZephyrモデルを微調整します。その結果,細調整後の説明品質が著しく向上し,特にドメイン固有のデータセットが向上した。さらに、明示的な型情報を持たないKGをサポートするために、型推論モジュールを統合する。すべてのコードとデータはhttps://github.com/idirlab/KGRule2NLで公開されている。

論文の概要: Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules

関連論文リスト