Fugu-MT 論文翻訳(概要): Evaluating LLMs on Large-Scale Graph Property Estimation via Random Walks

論文の概要: Evaluating LLMs on Large-Scale Graph Property Estimation via Random Walks

arxiv url: http://arxiv.org/abs/2605.01484v1
Date: Sat, 02 May 2026 15:11:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:49.797775
Title: Evaluating LLMs on Large-Scale Graph Property Estimation via Random Walks
Title（参考訳）: ランダムウォークによる大規模グラフ特性推定におけるLLMの評価
Authors: Sunil Kumar Maurya, Xin Liu,
Abstract要約: 実世界のデータでは、グラフのサイズは著しく大きくなり、多くの場合、完全にアクセスできない。大規模グラフベンチマークデータセットであるEstGraphを提案し,大規模グラフ特性を推定するための4つの異なるタスクを提案する。我々は,これらのタスクにおけるLLMの推論能力について,多種多様なグラフデータセットを用いて評価する。
参考スコア（独自算出の注目度）: 4.0636315905272475
License: http://creativecommons.org/licenses/by/4.0/
Abstract: With the rapidly improving reasoning abilities of Large Language Models (LLMs), there is also a rising demand to use them in a wide variety of domains. This brings about the need to carefully evaluate the limits of the capabilities of these models with various tests and benchmarks. Graph structures are ubiquitous in real-world data, and are often used to represent and analyze relationship patterns within data. Many benchmarks have already been proposed in the graph literature to test the reasoning ability of LLMs to follow and execute graph algorithms. However, due to the limited context length of LLMs, these benchmarks consist of very small graphs. In real-world data, the size of graphs can be significantly larger, and in many cases, not fully accessible. In this paper, we examine a class of problems that arises with very large graphs having limited accessibility. We propose a large graph benchmark dataset, EstGraph, and introduce four distinct tasks designed to estimate large graph properties. We evaluate the reasoning abilities of LLMs on these tasks using a wide variety of graph datasets. In addition, we provide task-specific prompt constructions based on random walk sampling of large graphs (up to millions of nodes) that effectively convey sufficient information to LLMs within the limits of context length.
Abstract（参考訳）: LLM(Large Language Models)の推論能力が急速に向上し、さまざまな領域でそれらを使用する必要性が高まっている。これにより、様々なテストとベンチマークでこれらのモデルの能力の限界を慎重に評価する必要がある。グラフ構造は現実世界のデータでユビキタスであり、しばしばデータ内の関係パターンを表現および分析するために使用される。グラフアルゴリズムの追従と実行に関するLSMの推論能力をテストするために、グラフ文献にはすでに多くのベンチマークが提案されている。しかし、LLMのコンテキスト長が限られているため、これらのベンチマークは非常に小さなグラフで構成されている。実世界のデータでは、グラフのサイズは著しく大きくなり、多くの場合、完全にアクセスできない。本稿では,アクセシビリティが制限されたグラフで発生する問題のクラスについて検討する。大規模グラフベンチマークデータセットであるEstGraphを提案し,大規模グラフ特性を推定するための4つの異なるタスクを提案する。我々は,これらのタスクにおけるLLMの推論能力について,多種多様なグラフデータセットを用いて評価する。さらに,大規模グラフ(最大数百万ノード)のランダムウォークサンプリングに基づくタスク固有のプロンプト構築を行い,文脈長の制限内でLLMに十分な情報を効果的に伝達する。

論文の概要: Evaluating LLMs on Large-Scale Graph Property Estimation via Random Walks

関連論文リスト