Fugu-MT Paper Translation (Abstract): Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings

Paper overview: Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings

arxiv url: http://arxiv.org/abs/2508.06030v1
Date: Fri, 08 Aug 2025 05:32:31 GMT
Status: translation complete
Last updated in system: 2025-08-11 20:39:06.088062
Title: Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings
Authors: Kartik Sharma, Yiqiao Jin, Rakshit Trivedi, Srijan Kumar
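Abstract summary: Large language models (LLMs) acquire knowledge across diverse domains such as science, history, and geography. Existing probing methods require forward passes through the underlying model to probe an LLM's knowledge of a specific fact. The paper proposes using embedding models that effectively encode factual knowledge as text or graphs as proxies for LLMs.
Reference score (attention score computed by this site): 27.08405655200845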
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) acquire knowledge across diverse domains such as science, history, and geography encountered during generative pre-training. However, due to their stochasticity, it is difficult to predict what LLMs have acquired. Prior work has developed different ways to probe this knowledge by investigating the hidden representations, crafting specific task prompts, curating representative samples, and estimating their uncertainty. However, these methods require making forward passes through the underlying model to probe the LLM's knowledge about a specific fact, making them computationally expensive and time-consuming. To bridge this gap, we propose PEEK, or Proxy Embeddings to Estimate Knowledge of LLMs, by leveraging the pre-trained embedding models that effectively encode factual knowledge as text or graphs as proxies for LLMs. First, we identify a training set of facts known by LLMs through various probing strategies and then adapt embedding models to predict the LLM outputs with a linear decoder layer. Comprehensive evaluation on 3 Wikipedia-derived datasets, 4 LLMs, and 7 embedding models shows that embeddings can predict LLM knowledge on a held-out set with up to 90% accuracy. Furthermore, we find that sentence embedding models are more suitable than graph embeddings to predict LLM knowledge, shedding light on the underlying representation of the factual landscape. Thus, we believe that knowledge-adapted embeddings can be used to identify knowledge gaps in LLMs at scale and can provide deeper insights into LLMs' internal inductive bias. The code and data are made available at https://github.com/claws-lab/peek.
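The idea of predicting LLM knowledge from embeddings with a linear decoder can be illustrated with a minimal sketch. This is not the authors' implementation (that is in the repository linked in the abstract). It makes several simplifying assumptions: random vectors stand in for pre-trained fact embeddings, a hidden linear rule stands in for the "LLM knows this fact" labels that would come from probing a real model, and the embeddings are kept fixed so that only a single linear layer with a sigmoid is trained. All function names (`train_linear_probe`, `predict_known`, `make_synthetic_facts`) are hypothetical.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_linear_probe(X, y, lr=0.5, epochs=200):
    """Fit one linear layer plus sigmoid (logistic regression) by batch gradient descent."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            # Prediction error for this fact under the current weights.
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict_known(x, w, b):
    """Return 1 if the probe predicts the LLM knows the fact, else 0."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b > 0 else 0


def make_synthetic_facts(n=400, d=8, seed=0):
    # Assumption: random vectors play the role of fact embeddings, and a
    # hidden linear rule plays the role of "the LLM answered correctly".
    rng = random.Random(seed)
    hidden = [rng.gauss(0, 1) for _ in range(d)]
    X = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
    y = [1.0 if sum(h * v for h, v in zip(hidden, x)) > 0 else 0.0 for x in X]
    return X, y


X, y = make_synthetic_facts()
# Train on the first 300 facts and evaluate on the remaining held-out facts.
X_train, y_train, X_test, y_test = X[:300], y[:300], X[300:], y[300:]
w, b = train_linear_probe(X_train, y_train)
hits = sum(predict_known(x, w, b) == int(t) for x, t in zip(X_test, y_test))
held_out_accuracy = hits / len(X_test)
print(f"held-out accuracy: {held_out_accuracy:.2f}")
```

Once such a probe is trained, estimating whether the LLM knows a new fact costs one embedding lookup and one dot product rather than a forward pass through the LLM, which is the efficiency argument the abstract makes. In a realistic setting the synthetic vectors would be replaced by outputs of a sentence or graph embedding model, and the labels by results of probing an actual LLM.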
