Fugu-MT 論文翻訳(概要): Readme_AI: Dynamic Context Construction for Large Language Models

論文の概要: Readme_AI: Dynamic Context Construction for Large Language Models

arxiv url: http://arxiv.org/abs/2509.19322v1
Date: Fri, 12 Sep 2025 20:34:58 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-28 15:30:14.415895
Title: Readme_AI: Dynamic Context Construction for Large Language Models
Title（参考訳）: Readme_AI: 大規模言語モデルのための動的コンテキスト構築
Authors: Millie Vyas, Timothy Blattner, Alden Dima,
Abstract要約: データソースのコンテキストを動的に構築できる仕様を提案する。データソースオーナは、データセット関連のクエリを推論する際に使用するLCMのメタデータを含むファイルを生成する。データソースからメタデータを取得して,コンテキスト構築に使用するReadme_AI Model Context Protocolサーバのプロトタイプを作成します。
参考スコア（独自算出の注目度）: 0.4726094039607201
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Despite being trained on significant amounts of data, Large Language Models (LLMs) can provide inaccurate or unreliable information in the context of a user's specific query. Given query-specific context significantly improves the usefulness of its responses. In this paper, we present a specification that can be used to dynamically build context for data sources. The data source owner creates the file containing metadata for LLMs to use when reasoning about dataset-related queries. To demonstrate our proposed specification, we created a prototype Readme_AI Model Context Protocol (MCP) server that retrieves the metadata from the data source and uses it to dynamically build context. Some features that make this specification dynamic are the extensible types that represent crawling web-pages, fetching data from data repositories, downloading and parsing publications, and general text. The context is formatted and grouped using user-specified tags that provide clear contextual information for the LLM to reason about the content. We demonstrate the capabilities of this early prototype by asking the LLM about the NIST-developed Hedgehog library, for which common LLMs often provides inaccurate and irrelevant responses containing hallucinations. With Readme_AI, the LLM receives enough context that it is now able to reason about the library and its use, and even generate code interpolated from examples that were included in the Readme_AI file provided by Hedgehog's developer. Our primary contribution is a extensible protocol for dynamically grounding LLMs in specialized, owner-provided data, enhancing responses from LLMs and reducing hallucinations. The source code for the Readme_AI tool is posted here: https://github.com/usnistgov/readme_ai .
Abstract（参考訳）: 大きな言語モデル(LLM)は、大量のデータに基づいてトレーニングされているにもかかわらず、ユーザの特定のクエリのコンテキストにおいて、不正確な情報や信頼性の低い情報を提供することができる。クエリ固有のコンテキストが与えられた場合、応答の有用性が大幅に向上する。本稿では,データソースのコンテキストを動的に構築するための仕様を提案する。データソースオーナは、データセット関連のクエリを推論する際に使用するLCMのメタデータを含むファイルを生成する。提案した仕様を実証するために,データソースからメタデータを取得して動的にコンテキストを構築するための,Readme_AI Model Context Protocol (MCP) サーバのプロトタイプを作成しました。この仕様を動的にする機能としては、クローリングWebページを表す拡張可能な型、データリポジトリからのデータのフェッチ、出版物のダウンロードと解析、一般的なテキストなどがある。コンテキストは、LLMがコンテンツを推論するための明確なコンテキスト情報を提供するユーザ指定タグを使用してフォーマットされ、グループ化される。我々は,NIST が開発した Hedgehog ライブラリについて LLM に質問することで,この初期プロトタイプの能力を実証する。 Readme_AIでは、LLMはライブラリとその使用について十分なコンテキストを受け取り、Hedgehogの開発者が提供するReadme_AIファイルに含まれるサンプルから解釈されたコードを生成することができる。我々の主な貢献は、特殊な所有者が提供するデータにLSMを動的に接地し、LSMからの応答を高め、幻覚を減らすための拡張可能なプロトコルである。 Readme_AIツールのソースコードは以下の通りである。

論文の概要: Readme_AI: Dynamic Context Construction for Large Language Models

関連論文リスト