Fugu-MT 論文翻訳(概要): LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

論文の概要: LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

arxiv url: http://arxiv.org/abs/2603.22519v2
Date: Mon, 30 Mar 2026 20:39:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-01 15:25:02.074932
Title: LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface
Title（参考訳）: LLMON: LLMインターフェースで構造とセマンティクスを活用するLLMネイティブなマークアップ言語
Authors: Michael Hind, Basel Shbita, Bo Wu, Farhan Ahmed, Chad DeLuca, Nathan Fulton, David Cox, Dan Gutfreund,
Abstract要約: LLM ネイティブなマークアップ言語 LLMON を導入し,テキストの構造と意味的メタデータを LLM に自然な方法で伝達できるようにする。この情報はモデルトレーニング、モデルプロンプト、推論実装で使用することができ、モデル精度、安全性、セキュリティが改善される。
参考スコア（独自算出の注目度）: 6.499636351596393
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Textual Large Language Models (LLMs) provide a simple and familiar interface: a string of text is used for both input and output. However, the information conveyed to an LLM often has a richer structure and semantics, which is not conveyed in a string. For example, most prompts contain both instructions ("Summarize this paper into a paragraph") and data (the paper to summarize), but these are usually not distinguished when passed to the model. This can lead to model confusion and security risks, such as prompt injection attacks. This work addresses this shortcoming by introducing an LLM-native mark-up language, LLMON (LLM Object Notation, pronounced "Lemon"), that enables the structure and semantic metadata of the text to be communicated in a natural way to an LLM. This information can then be used during model training, model prompting, and inference implementation, leading to improvements in model accuracy, safety, and security. This is analogous to how programming language types can be used for many purposes, such as static checking, code generation, dynamic checking, and IDE highlighting. We discuss the general design requirements of an LLM-native markup language, introduce the LLMON markup language and show how it meets these design requirements, describe how the information contained in a LLMON artifact can benefit model training and inference implementation, and provide some preliminary empirical evidence of its value for both of these use cases. We also discuss broader issues and research opportunities that are enabled with an LLM-native approach.
Abstract（参考訳）: Textual Large Language Models (LLMs) はシンプルで親しみやすいインタフェースを提供する:入力と出力の両方にテキストの文字列を使用する。しかし、LLMに伝達される情報は、しばしばよりリッチな構造と意味を持ち、文字列で伝達されない。例えば、ほとんどのプロンプトには、この論文を段落に要約する)とデータ(要約する)の両方が含まれているが、モデルに渡すと区別されないことが多い。これは、インジェクションインジェクション攻撃などのモデル混乱とセキュリティリスクにつながる可能性がある。この研究は、LLMネイティブなマークアップ言語LLMON(LLM Object Notation、略称「レモン」)を導入することで、この欠点に対処する。この情報はモデルトレーニング、モデルプロンプト、推論実装で使用することができ、モデル精度、安全性、セキュリティが改善される。これは静的チェック、コード生成、動的チェック、IDEハイライトなど、多くの目的でプログラミング言語の型が利用できる方法に類似している。 LLMネイティブなマークアップ言語の一般的な設計要件について議論し、LLMONマークアップ言語を導入し、それらの設計要件を満たす方法を示し、LLMONアーティファクトに含まれる情報がモデルトレーニングと推論の実装にどのように役立つかを説明し、これらのユースケースの双方に対してその価値の予備的な実証的証拠を提供する。また、LLMネイティブアプローチによって実現される幅広い問題や研究機会についても論じる。

論文の概要: LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

関連論文リスト