Fugu-MT 論文翻訳(概要): ProbeLogits: Kernel-Level LLM Inference Primitives for AI-Native Operating Systems

論文の概要: ProbeLogits: Kernel-Level LLM Inference Primitives for AI-Native Operating Systems

arxiv url: http://arxiv.org/abs/2604.11943v1
Date: Mon, 13 Apr 2026 18:32:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-15 19:11:32.060425
Title: ProbeLogits: Kernel-Level LLM Inference Primitives for AI-Native Operating Systems
Title（参考訳）: ProbeLogits: AIネイティブオペレーティングシステムのためのカーネルレベルLLM推論プリミティブ
Authors: Daeyeon Son,
Abstract要約: LLM推論を内部で実行するOSカーネルは、任意のテキストが生成される前にロジット分布を読み取ることができる。 ProbeLogitsはカーネルレベルのオペレーションで、単一のフォワードパスを実行し、特定のトークンログを読み、エージェントアクションを安全か危険であると分類します。私はAnima OSでProbeLogitsを実装しています。これは、80,400行のRustで書かれたベアメタルのx86_64 OSです。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: An OS kernel that runs LLM inference internally can read logit distributions before any text is generated -- and act on them as a governance primitive. I present ProbeLogits, a kernel-level operation that performs a single forward pass and reads specific token logits to classify agent actions as safe or dangerous, with zero learned parameters. On a 260-prompt OS action benchmark (9 categories including adversarial attacks), ProbeLogits achieves F1=0.980, Precision=1.000, and Recall=0.960 using a general-purpose 7B model at 4-bit quantization. On ToxicChat (1,000 human-annotated real conversations), it achieves F1=0.790 at default calibration strength $α$=1.0, improving to F1=0.837 at $α$=0.5 -- 89% of Llama Guard 3's F1~0.939 with zero learned parameters. A key design contribution is the calibration strength $α$, which serves as a deployment-time policy knob rather than a learned hyperparameter. By adjusting $α$, the OS can enforce strict policies for privileged operations ($α\geq 0.8$, maximizing recall) or relaxed policies for conversational agents ($α$=0.5, maximizing precision). Contextual calibration improves accuracy from 64.8% to 97.3% on the custom benchmark. I implement ProbeLogits within Anima OS, a bare-metal x86_64 OS written in 80,400 lines of Rust. Because agent actions must pass through kernel-mediated host functions, ProbeLogits enforcement operates below the WASM sandbox boundary, making it significantly harder to circumvent than application-layer classifiers. Each classification costs 65ms on 7B -- fast enough for per-action governance. I also show that treating KV cache as process state enables checkpoint, restore, and fork operations analogous to traditional process management. To my knowledge, no prior system exposes LLM logit vectors as OS-level governance primitives.
Abstract（参考訳）: LLM推論を内部で実行するOSカーネルは、テキストが生成される前にロジット分布を読み取ることができ、ガバナンスプリミティブとして機能する。 ProbeLogitsはカーネルレベルのオペレーションで、単一のフォワードパスを実行し、特定のトークンログを読み、エージェントアクションを安全または危険と分類し、学習パラメータをゼロにする。 260プロンプトのOSアクションベンチマーク(敵攻撃を含む9つのカテゴリ)では、ProbeLogitsはF1=0.980、Precision=1.000、Recall=0.960を4ビット量子化時に汎用7Bモデルを用いて達成している。 ToxicChat (1,000人の人間による実際の会話)では、デフォルトキャリブレーション強度$α$=1.0でF1=0.790を達成し、学習パラメータがゼロのラマガード3のF1~0.939のF1=0.837を$α$=0.5でF1=0.837に改善した。主要な設計コントリビューションはキャリブレーション強度$α$であり、これは学習されたハイパーパラメータではなく、デプロイ時のポリシーノブとして機能する。 α$を調整することで、OSは特権操作の厳格なポリシー(α\geq 0.8$、リコールの最大化)や会話エージェントの緩和ポリシー(α$=0.5、精度の最大化)を強制することができる。コンテキストキャリブレーションにより、カスタムベンチマークでは64.8%から97.3%に精度が向上する。私はAnima OSでProbeLogitsを実装しています。これは、80,400行のRustで書かれたベアメタルのx86_64 OSです。エージェントアクションはカーネルを介するホスト関数を経由しなければならないため、ProbeLogitsの実行はWASMサンドボックス境界の下で動作し、アプリケーション層分類器よりも回避が極めて難しい。それぞれの分類は7Bで65ミリ秒かかります。また、KVキャッシュをプロセス状態として扱うことで、従来のプロセス管理と同様のチェックポイント、リストア、フォーク操作が可能になります。私の知る限り、LLMロジットベクトルをOSレベルのガバナンスプリミティブとして公開する以前のシステムはありません。

論文の概要: ProbeLogits: Kernel-Level LLM Inference Primitives for AI-Native Operating Systems

関連論文リスト