Fugu-MT 論文翻訳(概要): An Automated Framework for Input Alphabet Construction in Stateful Protocol Implementation Learning

論文の概要: An Automated Framework for Input Alphabet Construction in Stateful Protocol Implementation Learning

arxiv url: http://arxiv.org/abs/2606.23464v1
Date: Mon, 22 Jun 2026 15:14:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 18:53:55.204906
Title: An Automated Framework for Input Alphabet Construction in Stateful Protocol Implementation Learning
Title（参考訳）: ステートフルプロトコル実装学習における入力Alphabet構築のための自動フレームワーク
Authors: JiongHan Wang, WenChao Huang,
Abstract要約: ステートフルなプロトコル実装のための一般的な分析技術であるステート機械学習は、手書き入力アルファベットから派生したコアボトルネックに悩まされている。我々は、プロトコルメッセージレイアウトを解析し、構造化突然変異規則に従って候補入力シンボルを生成するために、大きな言語モデルを採用する。連続的なアルファベットの増大に伴うオーバヘッドの増大を考慮して,新たなアルファベットエントリを組み込む際に,既存の学習オートマトンを再利用するためのミニバッチインクリメンタル学習戦略を導入する。
参考スコア（独自算出の注目度）: 0.8536845899508164
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: As a prevalent analytical technique for stateful protocol implementations, state machine learning suffers from a core bottleneck stemming from handcrafted input alphabets. Manual alphabet definition inherently limits the completeness of input exploration, making it difficult to capture anomalous non-conformant messages and consequently missing latent semantic defects. In this paper, we target automatic input alphabet generation to break the above limitation for state machine learning. We adopt large language models to parse protocol message layouts and produce candidate input symbols following structured mutation rules, which automatically covers valid and invalid message spaces and eliminates reliance on manual protocol expertise. Considering the rising overhead brought by continuously growing alphabets, we introduce a mini-batch incremental learning strategy to reuse existing learned automata when incorporating new alphabet entries. Comprehensive experiments on practical protocol stacks indicate our approach can reproduce existing security vulnerabilities and identify novel semantic bugs. A subset of these newly discovered issues has been confirmed and patched by developers, proving the practicability and effectiveness of our proposed method.
Abstract（参考訳）: ステートフルなプロトコル実装の一般的な分析手法として、ステート機械学習は手作りの入力アルファベットに由来するコアボトルネックに悩まされている。手動アルファベットの定義は本質的に入力探索の完全性を制限し、異常な非整合メッセージのキャプチャが困難になり、結果として潜在意味の欠陥が欠落する。本稿では、状態機械学習における上記の制限を破るために、自動入力アルファベット生成を目標とする。我々は、プロトコルのレイアウトを解析し、構造化された変更規則に従って候補入力シンボルを生成するために、大きな言語モデルを採用し、有効かつ無効なメッセージ空間を自動でカバーし、手動のプロトコルの専門知識に依存しない。連続的なアルファベットの増大に伴うオーバヘッドの増大を考慮すると、新しいアルファベットエントリを組み込む際に既存の学習オートマトンを再利用するためのミニバッチインクリメンタル学習戦略を導入する。実用的なプロトコルスタックに関する包括的な実験は、我々のアプローチが既存のセキュリティ脆弱性を再現し、新しいセマンティックなバグを識別できることを示している。新たに発見された問題のサブセットが開発者によって確認され,提案手法の有効性と有効性が確認された。

論文の概要: An Automated Framework for Input Alphabet Construction in Stateful Protocol Implementation Learning

関連論文リスト