Fugu-MT 論文翻訳(概要): Constraint-Guided Multi-Agent Decompilation for Executable Binary Recovery

論文の概要: Constraint-Guided Multi-Agent Decompilation for Executable Binary Recovery

arxiv url: http://arxiv.org/abs/2604.23940v2
Date: Fri, 01 May 2026 21:28:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:49.420049
Title: Constraint-Guided Multi-Agent Decompilation for Executable Binary Recovery
Title（参考訳）: 実行可能バイナリリカバリのための制約型マルチエージェントデコンパイル
Authors: Yifan Zhang, Xiaohan Wang, Yueke Zhang, Yu Huang, Kevin Leach,
Abstract要約: 脱コンパイルは、セキュリティ分析、マルウェアのリバースエンジニアリング、レガシーソフトウェアメンテナンスに不可欠である。マルチレベル制約誘導型デコンパイル(MCGD)により,デコンパイルされたコードを再実行可能なソースに変換するマルチエージェントフレームワークを提案する。本フレームワークは,84～97%の再実行性を実現し,28～89ポイントのベースラインデコンパイラ出力を改善した。
参考スコア（独自算出の注目度）: 47.704311990064554
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Decompilation -- recovering source code from compiled binaries -- is essential for security analysis, malware reverse engineering, and legacy software maintenance. However, existing decompilers produce code that often fails to compile or execute correctly, limiting their practical utility. We present a multi-agent framework that transforms decompiled code into re-executable source through Multi-level Constraint-Guided Decompilation (MCGD). Our approach employs a hierarchical validation pipeline with three constraint levels: (1) syntactic correctness via parsing, (2) compilability via GCC, and (3) behavioral equivalence via LLM-generated test cases. When validation fails, specialized LLM agents iteratively refine the code using structured error feedback. We evaluate our framework on 1,641 real-world binaries from ExeBench across three decompilers (RetDec, Ghidra, and Angr). Our framework achieves 84-97% re-executability, improving baseline decompiler output by 28-89 percentage points. In comparison with state-of-the-art LLM-based decompilation methods using the same GPT-4o backbone, our approach (84.1%) outperforms LLM4Decompile (80.3%), SK2Decompile (73.9%), and SALT4Decompile (61.8%). Our ablation study reveals that execution-based validation is critical: compile-only approaches achieve 0% behavioral correctness despite 91-99% compilation rates. The system converges efficiently, with 90%+ binaries reaching correctness within 2 iterations at an average cost of $0.03-0.05 per binary. Our results demonstrate that constraint-guided agentic refinement can bridge the gap between raw decompiler output and practically useful source code.
Abstract（参考訳）: セキュリティ分析、マルウェアリバースエンジニアリング、レガシーソフトウェアメンテナンスには、逆コンパイル -- コンパイルされたバイナリからソースコードを復元する -- が不可欠である。しかし、既存の逆コンパイラは、しばしば正しくコンパイルや実行に失敗するコードを生成し、実用性を制限する。本稿では,マルチレベル制約誘導型デコンパイル(MCGD)を用いて,デコンパイルされたコードを再実行可能なソースに変換するマルチエージェントフレームワークを提案する。本手法では,(1)解析による構文的正当性,(2)GCCによるコンパイル性,(3)LCM生成テストケースによる行動等価性という3つの制約レベルを持つ階層的検証パイプラインを用いる。検証に失敗すると、特殊なLLMエージェントが構造化エラーフィードバックを使用してコードを反復的に洗練する。我々はExeBenchから3つの逆コンパイラ(RetDec,Ghidra,Angr)にまたがる1,641個の実世界のバイナリのフレームワークを評価した。本フレームワークは,84～97%の再実行性を実現し,28～89ポイントのベースラインデコンパイラ出力を改善した。同じGPT-4oバックボーンを用いた最新のLCMベースの逆コンパイル法と比較して、我々のアプローチ(84.1%)はLSM4Decompile (80.3%)、SK2Decompile (73.9%)、SALT4Decompile (61.8%)より優れている。 91-99%のコンパイル率にもかかわらず、コンパイルのみのアプローチは0%の動作正当性を達成します。システムは効率よく収束し、90%以上のバイナリが2回に1バイナリあたり0.03-0.05の平均的なコストで2回以内の正確性に達する。この結果から,制約誘導型エージェント改良により,生の逆コンパイラ出力と実用上有用なソースコードとのギャップを埋めることが可能であることが示唆された。

論文の概要: Constraint-Guided Multi-Agent Decompilation for Executable Binary Recovery

関連論文リスト