Fugu-MT 論文翻訳(概要): Revisiting Code Debloating with Ground Truth-based Evaluation

論文の概要: Revisiting Code Debloating with Ground Truth-based Evaluation

arxiv url: http://arxiv.org/abs/2604.17717v2
Date: Tue, 21 Apr 2026 17:43:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 14:04:47.926415
Title: Revisiting Code Debloating with Ground Truth-based Evaluation
Title（参考訳）: 地中真理に基づく評価によるコードのデブロ化の再検討
Authors: Muhammad Bilal, Moiz Ali, Mohit Kumar, Fareed Zaffar, Fahad Shaon, Ashish Gehani, Sazzadur Rahaman,
Abstract要約: プログラムデブロは、パフォーマンスオーバーヘッド、アタックサーフェス、メンテナンスコストを削減するために、未使用のコードを削除することを目的としている。その中心的な役割にもかかわらず、アプリケーションレベルのデブロは、パフォーマンスを測定するために不完全なプロキシに依存し続けている。我々は,地道な評価パラダイムを通じて,アプリケーションレベルのデブロ化の基礎を再考する。
参考スコア（独自算出の注目度）: 5.955975465516521
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Program debloating aims to remove unused code to reduce performance overhead, attack surfaces, and maintenance costs. Over time, debloating has evolved across multiple layers (container, library, and application), each building on the principles of application-level debloating. Despite its central role, application-level debloating continues to rely on imperfect proxies for measuring performance, such as test-case-driven evaluation for correctness, code size for runtime efficiency, and gadget count reduction for estimating security posture. While there is widespread skepticism about using such imperfect proxies, the community still lacks standardized methodologies or benchmarks to assess the true performance of application-level software debloating. This experience paper aims to address the gap. We revisit the foundations of application-level debloating through a ground-truth-based evaluation paradigm. Our analysis of eight state-of-the-art debloaters - Blade, Chisel, Cov, CovA, Lmcas, Trimmer, Occam, and Razor - uncovers insights previously unattainable through traditional evaluations. These tools collectively span the spectrum of source-to-source, IR-to-IR, and binary-to-binary transformation paradigms, characterizing a holistic reassessment across abstraction levels. Our analysis reveals that while dynamic analysis-based tools often remove up to 94% of code that should be retained, static analysis-based approaches exhibit the opposite behavior, showing high false retention rates due to coarse-grained dependency over-approximation. Additionally, static analyses may add code by introducing specialized variants of functions. False retentions and removals not only cause functional incorrectness but may also lead to systematic inconsistency, robustness failures, and exploitable vulnerabilities.
Abstract（参考訳）: プログラムデブロは、パフォーマンスオーバーヘッド、アタックサーフェス、メンテナンスコストを削減するために、未使用のコードを削除することを目的としている。時間の経過とともに、デブロは複数のレイヤ(コンテナ、ライブラリ、アプリケーション)にわたって進化し、それぞれがアプリケーションレベルのデブロの原則に基づいて構築されている。その中心的な役割にもかかわらず、アプリケーションレベルのデ肥大化は、テストケース駆動による正確性の評価、実行時のコードサイズ、セキュリティ姿勢を推定するためのガジェット数削減など、パフォーマンス測定のための不完全なプロキシに依存し続けている。このような不完全なプロキシの使用には懐疑論が広まっていますが、アプリケーションレベルのソフトウェアデ肥大化の真のパフォーマンスを評価するための標準化された方法論やベンチマークはいまだに欠如しています。この経験論文はそのギャップに対処することを目的としている。我々は,地道な評価パラダイムを通じて,アプリケーションレベルのデブロ化の基礎を再考する。私たちの分析では、Blade、Chisel、Cov、CovA、Lmcas、Trimmer、Occam、Razorの8つの最先端のデブロアが、従来の評価では達成不可能な洞察を明らかにしています。これらのツールは、ソース・ツー・ソース、IR-to-IR、バイナリ・ツー・バイナリ・トランスフォーメーションのパラダイムのスペクトルを網羅し、抽象レベルでの全体的再評価を特徴付ける。我々の分析では、動的解析ベースのツールは保持すべきコードの最大94%を除去することが多いが、静的解析ベースのアプローチでは逆の振る舞いを示し、粗い依存性の過剰な近似による偽保持率が高い。さらに静的解析は、関数の特別な変種を導入することで、コードを追加することができる。不正な保持と削除は機能的不正を引き起こすだけでなく、体系的不整合、堅牢性障害、悪用可能な脆弱性を引き起こす可能性がある。

論文の概要: Revisiting Code Debloating with Ground Truth-based Evaluation

関連論文リスト