Fugu-MT 論文翻訳(概要): Temporal Modeling of Change History for Black-Box Test Suite Minimization

論文の概要: Temporal Modeling of Change History for Black-Box Test Suite Minimization

arxiv url: http://arxiv.org/abs/2605.25441v1
Date: Mon, 25 May 2026 05:37:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:19.329148
Title: Temporal Modeling of Change History for Black-Box Test Suite Minimization
Title（参考訳）: ブラックボックステストスイート最小化のための変更履歴の時間モデル
Authors: Kamruzzaman Asif, Md. Siam, Kazi Sakib,
Abstract要約: テストスイート最小化(TSM)は、テストスイートのサイズを削減し、障害検出機能を保持する。最近の変更履歴は、ブラックボックスTSMを導くための軽量でスケーラブルな指標として研究されている。ブラックボックスTSMに時間的モデリングを導入し、時間的リスク駆動テストスイート最小化(TRTM)を提案する。
参考スコア（独自算出の注目度）: 0.5798758080057375
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Test Suite Minimization (TSM) reduces the size of test suites while preserving their fault detection capability. In black-box TSM, reduction is performed without relying on production-code instrumentation. While several black-box TSM approaches have explored metrics like test logs or test similarity, these often suffer from scalability and efficiency issues. Recently, change history has been explored as a lightweight and scalable indicator for guiding black-box TSM. However, existing approaches treat historical modifications uniformly, ignoring the temporal dynamics of software evolution where recently modified code tends to be more fault-prone. To address this limitation, we introduce temporal modeling into black-box TSM and propose Temporal Risk-driven Test Suite Minimization (TRTM). TRTM extracts modification history from version-control metadata and applies exponential temporal attenuation to weight changes based on recency, producing time-weighted class-level risk scores that reflect fault-proneness. Next, it determines dependencies between test cases and production classes by constructing static call graphs derived solely from test code, preserving the black-box setting. The risk scores of the classes exercised by each test case are then aggregated using statistical measures such as Average and Geometric Mean to compute a risk score for the test case. Finally, test cases with the highest risk scores are selected to construct the reduced suite. Evaluation on a large dataset containing 14 projects with 631 versions shows that TRTM consistently outperforms the state-of-the-art baseline, achieving a mean Accuracy of 0.72 (vs. 0.66) and Fault Detection Rate (FDR) of 0.75 (vs. 0.69), while also reducing execution time.
Abstract（参考訳）: テストスイート最小化(TSM)は、テストスイートのサイズを削減し、障害検出機能を保持する。ブラックボックスTSMでは、生産コードインスツルメンテーションに頼ることなく削減を行う。いくつかのブラックボックスのTSMアプローチでは、テストログやテストの類似性といったメトリクスを探索しているが、スケーラビリティと効率性の問題があることが多い。近年、変更履歴はブラックボックスTSMを導くための軽量でスケーラブルな指標として研究されている。しかし、既存のアプローチは、ソフトウェア進化の時間的ダイナミクスを無視して、歴史的修正を均一に扱う。この制限に対処するために、ブラックボックスTSMに時間モデルを導入し、時間的リスク駆動テストスイート最小化(TRTM)を提案する。 TRTMは、バージョン管理メタデータから変更履歴を抽出し、リフレレンスに基づく重み変化に指数的時間減衰を適用し、フォールト・プレネネスを反映した時間重み付きクラスレベルのリスクスコアを生成する。次に、テストコードから派生した静的なコールグラフを構築し、ブラックボックスの設定を保存することで、テストケースと本番クラス間の依存関係を決定する。各テストケースで実行されたクラスのリスクスコアは、平均値や幾何平均などの統計値を用いて集計され、テストケースのリスクスコアが計算される。最後に、リスクスコアが最も高いテストケースを選択して、削減スイートを構築する。 631バージョンを含む14のプロジェクトを含む大規模なデータセットの評価によると、TRTMは最先端のベースラインを一貫して上回り、平均精度は0.72(vs. 0.66)、故障検出率(FDR)は0.75(vs. 0.69)であり、実行時間を短縮している。

論文の概要: Temporal Modeling of Change History for Black-Box Test Suite Minimization

関連論文リスト