Fugu-MT 論文翻訳(概要): On the Illusion of Success: An Empirical Study of Build Reruns and Silent Failures in Industrial CI

論文の概要: On the Illusion of Success: An Empirical Study of Build Reruns and Silent Failures in Industrial CI

arxiv url: http://arxiv.org/abs/2509.14347v1
Date: Wed, 17 Sep 2025 18:26:29 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-19 17:26:52.941579
Title: On the Illusion of Success: An Empirical Study of Build Reruns and Silent Failures in Industrial CI
Title（参考訳）: 成功の幻想--産業CIにおけるビルドリランと無実の失敗の実証的研究
Authors: Henri Aïdasso, Francis Bordeleau, Ali Tizghadam,
Abstract要約: 本報告では, サイレント障害の初体験的研究について, 事業再開の実践を通して紹介する。 81の工業プロジェクトにおける142,387の雇用の分析によると、成功した雇用の11%が再雇用され、その35%が24時間以上経過した後に行われる。成功したジョブの再実行に関連する主な要因は、テストと静的解析タスク、Shellのようなスクリプト言語、そして開発者が再実行する傾向である。
参考スコア（独自算出の注目度）: 1.2744523252873348
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Reliability of build outcomes is a cornerstone of effective Continuous Integration (CI). Yet in practice, developers often struggle with non-deterministic issues in the code or CI infrastructure, which undermine trust in build results. When faced with such unexpected outcomes, developers often repeatedly rerun jobs hoping for true success, but this practice is known to increase CI costs and reduce productivity. While recent studies have focused on intermittent job failures, no prior work has investigated silent failures, where build jobs are marked as successful but fail to complete all or part of their tasks. Such silent failures often go unnoticed, creating an illusion of success with detrimental consequences such as bugs escaping into production. This paper presents the first empirical study of silent failures through the practice of rerunning successful jobs. An analysis of 142,387 jobs across 81 industrial projects shows that 11% of successful jobs are rerun, with 35% of these reruns occurring after more than 24 hours. Using mixed-effects models on 32 independent variables (AUC of 85%), we identified key factors associated with reruns of successful jobs, notably testing and static analysis tasks, scripting languages like Shell, and developers prior rerun tendencies. A further analysis of 92 public issues revealed 11 categories of silent failures aligning with these factors, the most frequent being artifact operation errors, caching errors, and ignored exit codes. Overall, our findings provide valuable insights into the circumstances and causes of silent failures to raise awareness among teams, and present solutions to improve CI reliability.
Abstract（参考訳）: ビルド結果の信頼性は、効果的な継続的インテグレーション(CI)の基礎となります。しかし実際には、開発者はコードやCIインフラストラクチャの非決定論的問題に苦しむことが多く、ビルド結果への信頼を損なう。このような予期せぬ結果に直面した場合、開発者は真の成功を期待して繰り返しジョブを再実行しますが、このプラクティスはCIコストを増やし、生産性を低下させることで知られています。最近の研究では、断続的なジョブの失敗に焦点が当てられているが、前回の作業では、ビルドジョブが成功したとマークされているが、そのタスクのすべてまたは一部を完了できないサイレントな失敗を調査していない。このような静かな失敗は、しばしば気付かれず、生産から逃れるバグのような有害な結果によって成功の錯覚を生み出す。本報告では, サイレント障害の初体験的研究について, 事業再開の実践を通して紹介する。 81の工業プロジェクトにおける142,387の雇用の分析によると、成功した雇用の11%が再雇用され、その35%が24時間以上経過した後に行われる。 32個の独立変数(AUCの85%)で混合効果モデルを使用することで、特にテストや静的解析タスク、Shellのようなスクリプト言語、そして開発者が再実行する傾向など、ジョブの再実行に関連する重要な要素を特定しました。 92の公開問題のさらなる分析では、これらの要因に沿った11のサイレント障害が明らかになった。全体として、私たちの調査結果は、チーム間の認識を高めるためのサイレント障害の状況と原因に関する貴重な洞察を与え、CI信頼性を改善するためのソリューションを提示します。

論文の概要: On the Illusion of Success: An Empirical Study of Build Reruns and Silent Failures in Industrial CI

関連論文リスト