Fugu-MT 論文翻訳(概要): Detecting and Understanding Real-World Differential Performance Bugs in Machine Learning Libraries

論文の概要: Detecting and Understanding Real-World Differential Performance Bugs in Machine Learning Libraries

arxiv url: http://arxiv.org/abs/2006.01991v1
Date: Wed, 3 Jun 2020 00:23:06 GMT
ステータス: 翻訳完了
システム内更新日: 2022-11-25 18:49:48.443052
Title: Detecting and Understanding Real-World Differential Performance Bugs in Machine Learning Libraries
Title（参考訳）: 機械学習ライブラリにおける実世界の差分パフォーマンスバグの検出と理解
Authors: Saeid Tizpaz-Niari and Pavol Cern\'y and Ashutosh Trivedi
Abstract要約: 同じサイズであるにもかかわらず、パフォーマンスが広く変化するインプットを見つけます。我々は,単一入力だけでなく,各クラスがそれぞれのサイズによってパラメータ化される類似入力を持つ入力のクラスの性能を比較する。重要なことに、パフォーマンスバグの修正に簡単に使用できる形式で、なぜパフォーマンスが異なるのか、という説明も提供します。
参考スコア（独自算出の注目度）: 2.879036956042183
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Programming errors that degrade the performance of systems are widespread, yet there is little tool support for analyzing these bugs. We present a method based on differential performance analysis---we find inputs for which the performance varies widely, despite having the same size. To ensure that the differences in the performance are robust (i.e. hold also for large inputs), we compare the performance of not only single inputs, but of classes of inputs, where each class has similar inputs parameterized by their size. Thus, each class is represented by a performance function from the input size to performance. Importantly, we also provide an explanation for why the performance differs in a form that can be readily used to fix a performance bug. The two main phases in our method are discovery with fuzzing and explanation with decision tree classifiers, each of which is supported by clustering. First, we propose an evolutionary fuzzing algorithm to generate inputs. For this fuzzing task, the unique challenge is that we not only need the input class with the worst performance, but rather a set of classes exhibiting differential performance. We use clustering to merge similar input classes which significantly improves the efficiency of our fuzzer. Second, we explain the differential performance in terms of program inputs and internals. We adapt discriminant learning approaches with clustering and decision trees to localize suspicious code regions. We applied our techniques to a set of applications. On a set of micro-benchmarks, we show that our approach outperforms state-of-the-art fuzzers in finding inputs to characterize the differential performance. On a set of case-studies, we discover and explain multiple performance bugs in popular machine learning frameworks. Four of these bugs, reported first in this paper, have since been fixed by the developers.
Abstract（参考訳）: システムの性能を低下させるエラーのプログラミングは広く行われているが、これらのバグを分析するツールがほとんどない。そこで本研究では,差分性能解析に基づく手法を提案する。性能の違いが頑健であることを保証するため(すなわち、大きな入力に対しても保持する)、各クラスがそれぞれのサイズでパラメータ化された類似の入力を持つ入力クラスのパフォーマンスを比較する。これにより、各クラスは、入力サイズからパフォーマンスまでのパフォーマンス関数で表される。重要なことに、パフォーマンスバグの修正に簡単に使用できる形式で、なぜパフォーマンスが異なるのか、という説明も提供します。本手法の主な2つのフェーズはファジィによる発見と決定木分類器による説明であり,それぞれがクラスタリングによってサポートされている。まず,入力を生成する進化的ファジングアルゴリズムを提案する。この面倒なタスクにとって、ユニークな課題は、最悪のパフォーマンスを持つ入力クラスが必要なだけでなく、異なるパフォーマンスを示すクラスセットが必要であることです。我々はクラスタリングを用いて類似の入力クラスをマージし、ファザの効率を大幅に向上させる。第二に、プログラム入力と内部の差分性能について説明する。我々は、疑わしいコード領域をローカライズするために、クラスタリングと決定木を用いて識別学習アプローチを適用する。私たちはそのテクニックを一連のアプリケーションに適用した。マイクロベンチマークのセットでは,我々のアプローチが,差分性能を特徴付ける入力を見つける際に最先端のファザーよりも優れていることを示す。ケーススタディのセットでは、人気のある機械学習フレームワークで複数のパフォーマンスバグを発見し、説明します。この論文で最初に報告された4つのバグは、その後開発者によって修正された。

論文の概要: Detecting and Understanding Real-World Differential Performance Bugs in Machine Learning Libraries

関連論文リスト