Defect Prediction Using Stylistic Metrics
- URL: http://arxiv.org/abs/2206.10959v2
- Date: Thu, 23 Jun 2022 11:49:31 GMT
- Title: Defect Prediction Using Stylistic Metrics
- Authors: Rafed Muhammad Yasir, Moumita Asad, Dr. Ahmedul Kabir
- Abstract summary: This paper aims at analyzing the impact of stylistic metrics on both within-project and crossproject defect prediction.
Experiment is conducted on 14 releases of 5 popular, open source projects.
- Score: 2.286041284499166
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Defect prediction is one of the most popular research topics due to its
potential to minimize software quality assurance efforts. Existing approaches
have examined defect prediction from various perspectives such as complexity
and developer metrics. However, none of these consider programming style for
defect prediction. This paper aims at analyzing the impact of stylistic metrics
on both within-project and crossproject defect prediction. For prediction, 4
widely used machine learning algorithms namely Naive Bayes, Support Vector
Machine, Decision Tree and Logistic Regression are used. The experiment is
conducted on 14 releases of 5 popular, open source projects. F1, Precision and
Recall are inspected to evaluate the results. Results reveal that stylistic
metrics are a good predictor of defects.
Related papers
- Variance of ML-based software fault predictors: are we really improving
fault prediction? [0.3222802562733786]
We experimentally analyze the variance of a state-of-the-art fault prediction approach.
We observed a maximum variance of 10.10% in terms of the per-class accuracy metric.
arXiv Detail & Related papers (2023-10-26T09:31:32Z) - Explainable Software Defect Prediction from Cross Company Project
Metrics Using Machine Learning [5.829545587965401]
This study focuses on developing defect prediction models that apply various machine learning algorithms.
One notable issue in existing defect prediction studies is the lack of transparency in the developed models.
arXiv Detail & Related papers (2023-06-14T17:46:08Z) - Generalizability Analysis of Graph-based Trajectory Predictor with
Vectorized Representation [29.623692599892365]
Trajectory prediction is one of the essential tasks for autonomous vehicles.
Recent progress in machine learning gave birth to a series of advanced trajectory prediction algorithms.
arXiv Detail & Related papers (2022-08-06T20:19:52Z) - HEDP: A Method for Early Forecasting Software Defects based on Human
Error Mechanisms [1.0660480034605238]
The main process behind a software defect is that an error-prone scenario triggers human error modes.
The proposed idea emphasizes predicting the exact location and form of a possible defect.
arXiv Detail & Related papers (2021-10-13T14:44:23Z) - Learning to Predict Trustworthiness with Steep Slope Loss [69.40817968905495]
We study the problem of predicting trustworthiness on real-world large-scale datasets.
We observe that the trustworthiness predictors trained with prior-art loss functions are prone to view both correct predictions and incorrect predictions to be trustworthy.
We propose a novel steep slope loss to separate the features w.r.t. correct predictions from the ones w.r.t. incorrect predictions by two slide-like curves that oppose each other.
arXiv Detail & Related papers (2021-09-30T19:19:09Z) - Hessian-based toolbox for reliable and interpretable machine learning in
physics [58.720142291102135]
We present a toolbox for interpretability and reliability, extrapolation of the model architecture.
It provides a notion of the influence of the input data on the prediction at a given test point, an estimation of the uncertainty of the model predictions, and an agnostic score for the model predictions.
Our work opens the road to the systematic use of interpretability and reliability methods in ML applied to physics and, more generally, science.
arXiv Detail & Related papers (2021-08-04T16:32:59Z) - Moving from Cross-Project Defect Prediction to Heterogeneous Defect
Prediction: A Partial Replication Study [0.0]
Earlier studies often used machine learning techniques to build, validate, and improve bug prediction models.
Knowledge coming from those models will not be overlapping to a target project if no sufficient metrics have been collected in the source projects.
We systematically integrated Heterogeneous Defect Prediction (HDP) by replicating and validating the obtained results.
Our results shed light on the infeasibility of many cases for the HDP algorithm due to its sensitivity to the parameter selection.
arXiv Detail & Related papers (2021-03-05T06:29:45Z) - Towards More Fine-grained and Reliable NLP Performance Prediction [85.78131503006193]
We make two contributions to improving performance prediction for NLP tasks.
First, we examine performance predictors for holistic measures of accuracy like F1 or BLEU.
Second, we propose methods to understand the reliability of a performance prediction model from two angles: confidence intervals and calibration.
arXiv Detail & Related papers (2021-02-10T15:23:20Z) - Distribution-Free, Risk-Controlling Prediction Sets [112.9186453405701]
We show how to generate set-valued predictions from a black-box predictor that control the expected loss on future test points at a user-specified level.
Our approach provides explicit finite-sample guarantees for any dataset by using a holdout set to calibrate the size of the prediction sets.
arXiv Detail & Related papers (2021-01-07T18:59:33Z) - Counterfactual Predictions under Runtime Confounding [74.90756694584839]
We study the counterfactual prediction task in the setting where all relevant factors are captured in the historical data.
We propose a doubly-robust procedure for learning counterfactual prediction models in this setting.
arXiv Detail & Related papers (2020-06-30T15:49:05Z) - Ambiguity in Sequential Data: Predicting Uncertain Futures with
Recurrent Models [110.82452096672182]
We propose an extension of the Multiple Hypothesis Prediction (MHP) model to handle ambiguous predictions with sequential data.
We also introduce a novel metric for ambiguous problems, which is better suited to account for uncertainties.
arXiv Detail & Related papers (2020-03-10T09:15:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.