Systematic Literature Review on Application of Machine Learning in
Continuous Integration
- URL: http://arxiv.org/abs/2305.12695v2
- Date: Mon, 17 Jul 2023 06:38:00 GMT
- Title: Systematic Literature Review on Application of Machine Learning in
Continuous Integration
- Authors: Ali Kazemi Arani, Triet Huynh Minh Le, Mansooreh Zahedi and Muhammad
Ali Babar
- Abstract summary: This research conducted a systematic review of the literature on machine learning (ML)-based methods in the context of Continuous Integration (CI) over the past 22 years.
The study aimed to identify and describe the techniques used in ML-based solutions for CI.
- Score: 7.180264400668846
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This research conducted a systematic review of the literature on machine
learning (ML)-based methods in the context of Continuous Integration (CI) over
the past 22 years. The study aimed to identify and describe the techniques used
in ML-based solutions for CI and analyzed various aspects such as data
engineering, feature engineering, hyper-parameter tuning, ML models, evaluation
methods, and metrics. In this paper, we have depicted the phases of CI testing,
the connection between them, and the employed techniques in training the ML
method phases. We presented nine types of data sources and four taken steps in
the selected studies for preparing the data. Also, we identified four feature
types and nine subsets of data features through thematic analysis of the
selected studies. Besides, five methods for selecting and tuning the
hyper-parameters are shown. In addition, we summarised the evaluation methods
used in the literature and identified fifteen different metrics. The most
commonly used evaluation methods were found to be precision, recall, and
F1-score, and we have also identified five methods for evaluating the
performance of trained ML models. Finally, we have presented the relationship
between ML model types, performance measurements, and CI phases. The study
provides valuable insights for researchers and practitioners interested in
ML-based methods in CI and emphasizes the need for further research in this
area.
Related papers
- A Survey of Small Language Models [104.80308007044634]
Small Language Models (SLMs) have become increasingly important due to their efficiency and performance to perform various language tasks with minimal computational resources.
We present a comprehensive survey on SLMs, focusing on their architectures, training techniques, and model compression techniques.
arXiv Detail & Related papers (2024-10-25T23:52:28Z) - Recent Advances on Machine Learning for Computational Fluid Dynamics: A Survey [51.87875066383221]
This paper introduces fundamental concepts, traditional methods, and benchmark datasets, then examine the various roles Machine Learning plays in improving CFD.
We highlight real-world applications of ML for CFD in critical scientific and engineering disciplines, including aerodynamics, combustion, atmosphere & ocean science, biology fluid, plasma, symbolic regression, and reduced order modeling.
We draw the conclusion that ML is poised to significantly transform CFD research by enhancing simulation accuracy, reducing computational time, and enabling more complex analyses of fluid dynamics.
arXiv Detail & Related papers (2024-08-22T07:33:11Z) - Systematic Literature Review on Application of Learning-based Approaches in Continuous Integration [2.3436632098950456]
Machine learning (ML) and deep learning (DL) analyze raw data to extract valuable insights in specific phases.
The rise of continuous practices in software projects emphasizes automating Continuous Integration (CI) with these learning-based methods.
This study provides a comprehensive overview of learning-based methods in CI, offering valuable insights for researchers and practitioners developing CI task automation.
arXiv Detail & Related papers (2024-06-28T09:10:23Z) - Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective [106.92016199403042]
We empirically investigate knowledge transfer from larger to smaller models through a parametric perspective.
We employ sensitivity-based techniques to extract and align knowledge-specific parameters between different large language models.
Our findings highlight the critical factors contributing to the process of parametric knowledge transfer.
arXiv Detail & Related papers (2023-10-17T17:58:34Z) - Application of Transformers based methods in Electronic Medical Records:
A Systematic Literature Review [77.34726150561087]
This work presents a systematic literature review of state-of-the-art advances using transformer-based methods on electronic medical records (EMRs) in different NLP tasks.
arXiv Detail & Related papers (2023-04-05T22:19:42Z) - Filter Methods for Feature Selection in Supervised Machine Learning
Applications -- Review and Benchmark [0.0]
This review synthesizes the literature on feature selection benchmarking and evaluates the performance of 58 methods in the widely used R environment.
We consider four typical dataset scenarios that are challenging for ML models.
arXiv Detail & Related papers (2021-11-23T20:20:24Z) - Supply of engineering techniques and software design patterns in
psychoanalysis and psychometrics sciences [0.0]
The purpose of this study is to introduce software technologies and models and artificial intelligence algorithms to improve the weaknesses of CBT (Cognitive Behavior Therapy) method in psychotherapy.
The presentation method for this purpose is the implementation of psychometric experiments in which the hidden human variables are inferred from the answers of tests.
arXiv Detail & Related papers (2021-08-16T08:36:37Z) - An Extensible Benchmark Suite for Learning to Simulate Physical Systems [60.249111272844374]
We introduce a set of benchmark problems to take a step towards unified benchmarks and evaluation protocols.
We propose four representative physical systems, as well as a collection of both widely used classical time-based and representative data-driven methods.
arXiv Detail & Related papers (2021-08-09T17:39:09Z) - Comprehensive Comparative Study of Multi-Label Classification Methods [1.1278903078792917]
Multi-label classification (MLC) has recently received increasing interest from the machine learning community.
This work provides a comprehensive empirical study of a wide range of MLC methods on a plethora of datasets from various domains.
arXiv Detail & Related papers (2021-02-14T09:38:15Z) - Machine Learning for Detecting Data Exfiltration: A Review [1.949912057689623]
Research at the intersection of cybersecurity, Machine Learning (ML), and Software Engineering (SE) has recently taken significant steps in proposing countermeasures for detecting sophisticated data exfiltration attacks.
This paper aims at systematically reviewing ML-based data exfiltration countermeasures to identify and classify ML approaches, feature engineering techniques, evaluation datasets, and performance metrics used for these countermeasures.
arXiv Detail & Related papers (2020-12-17T01:05:50Z) - Interpretable Multi-dataset Evaluation for Named Entity Recognition [110.64368106131062]
We present a general methodology for interpretable evaluation for the named entity recognition (NER) task.
The proposed evaluation method enables us to interpret the differences in models and datasets, as well as the interplay between them.
By making our analysis tool available, we make it easy for future researchers to run similar analyses and drive progress in this area.
arXiv Detail & Related papers (2020-11-13T10:53:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.