Continuous Integration and Software Quality: A Causal Explanatory Study
- URL: http://arxiv.org/abs/2309.10205v1
- Date: Mon, 18 Sep 2023 23:10:34 GMT
- Title: Continuous Integration and Software Quality: A Causal Explanatory Study
- Authors: Eliezio Soares, Daniel Alencar da Costa and Uir\'a Kulesza
- Abstract summary: Continuous Integration (CI) is a software engineering practice that aims to reduce the cost and risk of code integration among teams.
Recent empirical studies have confirmed associations between CI and the software quality (SQ)
- Score: 0.46040036610482665
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Continuous Integration (CI) is a software engineering practice that aims to
reduce the cost and risk of code integration among teams. Recent empirical
studies have confirmed associations between CI and the software quality (SQ).
However, no existing study investigates causal relationships between CI and SQ.
This paper investigates it by applying the causal Direct Acyclic Graphs (DAGs)
technique. We combine two other strategies to support this technique: a
literature review and a Mining Software Repository (MSR) study. In the first
stage, we review the literature to discover existing associations between CI
and SQ, which help us create a "literature-based causal DAG" in the second
stage. This DAG encapsulates the literature assumptions regarding CI and its
influence on SQ. In the third stage, we analyze 12 activity months for 70
opensource projects by mining software repositories -- 35 CI and 35 no-CI
projects. This MSR study is not a typical "correlation is not causation" study
because it is used to verify the relationships uncovered in the causal DAG
produced in the first stages. The fourth stage consists of testing the
statistical implications from the "literature-based causal DAG" on our dataset.
Finally, in the fifth stage, we build a DAG with observations from the
literature and the dataset, the "literature-data DAG". In addition to the
direct causal effect of CI on SQ, we find evidence of indirect effects of CI.
For example, CI affects teams' communication, which positively impacts SQ. We
also highlight the confounding effect of project age.
Related papers
- Causal Discovery from Data Assisted by Large Language Models [50.193740129296245]
It is essential to integrate experimental data with prior domain knowledge for knowledge driven discovery.
Here we demonstrate this approach by combining high-resolution scanning transmission electron microscopy (STEM) data with insights derived from large language models (LLMs)
By fine-tuning ChatGPT on domain-specific literature, we construct adjacency matrices for Directed Acyclic Graphs (DAGs) that map the causal relationships between structural, chemical, and polarization degrees of freedom in Sm-doped BiFeO3 (SmBFO)
arXiv Detail & Related papers (2025-03-18T02:14:49Z) - Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents [64.43980129731587]
We propose a causal-inspired inference-time debiasing method called Causal Diagnosis and Correction (CDC)
CDC first diagnoses the bias effect of the perplexity and then separates the bias effect from the overall relevance score.
Experimental results across three domains demonstrate the superior debiasing effectiveness.
arXiv Detail & Related papers (2025-03-11T17:59:00Z) - A Knowledge-Centric Benchmarking Framework and Empirical Study for Retrieval-Augmented Generation [4.359511178431438]
Retrieval-Augmented Generation (RAG) enhances generative models by integrating retrieval mechanisms.
Despite its advantages, RAG encounters significant challenges, particularly in effectively handling real-world queries.
This paper proposes a novel RAG benchmark designed to address these challenges.
arXiv Detail & Related papers (2024-09-03T03:31:37Z) - Causal Reasoning in Software Quality Assurance: A Systematic Review [11.887059800587672]
This study provides a systematic review of the scientific literature on causal reasoning for SQA.
Fault localization is the activity where causal reasoning is more exploited, especially in the web services/microservices domain.
tools to favour their application are appearing at a fast pace - most of them after 2021.
arXiv Detail & Related papers (2024-08-30T10:34:11Z) - Anwendung von Causal-Discovery-Algorithmen zur Root-Cause-Analyse in der Fahrzeugmontage [0.2995925627097048]
Root Cause Analysis (RCA) is a quality management method that aims to systematically investigate and identify the cause-and-effect relationships of problems.
In modern production processes, large amounts of data are collected.
This publication demonstrates the application of Causal Discovery Algorithms (CDA) on data from the assembly of a leading automotive manufacturer.
arXiv Detail & Related papers (2024-07-23T11:22:33Z) - SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis [8.768077629120915]
SCP-Diff sets new state-of-the-art results in SIS on Cityscapes, ADE20K and COCO-Stuff, yielding a FID as low as 10.53 on Cityscapes.
arXiv Detail & Related papers (2024-03-14T17:59:55Z) - Discovering Dynamic Causal Space for DAG Structure Learning [64.763763417533]
We propose a dynamic causal space for DAG structure learning, coined CASPER.
It integrates the graph structure into the score function as a new measure in the causal space to faithfully reflect the causal distance between estimated and ground truth DAG.
arXiv Detail & Related papers (2023-06-05T12:20:40Z) - Joint Learning of Label and Environment Causal Independence for Graph
Out-of-Distribution Generalization [60.4169201192582]
We propose to incorporate label and environment causal independence (LECI) to fully make use of label and environment information.
LECI significantly outperforms prior methods on both synthetic and real-world datasets.
arXiv Detail & Related papers (2023-06-01T19:33:30Z) - Causal DAG extraction from a library of books or videos/movies [0.0]
We argue that human and animal brains contain an explicit engine for doing Causal Inference (CI)
We propose a simple algorithm for constructing such an atlas from a library of books or videos/movies.
We illustrate our method by applying it to a database of randomly generated Tic-Tac-Toe games.
arXiv Detail & Related papers (2022-10-29T16:09:22Z) - Biblio-Analysis of Cohort Intelligence (CI) Algorithm and its allied
applications from Scopus and Web of Science Perspective [0.0]
Cohort Intelligence or CI is one of its kind of novel optimization algorithm.
This research paper in a way will be an ice breaker for those who want to take up CI to a new level.
In this research papers, CI publications available in Scopus are analyzed through graphs, networked diagrams about authors, source titles, keywords over the years, journals over the time.
arXiv Detail & Related papers (2022-09-07T09:09:33Z) - Mining Root Cause Knowledge from Cloud Service Incident Investigations
for AIOps [71.12026848664753]
Root Cause Analysis (RCA) of any service-disrupting incident is one of the most critical as well as complex tasks in IT processes.
In this work, we present ICA and the downstream Incident Search and Retrieval based RCA pipeline, built at Salesforce.
arXiv Detail & Related papers (2022-04-21T02:33:34Z) - Effect Identification in Cluster Causal Diagrams [51.42809552422494]
We introduce a new type of graphical model called cluster causal diagrams (for short, C-DAGs)
C-DAGs allow for the partial specification of relationships among variables based on limited prior knowledge.
We develop the foundations and machinery for valid causal inferences over C-DAGs.
arXiv Detail & Related papers (2022-02-22T21:27:31Z) - SAIS: Supervising and Augmenting Intermediate Steps for Document-Level
Relation Extraction [51.27558374091491]
We propose to explicitly teach the model to capture relevant contexts and entity types by supervising and augmenting intermediate steps (SAIS) for relation extraction.
Based on a broad spectrum of carefully designed tasks, our proposed SAIS method not only extracts relations of better quality due to more effective supervision, but also retrieves the corresponding supporting evidence more accurately.
arXiv Detail & Related papers (2021-09-24T17:37:35Z) - Recent Developments Combining Ensemble Smoother and Deep Generative
Networks for Facies History Matching [58.720142291102135]
This research project focuses on the use of autoencoders networks to construct a continuous parameterization for facies models.
We benchmark seven different formulations, including VAE, generative adversarial network (GAN), Wasserstein GAN, variational auto-encoding GAN, principal component analysis (PCA) with cycle GAN, PCA with transfer style network and VAE with style loss.
arXiv Detail & Related papers (2020-05-08T21:32:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.