A Comprehensive Study on Quality Assurance Tools for Java
- URL: http://arxiv.org/abs/2305.16812v2
- Date: Wed, 7 Jun 2023 11:45:03 GMT
- Title: A Comprehensive Study on Quality Assurance Tools for Java
- Authors: Han Liu, Sen Chen, Ruitao Feng, Chengwei Liu, Kaixuan Li, Zhengzi Xu,
Liming Nie, Yang Liu, Yixiang Chen
- Abstract summary: Quality assurance (QA) tools are receiving more and more attention and are widely used by developers.
Most existing research is limited in the following ways:.
They compare tools without considering scanning rules analysis.
They disagree on the effectiveness of tools due to the study methodology and benchmark dataset.
There is no large-scale study on the analysis of time performance.
- Score: 15.255117038871337
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Quality assurance (QA) tools are receiving more and more attention and are
widely used by developers. Given the wide range of solutions for QA technology,
it is still a question of evaluating QA tools. Most existing research is
limited in the following ways: (i) They compare tools without considering
scanning rules analysis. (ii) They disagree on the effectiveness of tools due
to the study methodology and benchmark dataset. (iii) They do not separately
analyze the role of the warnings. (iv) There is no large-scale study on the
analysis of time performance. To address these problems, in the paper, we
systematically select 6 free or open-source tools for a comprehensive study
from a list of 148 existing Java QA tools. To carry out a comprehensive study
and evaluate tools in multi-level dimensions, we first mapped the scanning
rules to the CWE and analyze the coverage and granularity of the scanning
rules. Then we conducted an experiment on 5 benchmarks, including 1,425 bugs,
to investigate the effectiveness of these tools. Furthermore, we took
substantial effort to investigate the effectiveness of warnings by comparing
the real labeled bugs with the warnings and investigating their role in bug
detection. Finally, we assessed these tools' time performance on 1,049
projects. The useful findings based on our comprehensive study can help
developers improve their tools and provide users with suggestions for selecting
QA tools.
Related papers
- A Comprehensive Study on Static Application Security Testing (SAST) Tools for Android [22.558610938860124]
VulsTotal is a unified evaluation platform for defining and describing tools' supported vulnerability types.
We select 11 free and open-sourced SAST tools from a pool of 97 existing options, adhering to clearly defined criteria.
We then unify 67 general/common vulnerability types for Android SAST tools.
arXiv Detail & Related papers (2024-10-28T05:10:22Z) - Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario [62.615210194004106]
Current research on tool learning primarily focuses on selecting the most effective tool from a wide array of options, often overlooking cost-effectiveness.
In this paper, we address the selection of homogeneous tools by predicting both their performance and the associated cost required to accomplish a given task.
arXiv Detail & Related papers (2024-06-18T09:24:09Z) - Tool Learning with Large Language Models: A Survey [60.733557487886635]
Tool learning with large language models (LLMs) has emerged as a promising paradigm for augmenting the capabilities of LLMs to tackle highly complex problems.
Despite growing attention and rapid advancements in this field, the existing literature remains fragmented and lacks systematic organization.
arXiv Detail & Related papers (2024-05-28T08:01:26Z) - Towards Completeness-Oriented Tool Retrieval for Large Language Models [60.733557487886635]
Real-world systems often incorporate a wide array of tools, making it impractical to input all tools into Large Language Models.
Existing tool retrieval methods primarily focus on semantic matching between user queries and tool descriptions.
We propose a novel modelagnostic COllaborative Learning-based Tool Retrieval approach, COLT, which captures not only the semantic similarities between user queries and tool descriptions but also takes into account the collaborative information of tools.
arXiv Detail & Related papers (2024-05-25T06:41:23Z) - Efficacy of static analysis tools for software defect detection on open-source projects [0.0]
The study used popular analysis tools such as SonarQube, PMD, Checkstyle, and FindBugs to perform the comparison.
The study results show that SonarQube performs considerably well than all other tools in terms of its defect detection.
arXiv Detail & Related papers (2024-05-20T19:05:32Z) - Static Application Security Testing (SAST) Tools for Smart Contracts: How Far Are We? [14.974832502863526]
In recent years, the importance of smart contract security has been heightened by the increasing number of attacks against them.
To address this issue, a multitude of static application security testing (SAST) tools have been proposed for detecting vulnerabilities in smart contracts.
In this paper, we propose an up-to-date and fine-grained taxonomy that includes 45 unique vulnerability types for smart contracts.
arXiv Detail & Related papers (2024-04-28T13:40:18Z) - What Are Tools Anyway? A Survey from the Language Model Perspective [67.18843218893416]
Language models (LMs) are powerful yet mostly for text generation tasks.
We provide a unified definition of tools as external programs used by LMs.
We empirically study the efficiency of various tooling methods.
arXiv Detail & Related papers (2024-03-18T17:20:07Z) - TOOLVERIFIER: Generalization to New Tools via Self-Verification [69.85190990517184]
We introduce a self-verification method which distinguishes between close candidates by self-asking contrastive questions during tool selection.
Experiments on 4 tasks from the ToolBench benchmark, consisting of 17 unseen tools, demonstrate an average improvement of 22% over few-shot baselines.
arXiv Detail & Related papers (2024-02-21T22:41:38Z) - ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of
Large Language Models in Real-world Scenarios [48.38419686697733]
We propose ToolEyes, a fine-grained system tailored for the evaluation of large language models' tool learning capabilities in authentic scenarios.
The system meticulously examines seven real-world scenarios, analyzing five dimensions crucial to LLMs in tool learning.
ToolEyes incorporates a tool library boasting approximately 600 tools, serving as an intermediary between LLMs and the physical world.
arXiv Detail & Related papers (2024-01-01T12:49:36Z) - Automated Grading and Feedback Tools for Programming Education: A
Systematic Review [7.776434991976473]
Most papers assess the correctness of assignments in object-oriented languages.
Few tools assess the maintainability, readability or documentation of the source code.
Most tools offered fully automated assessment to allow for near-instantaneous feedback.
arXiv Detail & Related papers (2023-06-20T17:54:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.