An Empirical Study of Bugs in Quantum Machine Learning Frameworks
- URL: http://arxiv.org/abs/2306.06369v3
- Date: Thu, 22 Jun 2023 16:10:04 GMT
- Title: An Empirical Study of Bugs in Quantum Machine Learning Frameworks
- Authors: Pengzhan Zhao, Xiongfei Wu, Junjie Luo, Zhuo Li, Jianjun Zhao
- Abstract summary: We inspect 391 real-world bugs collected from 22 open-source repositories of nine popular QML frameworks.
28% of the bugs are quantum-specific, such as erroneous unitary matrix implementation.
We manually distilled a taxonomy of five symptoms and nine root causes of bugs in QML platforms.
- Score: 5.868747298750261
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Quantum computing has emerged as a promising domain for the machine learning
(ML) area, offering significant computational advantages over classical
counterparts. With the growing interest in quantum machine learning (QML),
ensuring the correctness and robustness of software platforms to develop such
QML programs is critical. A necessary step for ensuring the reliability of such
platforms is to understand the bugs they typically suffer from. To address this
need, this paper presents the first comprehensive study of bugs in QML
frameworks. We inspect 391 real-world bugs collected from 22 open-source
repositories of nine popular QML frameworks. We find that 1) 28% of the bugs
are quantum-specific, such as erroneous unitary matrix implementation, calling
for dedicated approaches to find and prevent them; 2) We manually distilled a
taxonomy of five symptoms and nine root causes of bugs in QML platforms; 3) We
summarized four critical challenges for QML framework developers. The study
results provide researchers with insights into how to ensure QML framework
quality and present several actionable suggestions for QML framework developers
to improve their code quality.
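The abstract singles out erroneous unitary matrix implementations as a representative quantum-specific bug. As a minimal illustrative sketch (my own, not code from the paper), such a bug can be caught by checking the defining property U†U = I; a Hadamard gate missing its 1/√2 normalization factor, a plausible bug of this kind, fails the check:

```python
import math

def is_unitary_2x2(u, tol=1e-9):
    """Check U† U = I entrywise for a 2x2 complex matrix given as nested lists."""
    (a, b), (c, d) = u
    prod = [
        [a.conjugate() * a + c.conjugate() * c, a.conjugate() * b + c.conjugate() * d],
        [b.conjugate() * a + d.conjugate() * c, b.conjugate() * b + d.conjugate() * d],
    ]
    identity = [[1, 0], [0, 1]]
    return all(abs(prod[i][j] - identity[i][j]) < tol
               for i in range(2) for j in range(2))

s = 1 / math.sqrt(2)
hadamard = [[s, s], [s, -s]]        # correct Hadamard gate: unitary
hadamard_buggy = [[1, 1], [1, -1]]  # bug: missing the 1/sqrt(2) normalization
```

A QML framework test suite could apply such a property check to every gate matrix it constructs.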
Related papers
- What's Wrong with Your Code Generated by Large Language Models? An Extensive Study [80.18342600996601]
Large language models (LLMs) produce code that is shorter yet more complicated as compared to canonical solutions.
We develop a taxonomy of bugs for incorrect code that includes three categories and 12 sub-categories, and analyze the root causes of common bug types.
We propose a novel training-free iterative method that introduces self-critique, enabling LLMs to critique and correct their generated code based on bug types and compiler feedback.
arXiv Detail & Related papers (2024-07-08T17:27:17Z) - Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph [85.51252685938564]
Uncertainty quantification (UQ) is becoming increasingly recognized as a critical component of applications that rely on machine learning (ML).
As with other ML models, large language models (LLMs) are prone to making incorrect predictions, "hallucinating" by fabricating claims, or simply generating low-quality output for a given input.
We introduce a novel benchmark that implements a collection of state-of-the-art UQ baselines, and provides an environment for controllable and consistent evaluation of novel techniques.
arXiv Detail & Related papers (2024-06-21T20:06:31Z) - Predominant Aspects on Security for Quantum Machine Learning: Literature Review [0.0]
Quantum Machine Learning (QML) has emerged as a promising intersection of quantum computing and classical machine learning.
This paper discusses which security concerns and strengths are connected to QML by means of a systematic literature review.
arXiv Detail & Related papers (2024-01-15T15:35:43Z) - DebugBench: Evaluating Debugging Capability of Large Language Models [80.73121177868357]
DebugBench is a benchmark for Large Language Models (LLMs).
It covers four major bug categories and 18 minor types in C++, Java, and Python.
We evaluate two commercial and four open-source models in a zero-shot scenario.
arXiv Detail & Related papers (2024-01-09T15:46:38Z) - Competition-Level Problems are Effective LLM Evaluators [121.15880285283116]
This paper aims to evaluate the reasoning capacities of large language models (LLMs) in solving recent programming problems in Codeforces.
We first provide a comprehensive evaluation of GPT-4's perceived zero-shot performance on this task, considering various aspects such as problems' release time, difficulty, and types of errors encountered.
Surprisingly, the perceived performance of GPT-4 has experienced a cliff-like decline on problems released after September 2021, consistently across all difficulties and types of problems.
arXiv Detail & Related papers (2023-12-04T18:58:57Z) - Unifying (Quantum) Statistical and Parametrized (Quantum) Algorithms [65.268245109828]
We take inspiration from Kearns' SQ oracle and Valiant's weak evaluation oracle.
We introduce an extensive yet intuitive framework that yields unconditional lower bounds for learning from evaluation queries.
arXiv Detail & Related papers (2023-10-26T18:23:21Z) - Analyzing Quantum Programs with LintQ: A Static Analysis Framework for Qiskit [21.351834312054844]
This paper presents LintQ, a comprehensive static analysis framework for detecting bugs in quantum programs.
Our approach is enabled by a set of abstractions designed to reason about common concepts in quantum computing without referring to the underlying quantum computing platform.
We apply the approach to a newly collected dataset of 7,568 real-world Qiskit-based quantum programs, showing that LintQ effectively identifies various programming problems.
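LintQ's actual abstractions are not reproduced here; purely as a hedged illustration of what a static check for quantum programs can look like, a toy Python `ast` pass (my own sketch, not LintQ code) might flag a Qiskit-style circuit that is created without classical bits yet later calls `measure`:

```python
import ast

def find_measure_without_clbits(source):
    """Toy lint: flag `x = QuantumCircuit(n)` (one arg, so no classical bits)
    whose variable later calls `.measure(...)`."""
    tree = ast.parse(source)
    no_clbit_circuits = set()
    findings = []
    for node in ast.walk(tree):
        # Record circuits constructed with a single (qubit-count) argument.
        if isinstance(node, ast.Assign) and isinstance(node.value, ast.Call):
            call = node.value
            if (isinstance(call.func, ast.Name)
                    and call.func.id == "QuantumCircuit"
                    and len(call.args) == 1
                    and isinstance(node.targets[0], ast.Name)):
                no_clbit_circuits.add(node.targets[0].id)
        # Flag `.measure(...)` calls on those circuits.
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Attribute):
            attr = node.func
            if (attr.attr == "measure"
                    and isinstance(attr.value, ast.Name)
                    and attr.value.id in no_clbit_circuits):
                findings.append(f"line {node.lineno}: measure on circuit "
                                f"'{attr.value.id}' with no classical bits")
    return findings

sample = """
qc = QuantumCircuit(2)
qc.h(0)
qc.measure(0, 0)
"""
warnings = find_measure_without_clbits(sample)
```

Real tools such as LintQ reason over far richer program abstractions, but the pattern-matching flavor is similar.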
arXiv Detail & Related papers (2023-10-01T16:36:09Z) - Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level
Vision [85.6008224440157]
Multi-modality Large Language Models (MLLMs) have catalyzed a shift in computer vision from specialized models to general-purpose foundation models.
We present Q-Bench, a holistic benchmark crafted to evaluate potential abilities of MLLMs on three realms: low-level visual perception, low-level visual description, and overall visual quality assessment.
arXiv Detail & Related papers (2023-09-25T14:43:43Z) - Case Study-Based Approach of Quantum Machine Learning in Cybersecurity:
Quantum Support Vector Machine for Malware Classification and Protection [8.34729912896717]
We design and develop QML-based ten learning modules covering various cybersecurity topics.
In this paper, we utilize quantum support vector machine (QSVM) for malware classification and protection.
We demonstrate our QSVM model and achieve an accuracy of 95% in malware classification and protection.
arXiv Detail & Related papers (2023-06-01T02:04:09Z) - Projection Valued Measure-based Quantum Machine Learning for Multi-Class
Classification [10.90994913062223]
We propose a novel framework for multi-class classification using projection-valued measure (PVM).
Our framework outperforms the state-of-the-art (SOTA) with various datasets using no more than 6 qubits.
arXiv Detail & Related papers (2022-10-30T03:12:53Z) - Bugs in Machine Learning-based Systems: A Faultload Benchmark [16.956588187947993]
There is no standard benchmark of bugs to assess their performance, compare them and discuss their advantages and weaknesses.
In this study, we first investigate the verifiability of bugs in ML-based systems and show the most important factors for each.
We provide a benchmark, named defect4ML, that satisfies all criteria of a standard benchmark, i.e., relevance, fairness, verifiability, and usability.
arXiv Detail & Related papers (2022-06-24T14:20:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of its content (including all information) and is not responsible for any consequences of its use.