Application of AI to formal methods - an analysis of current trends
- URL: http://arxiv.org/abs/2411.14870v2
- Date: Wed, 27 Aug 2025 20:29:01 GMT
- Title: Application of AI to formal methods - an analysis of current trends
- Authors: Sebastian Stock, Jannik Dunkelau, Atif Mashkoor,
- Abstract summary: We conduct a systematic mapping study to overview the current landscape of research publications that apply AI to formal methods (FM)<n>We searched for relevant publications in four major databases, defined inclusion and exclusion criteria, and applied extensive snowballing to uncover potential additional sources.<n>We find a strong focus on AI in the area of theorem proving while other subfields of FM are less represented.
- Score: 0.9278012928091801
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Context: With artificial intelligence (AI) being well established within the daily lives of research communities, we turn our gaze toward formal methods (FM). FM aim to provide sound and verifiable reasoning about problems in computer science. Objective: We conduct a systematic mapping study to overview the current landscape of research publications that apply AI to FM. We aim to identify how FM can benefit from AI techniques and highlight areas for further research. Our focus lies on the previous five years (2019-2023) of research. Method: Following the proposed guidelines for systematic mapping studies, we searched for relevant publications in four major databases, defined inclusion and exclusion criteria, and applied extensive snowballing to uncover potential additional sources. Results: This investigation results in 189 entries which we explored to find current trends and highlight research gaps. We find a strong focus on AI in the area of theorem proving while other subfields of FM are less represented. Conclusions: The mapping study provides a quantitative overview of the modern state of AI application in FM. The current trend of the field is yet to mature. Many primary studies focus on practical application, yet we identify a lack of theoretical groundwork, standard benchmarks, or case studies. Further, we identify issues regarding shared training data sets and standard benchmarks.
Related papers
- The Role of Computing Resources in Publishing Foundation Model Research [84.20094600030092]
We evaluate the relationship between these resources and the scientific advancement of foundation models (FM)<n>We reviewed 6517 FM papers published between 2022 to 2024, and surveyed 229 first-authors to the impact of computing resources on scientific output.<n>We find that increased computing is correlated with national funding allocations and citations, but our findings don't observe the strong correlations with research environment.
arXiv Detail & Related papers (2025-10-15T14:50:45Z) - Previously on... Automating Code Review [4.096540146408279]
Modern Code Review (MCR) is a standard practice in software engineering, yet it demands substantial time and resource investments.<n>Recent research has increasingly explored automating core review tasks using machine learning (ML) and deep learning (DL)<n>This study provides the first comprehensive analysis of MCR automation research.
arXiv Detail & Related papers (2025-08-25T13:12:48Z) - Characterizing Deep Research: A Benchmark and Formal Definition [24.523394260858822]
We propose a formal characterization of the deep research (DR) task and introduce a benchmark to evaluate the performance of DR systems.<n>We argue that the core defining feature of deep research is not the production of lengthy report-style outputs, but rather the high fan-out over concepts required during the search process.
arXiv Detail & Related papers (2025-08-06T08:09:28Z) - Unveiling Latent Topics in Robotic Process Automation -- an Approach based on Latent Dirichlet Allocation Smart Review [0.0]
This study aims to create a science map of RPA and its aspects by revealing latent topics related to RPA.
By using an unsupervised machine learning method based on Latent Dirichlet Allocation, we were able to analyse over 2000 paper abstracts.
Among these, we found 100 distinct study topics, 15 of which have been included in the science map we provide.
arXiv Detail & Related papers (2024-04-08T20:03:06Z) - Gradient based Feature Attribution in Explainable AI: A Technical Review [13.848675695545909]
Surge in black-box AI models has prompted the need to explain the internal mechanism and justify their reliability.
gradient based explanations can be directly adopted for neural network models.
We introduce both human and quantitative evaluations to measure algorithm performance.
arXiv Detail & Related papers (2024-03-15T15:49:31Z) - Progress and Opportunities of Foundation Models in Bioinformatics [77.74411726471439]
Foundations models (FMs) have ushered in a new era in computational biology, especially in the realm of deep learning.
Central to our focus is the application of FMs to specific biological problems, aiming to guide the research community in choosing appropriate FMs for their research needs.
Review analyses challenges and limitations faced by FMs in biology, such as data noise, model explainability, and potential biases.
arXiv Detail & Related papers (2024-02-06T02:29:17Z) - ChatGPT and Beyond: The Generative AI Revolution in Education [0.21756081703275998]
This survey examines academic literature published between November, 2022 and July, 2023.
It aims to illuminate the evolving role of generative AI models, particularly ChatGPT, in education.
The findings of this review will empower educators, researchers, and policymakers to make informed decisions about the integration of AI technologies into learning environments.
arXiv Detail & Related papers (2023-11-26T05:34:22Z) - Learn From Model Beyond Fine-Tuning: A Survey [78.80920533793595]
Learn From Model (LFM) focuses on the research, modification, and design of foundation models (FM) based on the model interface.
The study of LFM techniques can be broadly categorized into five major areas: model tuning, model distillation, model reuse, meta learning and model editing.
This paper gives a comprehensive review of the current methods based on FM from the perspective of LFM.
arXiv Detail & Related papers (2023-10-12T10:20:36Z) - A Survey on Interpretable Cross-modal Reasoning [64.37362731950843]
Cross-modal reasoning (CMR) has emerged as a pivotal area with applications spanning from multimedia analysis to healthcare diagnostics.
This survey delves into the realm of interpretable cross-modal reasoning (I-CMR)
This survey presents a comprehensive overview of the typical methods with a three-level taxonomy for I-CMR.
arXiv Detail & Related papers (2023-09-05T05:06:48Z) - Artificial intelligence adoption in the physical sciences, natural
sciences, life sciences, social sciences and the arts and humanities: A
bibliometric analysis of research publications from 1960-2021 [73.06361680847708]
In 1960 14% of 333 research fields were related to AI, but this increased to over half of all research fields by 1972, over 80% by 1986 and over 98% in current times.
In 1960 14% of 333 research fields were related to AI (many in computer science), but this increased to over half of all research fields by 1972, over 80% by 1986 and over 98% in current times.
We conclude that the context of the current surge appears different, and that interdisciplinary AI application is likely to be sustained.
arXiv Detail & Related papers (2023-06-15T14:08:07Z) - AI Techniques in the Microservices Life-Cycle: A Systematic Mapping Study [8.026381963838272]
The use of AI in (MSs) is an emerging field as indicated by a substantial number of surveys.
We take an exhaustive approach to reveal all possible connections between the use of AI techniques for improving any quality attribute (QA) of MSs during the DevOps phases.
arXiv Detail & Related papers (2023-05-25T14:24:37Z) - A Survey of Explainable AI in Deep Visual Modeling: Methods and Metrics [24.86176236641865]
We present the first survey in Explainable AI that focuses on the methods and metrics for interpreting deep visual models.
Covering the landmark contributions along the state-of-the-art, we not only provide a taxonomic organization of the existing techniques, but also excavate a range of evaluation metrics.
arXiv Detail & Related papers (2023-01-31T06:49:42Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - Characterising Research Areas in the field of AI [68.8204255655161]
We identified the main conceptual themes by performing clustering analysis on the co-occurrence network of topics.
The results highlight the growing academic interest in research themes like deep learning, machine learning, and internet of things.
arXiv Detail & Related papers (2022-05-26T16:30:30Z) - Systematic Mapping Study on the Machine Learning Lifecycle [4.4090257489826845]
The study yields 405 publications published from 2005 to 2020, mapped in 5 different main research topics, and 31 sub-topics.
We observe that only a minority of publications focus on data management and model production problems, and that more studies should address the AI lifecycle from a holistic perspective.
arXiv Detail & Related papers (2021-03-11T11:44:23Z) - Domain Generalization: A Survey [146.68420112164577]
Domain generalization (DG) aims to achieve OOD generalization by only using source domain data for model learning.
For the first time, a comprehensive literature review is provided to summarize the ten-year development in DG.
arXiv Detail & Related papers (2021-03-03T16:12:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.