Automated Discovery of Data Transformations for Robotic Process
Automation
- URL: http://arxiv.org/abs/2001.01007v1
- Date: Fri, 3 Jan 2020 23:15:45 GMT
- Title: Automated Discovery of Data Transformations for Robotic Process
Automation
- Authors: Volodymyr Leno, Marlon Dumas, Marcello La Rosa, Fabrizio Maria Maggi,
Artem Polyvyanyy
- Abstract summary: This paper addresses the problem of analyzing User Interaction (UI) logs in order to discover routines where a user transfers data from one spreadsheet or (Web) form to another.
The proposed approach and its optimizations are evaluated using UI logs that replicate a real-life repetitive data transfer routine.
- Score: 2.0386745041807037
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Robotic Process Automation (RPA) is a technology for automating repetitive
routines consisting of sequences of user interactions with one or more
applications. In order to fully exploit the opportunities opened by RPA,
companies need to discover which specific routines may be automated, and how.
In this setting, this paper addresses the problem of analyzing User Interaction
(UI) logs in order to discover routines where a user transfers data from one
spreadsheet or (Web) form to another. The paper maps this problem to that of
discovering data transformations by example - a problem for which several
techniques are available. The paper shows that a naive application of a
state-of-the-art technique for data transformation discovery is computationally
inefficient. Accordingly, the paper proposes two optimizations that take
advantage of the information in the UI log and the fact that data transfers
across applications typically involve copying alphabetic and numeric tokens
separately. The proposed approach and its optimizations are evaluated using UI
logs that replicate a real-life repetitive data transfer routine.
Related papers
- Automatic AI Model Selection for Wireless Systems: Online Learning via Digital Twinning [50.332027356848094]
AI-based applications are deployed at intelligent controllers to carry out functionalities like scheduling or power control.
The mapping between context and AI model parameters is ideally done in a zero-shot fashion.
This paper introduces a general methodology for the online optimization of AMS mappings.
arXiv Detail & Related papers (2024-06-22T11:17:50Z) - Automated data processing and feature engineering for deep learning and big data applications: a survey [0.0]
Modern approach to artificial intelligence (AI) aims to design algorithms that learn directly from data.
Not all data processing tasks in conventional deep learning pipelines have been automated.
arXiv Detail & Related papers (2024-03-18T01:07:48Z) - Data augmentation with automated machine learning: approaches and
performance comparison with classical data augmentation methods [0.0]
State-of-the-art approaches typically rely on automated machine learning (AutoML) principles.
This work presents a comprehensive survey of AutoML-based data augmentation techniques.
arXiv Detail & Related papers (2024-03-13T09:00:38Z) - IR2: Information Regularization for Information Retrieval [2.4816250611120547]
This paper introduces IR2, Information Regularization for Information Retrieval, a technique for reducing overfitting during synthetic data generation.
Experimental results indicate that our regularization techniques not only outperform previous synthetic query generation methods on the tasks considered but also reduce cost by up to 50%.
arXiv Detail & Related papers (2024-02-25T21:25:06Z) - Zero-shot Composed Text-Image Retrieval [72.43790281036584]
We consider the problem of composed image retrieval (CIR)
It aims to train a model that can fuse multi-modal information, e.g., text and images, to accurately retrieve images that match the query, extending the user's expression ability.
arXiv Detail & Related papers (2023-06-12T17:56:01Z) - AutoML-GPT: Automatic Machine Learning with GPT [74.30699827690596]
We propose developing task-oriented prompts and automatically utilizing large language models (LLMs) to automate the training pipeline.
We present the AutoML-GPT, which employs GPT as the bridge to diverse AI models and dynamically trains models with optimized hyper parameters.
This approach achieves remarkable results in computer vision, natural language processing, and other challenging areas.
arXiv Detail & Related papers (2023-05-04T02:09:43Z) - Robotic Process Automation Using Process Mining $\unicode{x2013}$ A
Systematic Literature Review [0.7252027234425332]
This paper aims to assess the applicability of process mining in accelerating and improving the implementation of RPA.
A systematic literature review was conducted to examine the approaches where PM techniques were used to understand the as-is processes that can be automated with software robots.
There is a steady increase in the number of publications in this domain, especially during the year 2022, which suggests a raising interest in the combined use of PM with RPA.
arXiv Detail & Related papers (2022-04-02T03:13:17Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - AURSAD: Universal Robot Screwdriving Anomaly Detection Dataset [80.6725125503521]
This report describes a dataset created using a UR3e series robot and OnRobot Screwdriver.
The resulting data contains 2042 samples of normal and anomalous robot operation.
Brief ML benchmarks using this data are also provided, showcasing the data's suitability and potential for further analysis and experimentation.
arXiv Detail & Related papers (2021-02-02T09:59:23Z) - Scaling Systematic Literature Reviews with Machine Learning Pipelines [57.82662094602138]
Systematic reviews entail the extraction of data from scientific documents.
We construct a pipeline that automates each of these aspects, and experiment with many human-time vs. system quality trade-offs.
We find that we can get surprising accuracy and generalisability of the whole pipeline system with only 2 weeks of human-expert annotation.
arXiv Detail & Related papers (2020-10-09T16:19:42Z) - Information-theoretic User Interaction: Significant Inputs for Program
Synthesis [11.473616777800318]
We introduce the em significant questions problem, and show that it is hard in general.
We develop an information-theoretic greedy approach for solving the problem.
In the context of interactive program synthesis, we use the above result to develop an emactive program learner
Our active learner is able to tradeoff false negatives for false positives and converge in a small number of iterations on a real-world dataset.
arXiv Detail & Related papers (2020-06-22T21:46:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.