AutoML to Date and Beyond: Challenges and Opportunities
- URL: http://arxiv.org/abs/2010.10777v4
- Date: Wed, 19 May 2021 20:32:33 GMT
- Title: AutoML to Date and Beyond: Challenges and Opportunities
- Authors: Shubhra Kanti Karmaker Santu, Md. Mahadi Hassan, Micah J. Smith, Lei
Xu, ChengXiang Zhai, Kalyan Veeramachaneni
- Abstract summary: AutoML tools aim to make machine learning accessible for non-machine learning experts.
We introduce a new classification system for AutoML systems.
We lay out a roadmap for the future, pinpointing the research required to further automate the end-to-end machine learning pipeline.
- Score: 30.60364966752454
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As big data becomes ubiquitous across domains, and more and more stakeholders
aspire to make the most of their data, demand for machine learning tools has
spurred researchers to explore the possibilities of automated machine learning
(AutoML). AutoML tools aim to make machine learning accessible for non-machine
learning experts (domain experts), to improve the efficiency of machine
learning, and to accelerate machine learning research. But although automation
and efficiency are among AutoML's main selling points, the process still
requires human involvement at a number of vital steps, including understanding
the attributes of domain-specific data, defining prediction problems, creating
a suitable training data set, and selecting a promising machine learning
technique. These steps often require a prolonged back-and-forth that makes this
process inefficient for domain experts and data scientists alike, and keeps
so-called AutoML systems from being truly automatic. In this review article, we
introduce a new classification system for AutoML systems, using a seven-tiered
schematic to distinguish these systems based on their level of autonomy. We
begin by describing what an end-to-end machine learning pipeline actually looks
like, and which subtasks of the machine learning pipeline have been automated
so far. We highlight those subtasks which are still done manually - generally
by a data scientist - and explain how this limits domain experts' access to
machine learning. Next, we introduce our novel level-based taxonomy for AutoML
systems and define each level according to the scope of automation support
provided. Finally, we lay out a roadmap for the future, pinpointing the
research required to further automate the end-to-end machine learning pipeline
and discussing important challenges that stand in the way of this ambitious
goal.
Related papers
- AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML [56.565200973244146]
Automated machine learning (AutoML) accelerates AI development by automating tasks in the development pipeline.
Recent works have started exploiting large language models (LLM) to lessen such burden.
This paper proposes AutoML-Agent, a novel multi-agent framework tailored for full-pipeline AutoML.
arXiv Detail & Related papers (2024-10-03T20:01:09Z) - Position: A Call to Action for a Human-Centered AutoML Paradigm [83.78883610871867]
Automated machine learning (AutoML) was formed around the fundamental objectives of automatically and efficiently configuring machine learning (ML)
We argue that a key to unlocking AutoML's full potential lies in addressing the currently underexplored aspect of user interaction with AutoML systems.
arXiv Detail & Related papers (2024-06-05T15:05:24Z) - Automated data processing and feature engineering for deep learning and big data applications: a survey [0.0]
Modern approach to artificial intelligence (AI) aims to design algorithms that learn directly from data.
Not all data processing tasks in conventional deep learning pipelines have been automated.
arXiv Detail & Related papers (2024-03-18T01:07:48Z) - Assessing the Use of AutoML for Data-Driven Software Engineering [10.40771687966477]
AutoML promises to automate the building of end-to-end AI/ML pipelines.
Despite the growing interest and high expectations, there is a dearth of information about the extent to which AutoML is currently adopted.
arXiv Detail & Related papers (2023-07-20T11:14:24Z) - Benchmarking Automated Machine Learning Methods for Price Forecasting
Applications [58.720142291102135]
We show the possibility of substituting manually created ML pipelines with automated machine learning (AutoML) solutions.
Based on the CRISP-DM process, we split the manual ML pipeline into a machine learning and non-machine learning part.
We show in a case study for the industrial use case of price forecasting, that domain knowledge combined with AutoML can weaken the dependence on ML experts.
arXiv Detail & Related papers (2023-04-28T10:27:38Z) - OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge
Collaborative AutoML System [85.8338446357469]
We introduce OmniForce, a human-centered AutoML system that yields both human-assisted ML and ML-assisted human techniques.
We show how OmniForce can put an AutoML system into practice and build adaptive AI in open-environment scenarios.
arXiv Detail & Related papers (2023-03-01T13:35:22Z) - Man versus Machine: AutoML and Human Experts' Role in Phishing Detection [4.124446337711138]
This paper compares the performances of six well-known, state-of-the-art AutoML frameworks on ten different phishing datasets.
Our results indicate that AutoML-based models are able to outperform manually developed machine learning models in complex classification tasks.
arXiv Detail & Related papers (2021-08-27T09:26:20Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - Automated Machine Learning on Graphs: A Survey [81.21692888288658]
This paper is the first systematic and comprehensive review of automated machine learning on graphs.
We focus on hyper- parameter optimization (HPO) and neural architecture search (NAS) for graph machine learning.
In the end, we share our insights on future research directions for automated graph machine learning.
arXiv Detail & Related papers (2021-03-01T04:20:33Z) - AutoDS: Towards Human-Centered Automation of Data Science [20.859067294445985]
This paper introduces AutoDS, an automated machine learning (AutoML) system to support data science projects.
As expected, AutoDS improves productivity; Yet surprisingly, we find that the models produced by the AutoDS group have higher quality and less errors, but lower human confidence scores.
arXiv Detail & Related papers (2021-01-13T08:35:14Z) - Automated Machine Learning -- a brief review at the end of the early
years [14.211962590104111]
Automated machine learning (AutoML) is the sub-field of machine learning that aims at automating, to some extend, all stages of the design of a machine learning system.
In the context of supervised learning, AutoML is concerned with feature extraction, pre processing, model design and post processing.
arXiv Detail & Related papers (2020-08-19T15:48:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.