LightAutoML: AutoML Solution for a Large Financial Services Ecosystem
- URL: http://arxiv.org/abs/2109.01528v1
- Date: Fri, 3 Sep 2021 13:52:32 GMT
- Title: LightAutoML: AutoML Solution for a Large Financial Services Ecosystem
- Authors: Anton Vakhrushev, Alexander Ryzhkov, Maxim Savchenko, Dmitry Simakov,
Rinchin Damdinov, Alexander Tuzhilin
- Abstract summary: We present an AutoML system called LightAutoML developed for a large European financial services company.
Our framework was piloted and deployed in numerous applications and performed at the level of the experienced data scientists.
- Score: 108.09104876115428
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present an AutoML system called LightAutoML developed for a large European
financial services company and its ecosystem satisfying the set of
idiosyncratic requirements that this ecosystem has for AutoML solutions. Our
framework was piloted and deployed in numerous applications and performed at
the level of the experienced data scientists while building high-quality ML
models significantly faster than these data scientists. We also compare the
performance of our system with various general-purpose open source AutoML
solutions and show that it performs better for most of the ecosystem and OpenML
problems. We also present the lessons that we learned while developing the
AutoML system and moving it into production.
Related papers
- SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning [14.702694298483445]
Tree-Search Enhanced LLM Agents (SELA) is an agent-based system that leverages Monte Carlo Tree Search (MCTS) to optimize the AutoML process.
SELA represents pipeline configurations as trees, enabling agents to conduct experiments intelligently and iteratively refine their strategies.
In an extensive evaluation across 20 machine learning datasets, we compare the performance of traditional and agent-based AutoML methods.
arXiv Detail & Related papers (2024-10-22T17:56:08Z) - AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML [56.565200973244146]
Automated machine learning (AutoML) accelerates AI development by automating tasks in the development pipeline.
Recent works have started exploiting large language models (LLM) to lessen such burden.
This paper proposes AutoML-Agent, a novel multi-agent framework tailored for full-pipeline AutoML.
arXiv Detail & Related papers (2024-10-03T20:01:09Z) - Position: A Call to Action for a Human-Centered AutoML Paradigm [83.78883610871867]
Automated machine learning (AutoML) was formed around the fundamental objectives of automatically and efficiently configuring machine learning (ML)
We argue that a key to unlocking AutoML's full potential lies in addressing the currently underexplored aspect of user interaction with AutoML systems.
arXiv Detail & Related papers (2024-06-05T15:05:24Z) - DREAM: Debugging and Repairing AutoML Pipelines [39.83914420717843]
We design and implement DREAM, an automatic debug and repairing system for AutoML systems.
It monitors the process of AutoML to collect detailed feedback and automatically repairs bugs by expanding search space.
Our evaluation results show that DREAM can effectively and efficiently repair AutoML bugs.
arXiv Detail & Related papers (2023-12-31T02:45:17Z) - Assessing the Use of AutoML for Data-Driven Software Engineering [10.40771687966477]
AutoML promises to automate the building of end-to-end AI/ML pipelines.
Despite the growing interest and high expectations, there is a dearth of information about the extent to which AutoML is currently adopted.
arXiv Detail & Related papers (2023-07-20T11:14:24Z) - OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge
Collaborative AutoML System [85.8338446357469]
We introduce OmniForce, a human-centered AutoML system that yields both human-assisted ML and ML-assisted human techniques.
We show how OmniForce can put an AutoML system into practice and build adaptive AI in open-environment scenarios.
arXiv Detail & Related papers (2023-03-01T13:35:22Z) - Towards Green Automated Machine Learning: Status Quo and Future
Directions [71.86820260846369]
AutoML is being criticised for its high resource consumption.
This paper proposes Green AutoML, a paradigm to make the whole AutoML process more environmentally friendly.
arXiv Detail & Related papers (2021-11-10T18:57:27Z) - VolcanoML: Speeding up End-to-End AutoML via Scalable Search Space
Decomposition [57.06900573003609]
VolcanoML is a framework that decomposes a large AutoML search space into smaller ones.
It supports a Volcano-style execution model, akin to the one supported by modern database systems.
Our evaluation demonstrates that, not only does VolcanoML raise the level of expressiveness for search space decomposition in AutoML, it also leads to actual findings of decomposition strategies.
arXiv Detail & Related papers (2021-07-19T13:23:57Z) - Ensemble Squared: A Meta AutoML System [13.062016289815054]
This paper presents a "meta" AutoML system that ensembles at the level of AutoML systems.
Ensemble Squared exploits the diversity of existing, competing AutoML systems by ensembling the top-performing models simultaneously generated by a set of them.
arXiv Detail & Related papers (2020-12-10T01:09:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.