A Comparative Study of Machine Learning Algorithms for Anomaly Detection
in Industrial Environments: Performance and Environmental Impact
- URL: http://arxiv.org/abs/2307.00361v1
- Date: Sat, 1 Jul 2023 15:18:00 GMT
- Title: A Comparative Study of Machine Learning Algorithms for Anomaly Detection
in Industrial Environments: Performance and Environmental Impact
- Authors: \'Alvaro Huertas-Garc\'ia and Carlos Mart\'i-Gonz\'alez and Rub\'en
Garc\'ia Maezo and Alejandro Echeverr\'ia Rey
- Abstract summary: This study seeks to address the demands of high-performance machine learning models with environmental sustainability.
Traditional machine learning algorithms, such as Decision Trees and Random Forests, demonstrate robust efficiency and performance.
However, superior outcomes were obtained with optimised configurations, albeit with a commensurate increase in resource consumption.
- Score: 62.997667081978825
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the context of Industry 4.0, the use of artificial intelligence (AI) and
machine learning for anomaly detection is being hampered by high computational
requirements and associated environmental effects. This study seeks to address
the demands of high-performance machine learning models with environmental
sustainability, contributing to the emerging discourse on 'Green AI.' An
extensive variety of machine learning algorithms, coupled with various
Multilayer Perceptron (MLP) configurations, were meticulously evaluated. Our
investigation encapsulated a comprehensive suite of evaluation metrics,
comprising Accuracy, Area Under the Curve (AUC), Recall, Precision, F1 Score,
Kappa Statistic, Matthews Correlation Coefficient (MCC), and F1 Macro.
Simultaneously, the environmental footprint of these models was gauged through
considerations of time duration, CO2 equivalent, and energy consumption during
the training, cross-validation, and inference phases. Traditional machine
learning algorithms, such as Decision Trees and Random Forests, demonstrate
robust efficiency and performance. However, superior outcomes were obtained
with optimised MLP configurations, albeit with a commensurate increase in
resource consumption. The study incorporated a multi-objective optimisation
approach, invoking Pareto optimality principles, to highlight the trade-offs
between a model's performance and its environmental impact. The insights
derived underscore the imperative of striking a balance between model
performance, complexity, and environmental implications, thus offering valuable
directions for future work in the development of environmentally conscious
machine learning models for industrial applications.
Related papers
- Impact of ML Optimization Tactics on Greener Pre-Trained ML Models [46.78148962732881]
This study aims to (i) analyze image classification datasets and pre-trained models, (ii) improve inference efficiency by comparing optimized and non-optimized models, and (iii) assess the economic impact of the optimizations.
We conduct a controlled experiment to evaluate the impact of various PyTorch optimization techniques (dynamic quantization, torch.compile, local pruning, and global pruning) to 42 Hugging Face models for image classification.
Dynamic quantization demonstrates significant reductions in inference time and energy consumption, making it highly suitable for large-scale systems.
arXiv Detail & Related papers (2024-09-19T16:23:03Z) - Computing Within Limits: An Empirical Study of Energy Consumption in ML Training and Inference [2.553456266022126]
Machine learning (ML) has seen tremendous advancements, but its environmental footprint remains a concern.
Acknowledging the growing environmental impact of ML this paper investigates Green ML.
arXiv Detail & Related papers (2024-06-20T13:59:34Z) - ExioML: Eco-economic dataset for Machine Learning in Global Sectoral Sustainability [11.925553950065895]
This paper introduces ExioML, the first Machine Learning benchmark dataset designed for sustainability analysis.
A crucial greenhouse gas emission regression task was conducted to evaluate sectoral sustainability and demonstrate the usability of the dataset.
arXiv Detail & Related papers (2024-06-11T17:06:34Z) - Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation [82.85015548989223]
Pentathlon is a benchmark for holistic and realistic evaluation of model efficiency.
Pentathlon focuses on inference, which accounts for a majority of the compute in a model's lifecycle.
It incorporates a suite of metrics that target different aspects of efficiency, including latency, throughput, memory overhead, and energy consumption.
arXiv Detail & Related papers (2023-07-19T01:05:33Z) - When Demonstrations Meet Generative World Models: A Maximum Likelihood
Framework for Offline Inverse Reinforcement Learning [62.00672284480755]
This paper aims to recover the structure of rewards and environment dynamics that underlie observed actions in a fixed, finite set of demonstrations from an expert agent.
Accurate models of expertise in executing a task has applications in safety-sensitive applications such as clinical decision making and autonomous driving.
arXiv Detail & Related papers (2023-02-15T04:14:20Z) - Energy Efficiency of Training Neural Network Architectures: An Empirical
Study [11.325530936177493]
The evaluation of Deep Learning models has traditionally focused on criteria such as accuracy, F1 score, and related measures.
The computations needed to train such models entail a large carbon footprint.
We study the relations between DL model architectures and their environmental impact in terms of energy consumed and CO$$ emissions produced during training.
arXiv Detail & Related papers (2023-02-02T09:20:54Z) - Efficient Model-based Multi-agent Reinforcement Learning via Optimistic
Equilibrium Computation [93.52573037053449]
H-MARL (Hallucinated Multi-Agent Reinforcement Learning) learns successful equilibrium policies after a few interactions with the environment.
We demonstrate our approach experimentally on an autonomous driving simulation benchmark.
arXiv Detail & Related papers (2022-03-14T17:24:03Z) - Hybrid and Automated Machine Learning Approaches for Oil Fields
Development: the Case Study of Volve Field, North Sea [58.720142291102135]
The paper describes the usage of intelligent approaches for field development tasks that may assist a decision-making process.
We focus on the problem of wells location optimization and two tasks within it: improving the quality of oil production estimation and estimation of reservoir characteristics.
The implemented approaches can be used to analyze different oil fields or adapted to similar physics-related problems.
arXiv Detail & Related papers (2021-03-03T18:51:46Z) - Importance of Environment Design in Reinforcement Learning: A Study of a
Robotic Environment [0.0]
This paper studies the decision-making process of a mobile collaborative robotic assistant modeled by the Markov decision process (MDP) framework.
The optimal state-action combinations of the MDP are calculated with the non-linear Bellman optimality equations.
We present various small modifications on the very same schema that lead to different optimal policies.
arXiv Detail & Related papers (2021-02-20T21:14:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.