Related papers: Scientific Machine Learning Benchmarks

URL: http://arxiv.org/abs/2110.12773v1
Date: Mon, 25 Oct 2021 10:05:11 GMT
Title: Scientific Machine Learning Benchmarks
Authors: Jeyan Thiyagalingam, Mallikarjun Shankar, Geoffrey Fox, Tony Hey
Abstract summary: The breakthrough in Deep Learning neural networks has transformed the use of AI and machine learning technologies for the analysis of very large experimental datasets. Identifying the most appropriate machine learning algorithm for the analysis of any given scientific dataset is still a challenge for scientists. We describe our approach to the development of scientific machine learning benchmarks and review other approaches to benchmarking scientific machine learning.
Score: 0.17205106391379021
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The breakthrough in Deep Learning neural networks has transformed the use of AI and machine learning technologies for the analysis of very large experimental datasets. These datasets are typically generated by large-scale experimental facilities at national laboratories. In the context of science, scientific machine learning focuses on training machines to identify patterns, trends, and anomalies to extract meaningful scientific insights from such datasets. With a new generation of experimental facilities, the rate of data generation and the scale of data volumes will increasingly require the use of more automated data analysis. At present, identifying the most appropriate machine learning algorithm for the analysis of any given scientific dataset is still a challenge for scientists. This is due to many different machine learning frameworks, computer architectures, and machine learning models. Historically, for modelling and simulation on HPC systems such problems have been addressed through benchmarking computer applications, algorithms, and architectures. Extending such a benchmarking approach and identifying metrics for the application of machine learning methods to scientific datasets is a new challenge for both scientists and computer scientists. In this paper, we describe our approach to the development of scientific machine learning benchmarks and review other approaches to benchmarking scientific machine learning.

Related papers

Building Machine Learning Challenges for Anomaly Detection in Science [94.24422981343699]
We present three datasets aimed at developing machine learning-based anomaly detection for disparate scientific domains. We present a scheme to make machine learning challenges around the three datasets findable, accessible, interoperable, and reusable.
arXiv Detail & Related papers (2025-03-03T22:54:07Z)
Obtaining physical layer data of latest generation networks for investigating adversary attacks [0.0]
Machine learning can be used to optimize the functions of latest generation data networks such as 5G and 6G. Adversarial measures that manipulate the behaviour of intelligent machine learning models are becoming a major concern. A simulation model is proposed that works in conjunction with machine learning applications.
arXiv Detail & Related papers (2024-05-02T06:03:27Z)
EndToEndML: An Open-Source End-to-End Pipeline for Machine Learning Applications [0.2826977330147589]
We propose a web-based end-to-end pipeline that is capable of preprocessing, training, evaluating, and visualizing machine learning models. Our library assists in recognizing, classifying, clustering, and predicting a wide range of multi-modal, multi-sensor datasets.
arXiv Detail & Related papers (2024-03-27T02:24:38Z)
Mechanistic Neural Networks for Scientific Machine Learning [58.99592521721158]
We present Mechanistic Neural Networks, a neural network design for machine learning applications in the sciences. It incorporates a new Mechanistic Block in standard architectures to explicitly learn governing differential equations as representations. Central to our approach is a novel Relaxed Linear Programming solver (NeuRLP) inspired by a technique that reduces solving linear ODEs to solving linear programs.
arXiv Detail & Related papers (2024-02-20T15:23:24Z)
Interpretable Machine Learning for Discovery: Statistical Challenges & Opportunities [1.2891210250935146]
We discuss and review the field of interpretable machine learning. We outline the types of discoveries that can be made using Interpretable Machine Learning. We focus on the grand challenge of how to validate these discoveries in a data-driven manner.
arXiv Detail & Related papers (2023-08-02T23:57:31Z)
Addressing computational challenges in physical system simulations with machine learning [0.0]
We present a machine learning-based data generator framework tailored to aid researchers who utilize simulations to examine various physical systems or processes. Our approach involves a two-step process: first, we train a supervised predictive model using a limited simulated dataset to predict simulation outcomes. Subsequently, a reinforcement learning agent is trained to generate accurate, simulation-like data by leveraging the supervised model.
arXiv Detail & Related papers (2023-05-16T17:31:50Z)
Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models. The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z)
Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature. We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z)
Learning from learning machines: a new generation of AI technology to meet the needs of science [59.261050918992325]
We outline emerging opportunities and challenges to enhance the utility of AI for scientific discovery. The distinct goals of AI for industry versus the goals of AI for science create tension between identifying patterns in data versus discovering patterns in the world from data.
arXiv Detail & Related papers (2021-11-27T00:55:21Z)
An Extensible Benchmark Suite for Learning to Simulate Physical Systems [60.249111272844374]
We introduce a set of benchmark problems to take a step towards unified benchmarks and evaluation protocols. We propose four representative physical systems, as well as a collection of both widely used classical time-based and representative data-driven methods.
arXiv Detail & Related papers (2021-08-09T17:39:09Z)
Ten Quick Tips for Deep Learning in Biology [116.78436313026478]
Machine learning is concerned with the development and applications of algorithms that can recognize patterns in data and use them for predictive modeling. Deep learning has become its own subfield of machine learning. In the context of biological research, deep learning has been increasingly used to derive novel insights from high-dimensional biological data.
arXiv Detail & Related papers (2021-05-29T21:02:44Z)
Performance Analysis and Comparison of Machine and Deep Learning Algorithms for IoT Data Classification [0.0]
This paper evaluates the performance of 11 popular machine and deep learning algorithms for classification tasks using six IoT-related datasets. Considering all performance metrics, Random Forests performed better than other machine learning models, while among deep learning models, ANN and CNN achieved more interesting results.
arXiv Detail & Related papers (2020-01-27T09:14:11Z)

This list is automatically generated from the titles and abstracts of the papers on this site.