Bridge Data Center AI Systems with Edge Computing for Actionable
Information Retrieval
- URL: http://arxiv.org/abs/2105.13967v1
- Date: Fri, 28 May 2021 16:47:01 GMT
- Title: Bridge Data Center AI Systems with Edge Computing for Actionable
Information Retrieval
- Authors: Zhengchun Liu, Ahsan Ali, Peter Kenesei, Antonino Miceli, Hemant
Sharma, Nicholas Schwarz, Dennis Trujillo, Hyunseung Yoo, Ryan Coffee, Ryan
Herbst, Jana Thayer, Chun Hong Yoon, Ian Foster
- Abstract summary: High data rates at modern synchrotron and X-ray free-electron lasers motivate the use of machine learning methods for data reduction, feature detection, and other purposes.
We describe here how specialized data center AI systems can be used for this purpose.
- Score: 0.5652468989804973
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Extremely high data rates at modern synchrotron and X-ray free-electron
lasers (XFELs) light source beamlines motivate the use of machine learning
methods for data reduction, feature detection, and other purposes. Regardless
of the application, the basic concept is the same: data collected in early
stages of an experiment, data from past similar experiments, and/or data
simulated for the upcoming experiment are used to train machine learning models
that, in effect, learn specific characteristics of those data; these models are
then used to process subsequent data more efficiently than would
general-purpose models that lack knowledge of the specific dataset or data
class. Thus, a key challenge is to be able to train models with sufficient
rapidity that they can be deployed and used within useful timescales. We
describe here how specialized data center AI systems can be used for this
purpose.
Related papers
- Data Shapley in One Training Run [88.59484417202454]
Data Shapley provides a principled framework for attributing data's contribution within machine learning contexts.
Existing approaches require re-training models on different data subsets, which is computationally intensive.
This paper introduces In-Run Data Shapley, which addresses these limitations by offering scalable data attribution for a target model of interest.
arXiv Detail & Related papers (2024-06-16T17:09:24Z) - Physics Sensor Based Deep Learning Fall Detection System [0.9128828609564524]
We build a complete system named TSFallDetect including data receiving device based on embedded sensor.
We exploit the sequential deep-learning methods to address this falling motion prediction problem based on data collected by inertial and film pressure sensors.
arXiv Detail & Related papers (2024-02-29T07:50:06Z) - Synthetic data enable experiments in atomistic machine learning [0.0]
We demonstrate the use of a large dataset labelled with per-atom energies from an existing ML potential model.
The cheapness of this process, compared to the quantum-mechanical ground truth, allows us to generate millions of datapoints.
We show that learning synthetic data labels can be a useful pre-training task for subsequent fine-tuning on small datasets.
arXiv Detail & Related papers (2022-11-29T18:17:24Z) - Data Models for Dataset Drift Controls in Machine Learning With Optical
Images [8.818468649062932]
A primary failure mode are performance drops due to differences between the training and deployment data.
Existing approaches do not account for explicit models of the primary object of interest: the data.
We demonstrate how such data models can be constructed for image data and used to control downstream machine learning model performance related to dataset drift.
arXiv Detail & Related papers (2022-11-04T16:50:10Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - Deep Reinforcement Learning Assisted Federated Learning Algorithm for
Data Management of IIoT [82.33080550378068]
The continuous expanded scale of the industrial Internet of Things (IIoT) leads to IIoT equipments generating massive amounts of user data every moment.
How to manage these time series data in an efficient and safe way in the field of IIoT is still an open issue.
This paper studies the FL technology applications to manage IIoT equipment data in wireless network environments.
arXiv Detail & Related papers (2022-02-03T07:12:36Z) - SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines.
This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z) - ALT-MAS: A Data-Efficient Framework for Active Testing of Machine
Learning Algorithms [58.684954492439424]
We propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data.
The idea is to estimate the metrics of interest for a model-under-test using Bayesian neural network (BNN)
arXiv Detail & Related papers (2021-04-11T12:14:04Z) - Model-Based Deep Learning [155.063817656602]
Signal processing, communications, and control have traditionally relied on classical statistical modeling techniques.
Deep neural networks (DNNs) use generic architectures which learn to operate from data, and demonstrate excellent performance.
We are interested in hybrid techniques that combine principled mathematical models with data-driven systems to benefit from the advantages of both approaches.
arXiv Detail & Related papers (2020-12-15T16:29:49Z) - Evaluation of synthetic and experimental training data in supervised
machine learning applied to charge state detection of quantum dots [0.0]
We evaluate the prediction accuracy of a range of machine learning models trained on simulated and experimental data.
We find that classifiers perform best on either purely experimental or a combination of synthetic and experimental training data.
arXiv Detail & Related papers (2020-05-16T23:41:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.