Function+Data Flow: A Framework to Specify Machine Learning Pipelines for Digital Twinning
- URL: http://arxiv.org/abs/2406.19670v2
- Date: Mon, 8 Jul 2024 08:28:34 GMT
- Title: Function+Data Flow: A Framework to Specify Machine Learning Pipelines for Digital Twinning
- Authors: Eduardo de Conto, Blaise Genest, Arvind Easwaran,
- Abstract summary: Digital twins (DTs) for physical systems increasingly leverage artificial intelligence (AI)
Here we propose a domain-specific language (t+Data Flow) to describe AI pipelines within DTs.
Specifically, t treats functions as first-class citizens, enabling effective manipulation of models learned with AI.
- Score: 2.27626288527213
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The development of digital twins (DTs) for physical systems increasingly leverages artificial intelligence (AI), particularly for combining data from different sources or for creating computationally efficient, reduced-dimension models. Indeed, even in very different application domains, twinning employs common techniques such as model order reduction and modelization with hybrid data (that is, data sourced from both physics-based models and sensors). Despite this apparent generality, current development practices are ad-hoc, making the design of AI pipelines for digital twinning complex and time-consuming. Here we propose Function+Data Flow (FDF), a domain-specific language (DSL) to describe AI pipelines within DTs. FDF aims to facilitate the design and validation of digital twins. Specifically, FDF treats functions as first-class citizens, enabling effective manipulation of models learned with AI. We illustrate the benefits of FDF on two concrete use cases from different domains: predicting the plastic strain of a structure and modeling the electromagnetic behavior of a bearing.
Related papers
- Automatic AI Model Selection for Wireless Systems: Online Learning via Digital Twinning [50.332027356848094]
AI-based applications are deployed at intelligent controllers to carry out functionalities like scheduling or power control.
The mapping between context and AI model parameters is ideally done in a zero-shot fashion.
This paper introduces a general methodology for the online optimization of AMS mappings.
arXiv Detail & Related papers (2024-06-22T11:17:50Z) - Training Deep Surrogate Models with Large Scale Online Learning [48.7576911714538]
Deep learning algorithms have emerged as a viable alternative for obtaining fast solutions for PDEs.
Models are usually trained on synthetic data generated by solvers, stored on disk and read back for training.
It proposes an open source online training framework for deep surrogate models.
arXiv Detail & Related papers (2023-06-28T12:02:27Z) - Enhanced multi-fidelity modelling for digital twin and uncertainty
quantification [0.0]
Data-driven models play a crucial role in digital twins, enabling real-time updates and predictions.
The fidelity of available data and the scarcity of accurate sensor data often hinder the efficient learning of surrogate models.
We propose a novel framework that begins by developing a robust multi-fidelity surrogate model.
arXiv Detail & Related papers (2023-06-26T05:58:17Z) - DINOv2: Learning Robust Visual Features without Supervision [75.42921276202522]
This work shows that existing pretraining methods, especially self-supervised methods, can produce such features if trained on enough curated data from diverse sources.
Most of the technical contributions aim at accelerating and stabilizing the training at scale.
In terms of data, we propose an automatic pipeline to build a dedicated, diverse, and curated image dataset instead of uncurated data, as typically done in the self-supervised literature.
arXiv Detail & Related papers (2023-04-14T15:12:19Z) - Scaling Knowledge Graphs for Automating AI of Digital Twins [2.8693907332286996]
Digital Twins are digital representations of systems in the Internet of Things (IoT) that are often based on AI models that are trained on data from those systems.
We will discuss the unique requirements of applying semantic graphs to automate Digital Twins in different practical use cases.
arXiv Detail & Related papers (2022-10-26T10:12:10Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - Automatic digital twin data model generation of building energy systems
from piping and instrumentation diagrams [58.720142291102135]
We present an approach to recognize symbols and connections of P&ID from buildings in a completely automated way.
We apply algorithms for symbol recognition, line recognition and derivation of connections to the data sets.
The approach can be used in further processes like control generation, (distributed) model predictive control or fault detection.
arXiv Detail & Related papers (2021-08-31T15:09:39Z) - Model-Based Deep Learning [155.063817656602]
Signal processing, communications, and control have traditionally relied on classical statistical modeling techniques.
Deep neural networks (DNNs) use generic architectures which learn to operate from data, and demonstrate excellent performance.
We are interested in hybrid techniques that combine principled mathematical models with data-driven systems to benefit from the advantages of both approaches.
arXiv Detail & Related papers (2020-12-15T16:29:49Z) - Federated Knowledge Distillation [42.87991207898215]
Federated distillation (FD) is a distributed learning solution that only exchanges the model outputs whose dimensions are commonly much smaller than the model sizes.
This chapter provides a deep understanding of FD while demonstrating its communication efficiency and applicability to a variety of tasks.
The second part elaborates on a baseline implementation of FD for a classification task, and illustrates its performance in terms of accuracy and communication efficiency compared to FL.
The third part presents two selected applications, namely FD over asymmetric uplink-and-downlink wireless channels and FD for reinforcement learning.
arXiv Detail & Related papers (2020-11-04T15:56:13Z) - Deep Learning of Dynamic Subsurface Flow via Theory-guided Generative
Adversarial Network [0.0]
Theory-guided generative adversarial network (TgGAN) is proposed to solve dynamic partial differential equations (PDEs)
TgGAN is proposed for dynamic subsurface flow with heterogeneous model parameters.
Numerical results demonstrate that the TgGAN model is robust and reliable for deep learning of dynamic PDEs.
arXiv Detail & Related papers (2020-06-02T02:53:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.