Data-centric Operational Design Domain Characterization for Machine
Learning-based Aeronautical Products
- URL: http://arxiv.org/abs/2307.07681v1
- Date: Sat, 15 Jul 2023 02:08:33 GMT
- Title: Data-centric Operational Design Domain Characterization for Machine
Learning-based Aeronautical Products
- Authors: Fateh Kaakai, Shridhar "Shreeder" Adibhatla, Ganesh Pai, Emmanuelle
Escorihuela
- Abstract summary: We give first rigorous characterization of Operational Design Domains (ODDs) for Machine Learning (ML)-based aeronautical products.
We propose the dimensions along which the parameters that define an ODD can be explicitly captured, together with a categorization of the data that ML-based applications can encounter in operation.
- Score: 4.8461049669050915
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We give a first rigorous characterization of Operational Design Domains
(ODDs) for Machine Learning (ML)-based aeronautical products. Unlike in other
application sectors (such as self-driving road vehicles) where ODD development
is scenario-based, our approach is data-centric: we propose the dimensions
along which the parameters that define an ODD can be explicitly captured,
together with a categorization of the data that ML-based applications can
encounter in operation, whilst identifying their system-level relevance and
impact. Specifically, we discuss how those data categories are useful to
determine: the requirements necessary to drive the design of ML Models (MLMs);
the potential effects on MLMs and higher levels of the system hierarchy; the
learning assurance processes that may be needed, and system architectural
considerations. We illustrate the underlying concepts with an example of an
aircraft flight envelope.
Related papers
- How to design a dataset compliant with an ML-based system ODD? [5.432478272457867]
This paper focuses on a Vision-based Landing task and presents the design and validation of a dataset that would comply with the Operational Design Domain (ODD) of a Machine-Learning (ML) system.
Relying on emerging certification standards, we describe the process for establishing ODDs at both the system and image levels.
arXiv Detail & Related papers (2024-06-20T06:48:34Z) - Prospector Heads: Generalized Feature Attribution for Large Models & Data [82.02696069543454]
We introduce prospector heads, an efficient and interpretable alternative to explanation-based attribution methods.
We demonstrate how prospector heads enable improved interpretation and discovery of class-specific patterns in input data.
arXiv Detail & Related papers (2024-02-18T23:01:28Z) - LESS: Selecting Influential Data for Targeted Instruction Tuning [64.78894228923619]
We propose LESS, an efficient algorithm to estimate data influences and perform Low-rank gradiEnt Similarity Search for instruction data selection.
We show that training on a LESS-selected 5% of the data can often outperform training on the full dataset across diverse downstream tasks.
Our method goes beyond surface form cues to identify data that the necessary reasoning skills for the intended downstream application.
arXiv Detail & Related papers (2024-02-06T19:18:04Z) - Benchmarking Automated Machine Learning Methods for Price Forecasting
Applications [58.720142291102135]
We show the possibility of substituting manually created ML pipelines with automated machine learning (AutoML) solutions.
Based on the CRISP-DM process, we split the manual ML pipeline into a machine learning and non-machine learning part.
We show in a case study for the industrial use case of price forecasting, that domain knowledge combined with AutoML can weaken the dependence on ML experts.
arXiv Detail & Related papers (2023-04-28T10:27:38Z) - Desiderata for next generation of ML model serving [0.34410212782758054]
This paper puts forth a range of important qualities that next generation of inference platforms should be aiming for.
An overarching design pattern is data-centricity, which enables smarter monitoring in ML system operation.
arXiv Detail & Related papers (2022-10-26T12:29:25Z) - An Empirical Evaluation of Flow Based Programming in the Machine
Learning Deployment Context [11.028123436097616]
Data Oriented Architecture (DOA) is an emerging approach that can support data scientists and software developers when addressing challenges.
This paper proposes to consider Flow-Based Programming (FBP) as a paradigm for creating DOA applications.
We empirically evaluate FBP in the context of ML deployment on four applications that represent typical data science projects.
arXiv Detail & Related papers (2022-04-27T09:08:48Z) - SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines.
This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z) - Machine Learning for Naval Architecture, Ocean and Marine Engineering [0.0]
This article reviews the applications of machine learning algorithms in naval architecture, ocean, and marine engineering problems.
We discuss the applications of machine learning algorithms for different problems such as wave height prediction, calculation of wind loads on ships, damage detection of offshore platforms, calculation of ship added resistance.
arXiv Detail & Related papers (2021-09-01T09:36:12Z) - Exploring the potential of flow-based programming for machine learning
deployment in comparison with service-oriented architectures [8.677012233188968]
We argue that part of the reason is infrastructure that was not designed for activities around data collection and analysis.
We propose to consider flow-based programming with data streams as an alternative to commonly used service-oriented architectures for building software applications.
arXiv Detail & Related papers (2021-08-09T15:06:02Z) - Leveraging Expert Consistency to Improve Algorithmic Decision Support [62.61153549123407]
We explore the use of historical expert decisions as a rich source of information that can be combined with observed outcomes to narrow the construct gap.
We propose an influence function-based methodology to estimate expert consistency indirectly when each case in the data is assessed by a single expert.
Our empirical evaluation, using simulations in a clinical setting and real-world data from the child welfare domain, indicates that the proposed approach successfully narrows the construct gap.
arXiv Detail & Related papers (2021-01-24T05:40:29Z) - A Survey on Large-scale Machine Learning [67.6997613600942]
Machine learning can provide deep insights into data, allowing machines to make high-quality predictions.
Most sophisticated machine learning approaches suffer from huge time costs when operating on large-scale data.
Large-scale Machine Learning aims to learn patterns from big data with comparable performance efficiently.
arXiv Detail & Related papers (2020-08-10T06:07:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.