CateCom: a practical data-centric approach to categorization of
computational models
- URL: http://arxiv.org/abs/2109.13452v1
- Date: Tue, 28 Sep 2021 02:59:40 GMT
- Title: CateCom: a practical data-centric approach to categorization of
computational models
- Authors: Alexander Zech and Timur Bazhirov
- Abstract summary: We present an effort aimed at organizing the landscape of physics-based and data-driven computational models.
We apply object-oriented design concepts and outline the foundations of an open-source collaborative framework.
- Score: 77.34726150561087
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The advent of data-driven science in the 21st century brought about the need
for well-organized structured data and associated infrastructure able to
facilitate the applications of Artificial Intelligence and Machine Learning. We
present an effort aimed at organizing the diverse landscape of physics-based
and data-driven computational models in order to facilitate the storage of
associated information as structured data. We apply object-oriented design
concepts and outline the foundations of an open-source collaborative framework
that is: (1) capable of uniquely describing the approaches in structured data,
(2) flexible enough to cover the majority of widely used models, and (3)
utilizes collective intelligence through community contributions. We present
example database schemas and corresponding data structures and explain how
these are deployed in software at the time of this writing.
Related papers
- Structure Learning via Mutual Information [0.8702432681310399]
We propose a framework for learning and representing functional relationships in data using mutual information (MI) features.
Our method aims to capture the underlying structure of information in datasets, enabling more efficient and generalizable learning algorithms.
arXiv Detail & Related papers (2024-09-21T19:33:56Z) - Redefining Data-Centric Design: A New Approach with a Domain Model and Core Data Ontology for Computational Systems [2.872069347343959]
This paper presents an innovative data-centric paradigm for designing computational systems by introducing a new informatics domain model.
The proposed model moves away from the conventional node-centric framework and focuses on data-centric categorization, using a multimodal approach that incorporates objects, events, concepts, and actions.
arXiv Detail & Related papers (2024-09-01T22:34:12Z) - Tree-based variational inference for Poisson log-normal models [47.82745603191512]
hierarchical trees are often used to organize entities based on proximity criteria.
Current count-data models do not leverage this structured information.
We introduce the PLN-Tree model as an extension of the PLN model for modeling hierarchical count data.
arXiv Detail & Related papers (2024-06-25T08:24:35Z) - A Blueprint Architecture of Compound AI Systems for Enterprise [18.109450556443782]
We introduce a blueprint architecture for compound AI systems to operate in enterprise settings cost-effectively and feasibly.
Our proposed architecture aims for seamless integration with existing compute and data infrastructure, with stream'' serving as the key orchestration concept.
arXiv Detail & Related papers (2024-06-02T01:16:32Z) - Incremental hierarchical text clustering methods: a review [49.32130498861987]
This study aims to analyze various hierarchical and incremental clustering techniques.
The main contribution of this research is the organization and comparison of the techniques used by studies published between 2010 and 2018 that aimed to texts documents clustering.
arXiv Detail & Related papers (2023-12-12T22:27:29Z) - Serving Deep Learning Model in Relational Databases [70.53282490832189]
Serving deep learning (DL) models on relational data has become a critical requirement across diverse commercial and scientific domains.
We highlight three pivotal paradigms: The state-of-the-art DL-centric architecture offloads DL computations to dedicated DL frameworks.
The potential UDF-centric architecture encapsulates one or more tensor computations into User Defined Functions (UDFs) within the relational database management system (RDBMS)
arXiv Detail & Related papers (2023-10-07T06:01:35Z) - Data Mesh: a Systematic Gray Literature Review [3.038477115588261]
Data mesh is an emerging domain-driven decentralized data architecture that aims to minimize or avoid operational bottlenecks.
We systematically collected, analyzed, and synthesized 114 industrial gray literature articles.
The review provides insights into practitioners' perspectives on the four key principles of data mesh.
arXiv Detail & Related papers (2023-04-03T15:16:46Z) - Policy Architectures for Compositional Generalization in Control [71.61675703776628]
We introduce a framework for modeling entity-based compositional structure in tasks.
Our policies are flexible and can be trained end-to-end without requiring any action primitives.
arXiv Detail & Related papers (2022-03-10T06:44:24Z) - Edge-assisted Democratized Learning Towards Federated Analytics [67.44078999945722]
We show the hierarchical learning structure of the proposed edge-assisted democratized learning mechanism, namely Edge-DemLearn.
We also validate Edge-DemLearn as a flexible model training mechanism to build a distributed control and aggregation methodology in regions.
arXiv Detail & Related papers (2020-12-01T11:46:03Z) - Siamese Graph Neural Networks for Data Integration [11.41207739004894]
We propose a general approach to modeling and integrating entities from structured data, such as relational databases, as well as unstructured sources, such as free text from news articles.
Our approach is designed to explicitly model and leverage relations between entities, thereby using all available information and preserving as much context as possible.
We evaluate our method on the task of integrating data about business entities, and we demonstrate that it outperforms standard rule-based systems, as well as other deep learning approaches that do not use graph-based representations.
arXiv Detail & Related papers (2020-01-17T21:51:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.