A Feature Dataset of Microservices-based Systems
- URL: http://arxiv.org/abs/2404.01789v1
- Date: Tue, 2 Apr 2024 09:52:18 GMT
- Title: A Feature Dataset of Microservices-based Systems
- Authors: Weipan Yang, Yongchao Xing, Yiming Lyu, Zhihao Liang, Zhiying Tu,
- Abstract summary: Poor practices in the design and development of datasets are called microservice bad smells.
There is a lack of an appropriate open-source microservice feature dataset.
- Score: 2.3734388579113275
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Microservice architecture has become a dominant architectural style in the service-oriented software industry. Poor practices in the design and development of microservices are called microservice bad smells. In microservice bad smells research, the detection of these bad smells relies on feature data from microservices. However, there is a lack of an appropriate open-source microservice feature dataset. The availability of such datasets may contribute to the detection of microservice bad smells unexpectedly. To address this research gap, this paper collects a number of open-source microservice systems utilizing Spring Cloud. Additionally, feature metrics are established based on the architecture and interactions of Spring Boot style microservices. And an extraction program is developed. The program is then applied to the collected open-source microservice systems, extracting the necessary information, and undergoing manual verification to create an open-source feature dataset specific to microservice systems using Spring Cloud. The dataset is made available through a CSV file. We believe that both the extraction program and the dataset have the potential to contribute to the study of micro-service bad smells.
Related papers
- DiscoveryBench: Towards Data-Driven Discovery with Large Language Models [50.36636396660163]
We present DiscoveryBench, the first comprehensive benchmark that formalizes the multi-step process of data-driven discovery.
Our benchmark contains 264 tasks collected across 6 diverse domains, such as sociology and engineering.
Our benchmark, thus, illustrates the challenges in autonomous data-driven discovery and serves as a valuable resource for the community to make progress.
arXiv Detail & Related papers (2024-07-01T18:58:22Z) - Benchmarking Data Management Systems for Microservices [1.9948490148513414]
Microservice architectures are a popular choice for deploying large-scale data-intensive applications.
Existing microservice benchmarks lack essential data management challenges.
Online Marketplace is a novel benchmark that embraces core data management requirements.
arXiv Detail & Related papers (2024-05-19T11:55:45Z) - A Benchmark for Data Management in Microservices [1.9338699922911442]
We present Online Marketplace, a microservice benchmark that incorporates core data management challenges.
These challenges include transaction processing, query processing, event processing, constraint enforcement, and data replication.
We present the challenges we faced in creating workloads that accurately reflect the state-of-the-art data platforms.
arXiv Detail & Related papers (2024-03-19T10:14:48Z) - A Systematic Review of Available Datasets in Additive Manufacturing [56.684125592242445]
In-situ monitoring incorporating visual and other sensor technologies allows the collection of extensive datasets during the Additive Manufacturing process.
These datasets have potential for determining the quality of the manufactured output and the detection of defects through the use of Machine Learning.
This systematic review investigates the availability of open image-based datasets originating from AM processes that align with a number of pre-defined selection criteria.
arXiv Detail & Related papers (2024-01-27T16:13:32Z) - The Microservice Dependency Matrix [0.0]
This paper introduces the Dependency Matrix (EDM) and Data Dependency Matrix (DDM) as tools to address this challenge.
We present an automated approach for tracking these dependencies and demonstrate their extraction through a case study.
arXiv Detail & Related papers (2023-09-06T07:41:00Z) - infoVerse: A Universal Framework for Dataset Characterization with
Multidimensional Meta-information [68.76707843019886]
infoVerse is a universal framework for dataset characterization.
infoVerse captures multidimensional characteristics of datasets by incorporating various model-driven meta-information.
In three real-world applications (data pruning, active learning, and data annotation), the samples chosen on infoVerse space consistently outperform strong baselines.
arXiv Detail & Related papers (2023-05-30T18:12:48Z) - AI Techniques in the Microservices Life-Cycle: A Survey [10.06596283248616]
In microservice systems, functionalities are provided by loosely coupled, small services, each focusing on a specific business capability.
Building a system according to the architectural style brings a number of challenges, mainly related to how different are deployed and coordinated.
In this paper, we provide a survey about how techniques in the area of Artificial Intelligence have been used to tackle these challenges.
arXiv Detail & Related papers (2023-05-25T14:24:37Z) - MicroRes: Versatile Resilience Profiling in Microservices via Degradation Dissemination Indexing [29.456286275972474]
Microservice resilience, the ability to recover from failures and continue providing reliable and responsive services, is crucial for cloud vendors.
The current practice relies on manually configured specific rules to a certain microservice system, resulting in labor-intensity and flexibility issues.
Our insight is that resilient deployment can effectively prevent the dissemination of degradation from system performance to user-aware metrics, and the latter affects service quality.
arXiv Detail & Related papers (2022-12-25T03:56:42Z) - Outsourcing Training without Uploading Data via Efficient Collaborative
Open-Source Sampling [49.87637449243698]
Traditional outsourcing requires uploading device data to the cloud server.
We propose to leverage widely available open-source data, which is a massive dataset collected from public and heterogeneous sources.
We develop a novel strategy called Efficient Collaborative Open-source Sampling (ECOS) to construct a proximal proxy dataset from open-source data for cloud training.
arXiv Detail & Related papers (2022-10-23T00:12:18Z) - A Privacy-Preserving Distributed Architecture for
Deep-Learning-as-a-Service [68.84245063902908]
This paper introduces a novel distributed architecture for deep-learning-as-a-service.
It is able to preserve the user sensitive data while providing Cloud-based machine and deep learning services.
arXiv Detail & Related papers (2020-03-30T15:12:03Z) - MSC: A Dataset for Macro-Management in StarCraft II [52.52008929278214]
We release a new macro-management dataset based on the platform SC2LE.
MSC consists of well-designed feature vectors, pre-defined high-level actions and final result of each match.
Besides the dataset, we propose a baseline model and present initial baseline results for global state evaluation and build order prediction.
arXiv Detail & Related papers (2017-10-09T14:59:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.