Machine Learning Algorithms for Active Monitoring of High Performance
Computing as a Service (HPCaaS) Cloud Environments
- URL: http://arxiv.org/abs/2009.12498v1
- Date: Sat, 26 Sep 2020 01:29:19 GMT
- Title: Machine Learning Algorithms for Active Monitoring of High Performance
Computing as a Service (HPCaaS) Cloud Environments
- Authors: Gianluca Longoni (1), Ryan LaMothe (1), Jeremy Teuton (1), Mark
Greaves (1), Nicole Nichols (1), William Smith (1) ((1) Pacific Northwest
National Laboratory)
- Abstract summary: This paper explores the viability of engineering applications running on a cloud infrastructure configured as an HPC platform.
The engineering applications considered in this work include MCNP6, a radiation transport code developed by Los Alamos National Laboratory, OpenFOAM, an open source computational fluid dynamics code, and CADONFS, a numerical implementation of the general number field sieve algorithm used for prime number factorization.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cloud computing provides ubiquitous and on-demand access to vast
reconfigurable resources that can meet any computational need. Many service
models are available, but the Infrastructure as a Service (IaaS) model is
particularly suited to operate as a high performance computing (HPC) platform,
by networking large numbers of cloud computing nodes. We used the Pacific
Northwest National Laboratory (PNNL) cloud computing environment to perform our
experiments. A number of cloud computing providers such as Amazon Web Services,
Microsoft Azure, or IBM Cloud, offer flexible and scalable computing resources.
This paper explores the viability identifying types of engineering applications
running on a cloud infrastructure configured as an HPC platform using privacy
preserving features as input to statistical models. The engineering
applications considered in this work include MCNP6, a radiation transport code
developed by Los Alamos National Laboratory, OpenFOAM, an open source
computational fluid dynamics code, and CADO-NFS, a numerical implementation of
the general number field sieve algorithm used for prime number factorization.
Our experiments use the OpenStack cloud management tool to create a cloud HPC
environment and the privacy preserving Ceilometer billing meters as
classification features to demonstrate identification of these applications.
Related papers
- Teaching Cloud Infrastructure and Scalable Application Deployment in an Undergraduate Computer Science Program [2.8912542516745168]
Students commonly interact with cloud abstractions early in their technical careers.
Building cloud-native applications without a firm understanding of the fundamentals of cloud engineering can leave students susceptible to cost and security pitfalls common to cloud platforms.
Our course featured a number of hands-on assignments that gave students experience with modern, best-practice concepts and tools such as IaC, containerization, observability, serverless computing, and continuous integration and deployment.
arXiv Detail & Related papers (2024-10-01T19:49:19Z) - Feature Selection using the concept of Peafowl Mating in IDS [2.184775414778289]
Cloud computing provides services that are Infrastructure based, Platform based and Software based.
The popularity of this technology is due to its superb performance, high level of computing ability, low cost of services, scalability, availability and flexibility.
The obtainability and openness of data in cloud environment make it vulnerable to the world of cyber-attacks.
To detect the attacks Intrusion Detection System is used, that can identify the attacks and ensure information security.
arXiv Detail & Related papers (2024-02-03T06:04:49Z) - SoTaNa: The Open-Source Software Development Assistant [81.86136560157266]
SoTaNa is an open-source software development assistant.
It generates high-quality instruction-based data for the domain of software engineering.
It employs a parameter-efficient fine-tuning approach to enhance the open-source foundation model, LLaMA.
arXiv Detail & Related papers (2023-08-25T14:56:21Z) - Measuring the Carbon Intensity of AI in Cloud Instances [91.28501520271972]
We provide a framework for measuring software carbon intensity, and propose to measure operational carbon emissions.
We evaluate a suite of approaches for reducing emissions on the Microsoft Azure cloud compute platform.
arXiv Detail & Related papers (2022-06-10T17:04:04Z) - SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines.
This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z) - Case study of SARS-CoV-2 transmission risk assessment in indoor
environments using cloud computing resources [1.3150679728390269]
We showcase how a complex computational framework can be abstracted and deployed on cloud services.
We deploy the simulation framework on the Azure cloud framework, utilizing the Dendro-kT mesh generation tool and PETSc solvers.
We compare the performance of the cloud machines with state-of-the-art HPC machine TACC Frontera.
arXiv Detail & Related papers (2021-11-17T19:28:18Z) - Molecular Dynamics Simulations on Cloud Computing and Machine Learning
Platforms [0.8093262393618671]
We see a paradigm shift in the computational structure, design, and requirements of scientific computing applications.
Data-driven and machine learning approaches are being used to support, speed-up, and enhance scientific computing applications.
Cloud computing platforms are increasingly appealing for scientific computing.
arXiv Detail & Related papers (2021-11-11T21:20:26Z) - Edge-Cloud Polarization and Collaboration: A Comprehensive Survey [61.05059817550049]
We conduct a systematic review for both cloud and edge AI.
We are the first to set up the collaborative learning mechanism for cloud and edge modeling.
We discuss potentials and practical experiences of some on-going advanced edge AI topics.
arXiv Detail & Related papers (2021-11-11T05:58:23Z) - Auto-Split: A General Framework of Collaborative Edge-Cloud AI [49.750972428032355]
This paper describes the techniques and engineering practice behind Auto-Split, an edge-cloud collaborative prototype of Huawei Cloud.
To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.
arXiv Detail & Related papers (2021-08-30T08:03:29Z) - The MIT Supercloud Dataset [3.375826083518709]
We introduce the MIT Supercloud dataset which aims to foster innovative AI/ML approaches to the analysis of large scale HPC and datacenter/cloud operations.
We provide detailed monitoring logs from the MIT Supercloud system, which include CPU and GPU usage by jobs, memory usage, file system logs, and physical monitoring data.
This paper discusses the details of the dataset, collection methodology, data availability, and discusses potential challenge problems being developed using this data.
arXiv Detail & Related papers (2021-08-04T13:06:17Z) - A Privacy-Preserving Distributed Architecture for
Deep-Learning-as-a-Service [68.84245063902908]
This paper introduces a novel distributed architecture for deep-learning-as-a-service.
It is able to preserve the user sensitive data while providing Cloud-based machine and deep learning services.
arXiv Detail & Related papers (2020-03-30T15:12:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.