Globus Automation Services: Research process automation across the
space-time continuum
- URL: http://arxiv.org/abs/2208.09513v1
- Date: Fri, 19 Aug 2022 18:58:53 GMT
- Title: Globus Automation Services: Research process automation across the
space-time continuum
- Authors: Ryan Chard and Jim Pruyne and Kurt McKee and Josh Bryan and Brigitte
Raumann and Rachana Ananthakrishnan and Kyle Chard and Ian Foster
- Abstract summary: We report on services within the Globus research data management platform.
These services enable the specification of diverse research processes as reusable sets of actions, flows, and the execution of such flows.
We present use cases for Globus automation services, describe the design and implementation of the services, present microbenchmark studies, and review experiences applying the services in a range of applications.
- Score: 0.5905241534086076
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Research process automation--the reliable, efficient, and reproducible
execution of linked sets of actions on scientific instruments, computers, data
stores, and other resources--has emerged as an essential element of modern
science. We report here on new services within the Globus research data
management platform that enable the specification of diverse research processes
as reusable sets of actions, flows, and the execution of such flows in
heterogeneous research environments. To support flows with broad spatial extent
(e.g., from scientific instrument to remote data center) and temporal extent
(from seconds to weeks), these Globus automation services feature: 1) cloud
hosting for reliable execution of even long-lived flows despite sporadic
failures; 2) a declarative notation, and extensible asynchronous action
provider API, for defining and executing a wide variety of actions and flow
specifications involving arbitrary resources; 3) authorization delegation
mechanisms for secure invocation of actions. These services permit researchers
to outsource and automate the management of a broad range of research tasks to
a reliable, scalable, and secure cloud platform. We present use cases for
Globus automation services, describe the design and implementation of the
services, present microbenchmark studies, and review experiences applying the
services in a range of applications
Related papers
- OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation [51.27062359412488]
Office automation significantly enhances human productivity by automatically finishing routine tasks in the workflow.
We introduce OfficeBench, one of the first office automation benchmarks for evaluating current LLM agents' capability to address office tasks in realistic office.
Applying our customized evaluation methods on each task, we find that GPT-4 Omni achieves the highest pass rate of 47.00%, demonstrating a decent performance in handling office tasks.
arXiv Detail & Related papers (2024-07-26T19:27:17Z) - SWARM-SLR -- Streamlined Workflow Automation for Machine-actionable Systematic Literature Reviews [0.4915744683251149]
We propose the Streamlined Automation for Machine-actionable Systematic Literature Reviews (SWARM- SLR) to crowdsource the improvement of SLR efficiency.
By guidelines from the literature, we have composed a set of 65 requirements, spanning from planning to reporting a review.
Existing tools were assessed against these requirements and synthesized into the SWARM- SLR workflow prototype, a ready-for-operation software support tool.
arXiv Detail & Related papers (2024-07-26T10:46:14Z) - Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? [73.81908518992161]
We introduce Spider2-V, the first multimodal agent benchmark focusing on professional data science and engineering.
Spider2-V features real-world tasks in authentic computer environments and incorporating 20 enterprise-level professional applications.
These tasks evaluate the ability of a multimodal agent to perform data-related tasks by writing code and managing the GUI in enterprise data software systems.
arXiv Detail & Related papers (2024-07-15T17:54:37Z) - DiscoveryBench: Towards Data-Driven Discovery with Large Language Models [50.36636396660163]
We present DiscoveryBench, the first comprehensive benchmark that formalizes the multi-step process of data-driven discovery.
Our benchmark contains 264 tasks collected across 6 diverse domains, such as sociology and engineering.
Our benchmark, thus, illustrates the challenges in autonomous data-driven discovery and serves as a valuable resource for the community to make progress.
arXiv Detail & Related papers (2024-07-01T18:58:22Z) - WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? [83.19032025950986]
We study the use of large language model-based agents for interacting with software via web browsers.
WorkArena is a benchmark of 33 tasks based on the widely-used ServiceNow platform.
BrowserGym is an environment for the design and evaluation of such agents.
arXiv Detail & Related papers (2024-03-12T14:58:45Z) - OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge
Collaborative AutoML System [85.8338446357469]
We introduce OmniForce, a human-centered AutoML system that yields both human-assisted ML and ML-assisted human techniques.
We show how OmniForce can put an AutoML system into practice and build adaptive AI in open-environment scenarios.
arXiv Detail & Related papers (2023-03-01T13:35:22Z) - Machine Learning-based Orchestration of Containers: A Taxonomy and
Future Directions [25.763692543206773]
Existing mainstream cloud service providers have prevalently adopted container technologies in their distributed system infrastructures for automated application management.
To handle the automation of deployment, maintenance, autoscaling, and networking of containerized applications, container orchestration is proposed as an essential research problem.
In this paper, we present a comprehensive review of existing machine learning-based container orchestration approaches.
arXiv Detail & Related papers (2021-06-24T02:55:35Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - Artificial Intelligence (AI)-Centric Management of Resources in Modern
Distributed Computing Systems [22.550075095184514]
Cloud Data Centres (DCS) are large scale, complex, heterogeneous, and distributed across multiple networks and geographical boundaries.
The Internet of Things (IoT)-driven applications are producing a huge amount of data that requires real-time processing and fast response.
Existing Resource Management Systems (RMS) rely on either static or solutions inadequate for such composite and dynamic systems.
arXiv Detail & Related papers (2020-06-09T06:54:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.