Testing GitHub projects on custom resources using unprivileged
Kubernetes runners
- URL: http://arxiv.org/abs/2305.10346v1
- Date: Wed, 17 May 2023 16:31:41 GMT
- Title: Testing GitHub projects on custom resources using unprivileged
Kubernetes runners
- Authors: Igor Sfiligoi, Daniel McDonald, Rob Knight and Frank W\"urthwein
- Abstract summary: GitHub is a popular repository for hosting software projects.
Native GitHub Actions make it easy for software developers to validate new commits and have confidence that new code does not introduce major bugs.
The freely available test environments are limited to only a few popular setups but can be extended with custom Action Runners.
- Score: 1.137903861863692
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: GitHub is a popular repository for hosting software projects, both due to
ease of use and the seamless integration with its testing environment. Native
GitHub Actions make it easy for software developers to validate new commits and
have confidence that new code does not introduce major bugs. The freely
available test environments are limited to only a few popular setups but can be
extended with custom Action Runners. Our team had access to a Kubernetes
cluster with GPU accelerators, so we explored the feasibility of automatically
deploying GPU-providing runners there. All available Kubernetes-based setups,
however, require cluster-admin level privileges. To address this problem, we
developed a simple custom setup that operates in a completely unprivileged
manner. In this paper we provide a summary description of the setup and our
experience using it in the context of two Knight lab projects on the Prototype
National Research Platform system.
Related papers
- SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving [90.32201622392137]
We present SwingArena, a competitive evaluation framework for Large Language Models (LLMs)<n>Unlike traditional static benchmarks, SwingArena models the collaborative process of software by pairing LLMs as iterations, who generate patches, and reviewers, who create test cases and verify the patches through continuous integration (CI) pipelines.
arXiv Detail & Related papers (2025-05-29T18:28:02Z) - GitHub Proxy Server: A tool for supporting massive data collection on GitHub [0.0]
GitHub is the most popular social coding platform and widely used by developers and organizations to host their open-source projects around the world.<n>The platform has a web API that allow developers collect information from public repositories hosted on it.<n>However, collecting massive amount of data from GitHub can be very challenging due to existing restrictions and abuse detection mechanisms.<n>We present a tool, called GitHub Proxy Server, which abstracts such complexities into a tool that is independent on operational system and programming language.
arXiv Detail & Related papers (2025-05-23T19:00:32Z) - EnvBench: A Benchmark for Automated Environment Setup [76.02998475135824]
Large Language Models have enabled researchers to focus on practical repository-level tasks in software engineering domain.
Existing studies on environment setup introduce innovative agentic strategies, but their evaluation is often based on small datasets.
To address this gap, we introduce a comprehensive environment setup benchmark EnvBench.
arXiv Detail & Related papers (2025-03-18T17:19:12Z) - PyPackIT: Automated Research Software Engineering for Scientific Python Applications on GitHub [0.0]
PyPackIT is a user-friendly, ready-to-use software that enables scientists to focus on the scientific aspects of their projects.
PyPackIT offers a robust project infrastructure including a build-ready Python package skeleton, a fully operational documentation and test suite, and a control center for dynamic project management.
arXiv Detail & Related papers (2025-03-06T19:41:55Z) - RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph [63.87660059104077]
We present RepoGraph, a plug-in module that manages a repository-level structure for modern AI software engineering solutions.
RepoGraph substantially boosts the performance of all systems, leading to a new state-of-the-art among open-source frameworks.
arXiv Detail & Related papers (2024-10-03T05:45:26Z) - Automatic Categorization of GitHub Actions with Transformers and Few-shot Learning [12.254055731378045]
GitHub Actions (GHA) have been conceived to provide developers with a practical tool to create and maintain a pipeline.
To expose actions to search engines, GitHub allows developers to assign them to one or more categories manually.
We propose Gavel, a practical solution to increasing the visibility of actions in GitHub.
arXiv Detail & Related papers (2024-07-24T02:27:36Z) - Benchmarking Predictive Coding Networks -- Made Simple [48.652114040426625]
We tackle the problems of efficiency and scalability for predictive coding networks (PCNs) in machine learning.
We propose a library, called PCX, that focuses on performance and simplicity, and use it to implement a large set of standard benchmarks.
We perform extensive tests on such benchmarks using both existing algorithms for PCNs, as well as adaptations of other methods popular in the bio-plausible deep learning community.
arXiv Detail & Related papers (2024-07-01T10:33:44Z) - MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution [47.850418420195304]
Large Language Models (LLMs) have shown promise in code generation but face difficulties in resolving GitHub issues.
We propose a novel Multi-Agent framework for GitHub Issue reSolution, MAGIS, consisting of four agents customized for software evolution.
arXiv Detail & Related papers (2024-03-26T17:57:57Z) - GitAgent: Facilitating Autonomous Agent with GitHub by Tool Extension [81.44231422624055]
A growing area of research focuses on Large Language Models (LLMs) equipped with external tools capable of performing diverse tasks.
In this paper, we introduce GitAgent, an agent capable of achieving the autonomous tool extension from GitHub.
arXiv Detail & Related papers (2023-12-28T15:47:30Z) - GitBug-Actions: Building Reproducible Bug-Fix Benchmarks with GitHub
Actions [8.508198765617196]
We present GitBug-Actions, a novel tool for building bug-fix benchmarks with modern and fully-reproducible bug-fixes.
GitBug-Actions relies on the most popular CI platform, GitHub Actions, to detect bug-fixes.
To demonstrate our toolchain, we deploy GitBug-Actions to build a proof-of-concept Go bug-fix benchmark.
arXiv Detail & Related papers (2023-10-24T09:04:14Z) - Modern Software Development for JUNO offline software [0.0]
The Jiangmen Underground Neutrino Observatory (JUNO), under construction in South China, primarily aims to determine the neutrino mass hierarchy and to precise measure the neutrino oscillation parameters.
The development of the JUNO offline software (JUNOSW) started in 2012, and it is quite challenging to maintain the JUNOSW for such a long time.
New stringent requirements came out, such as how to reduce the building time for the whole project, how to deploy offline algorithms to an online environment, and how to improve the code quality with code review and continuous integration.
This contribution will present the software development system based
arXiv Detail & Related papers (2023-09-25T00:13:47Z) - Git-Theta: A Git Extension for Collaborative Development of Machine
Learning Models [26.107117592578632]
We introduce Git-Theta, a version control system for machine learning models.
Git-Theta is an extension to Git, the most widely used version control software.
arXiv Detail & Related papers (2023-06-07T15:37:50Z) - torchgfn: A PyTorch GFlowNet library [56.071033896777784]
torchgfn is a PyTorch library that aims to address this need.
It provides users with a simple API for environments and useful abstractions for samplers and losses.
arXiv Detail & Related papers (2023-05-24T00:20:59Z) - AutoCoreset: An Automatic Practical Coreset Construction Framework [65.37876706107764]
A coreset is a tiny weighted subset of an input set, that closely resembles the loss function.
We propose an automatic framework for constructing coresets, which requires only the input data and the desired cost function from the user.
We show that while this set is limited, the coreset is quite general.
arXiv Detail & Related papers (2023-05-19T19:59:52Z) - Repro: An Open-Source Library for Improving the Reproducibility and
Usability of Publicly Available Research Code [74.28810048824519]
Repro is an open-source library which aims at improving the usability of research code.
It provides a lightweight Python API for running software released by researchers within Docker containers.
arXiv Detail & Related papers (2022-04-29T01:54:54Z) - Student Teamwork on Programming Projects: What can GitHub logs show us? [3.764846583322767]
We collected GitHub logs from two programming projects in two offerings of a CS2 Java programming course for computer science majors.
Students worked in pairs for both projects (one optional, the other mandatory) in each year.
We can identify the students' teamwork style automatically from their submission logs.
arXiv Detail & Related papers (2020-08-25T20:41:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.