Fugu-MT 論文翻訳(概要): Let the Barbarians In: How AI Can Accelerate Systems Performance Research

論文の概要: Let the Barbarians In: How AI Can Accelerate Systems Performance Research

arxiv url: http://arxiv.org/abs/2512.14806v2
Date: Thu, 18 Dec 2025 08:28:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-19 14:03:03.255297
Title: Let the Barbarians In: How AI Can Accelerate Systems Performance Research
Title（参考訳）: AIがシステムパフォーマンス研究を加速する方法
Authors: Audrey Cheng, Shu Liu, Melissa Pan, Zhifei Li, Shubham Agarwal, Mert Cemri, Bowen Wang, Alexander Krentsel, Tian Xia, Jongseok Park, Shuo Yang, Jeff Chen, Lakshya Agrawal, Ashwin Naren, Shulu Li, Ruiying Ma, Aditya Desai, Jiarong Xing, Koushik Sen, Matei Zaharia, Ion Stoica,
Abstract要約: 我々は、この繰り返しサイクルを、AI駆動システム研究の生成、評価、洗練のサイクルと呼ぶ。我々はADRSが生成するソリューションが人間の最先端の設計に適合し、さらに優れることを示した。
参考スコア（独自算出の注目度）: 80.43506848683633
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Artificial Intelligence (AI) is beginning to transform the research process by automating the discovery of new solutions. This shift depends on the availability of reliable verifiers, which AI-driven approaches require to validate candidate solutions. Research focused on improving systems performance is especially well-suited to this paradigm because system performance problems naturally admit such verifiers: candidates can be implemented in real systems or simulators and evaluated against predefined workloads. We term this iterative cycle of generation, evaluation, and refinement AI-Driven Research for Systems (ADRS). Using several open-source ADRS instances (i.e., OpenEvolve, GEPA, and ShinkaEvolve), we demonstrate across ten case studies (e.g., multi-region cloud scheduling, mixture-of-experts load balancing, LLM-based SQL, transaction scheduling) that ADRS-generated solutions can match or even outperform human state-of-the-art designs. Based on these findings, we outline best practices (e.g., level of prompt specification, amount of feedback, robust evaluation) for effectively using ADRS, and we discuss future research directions and their implications. Although we do not yet have a universal recipe for applying ADRS across all of systems research, we hope our preliminary findings, together with the challenges we identify, offer meaningful guidance for future work as researcher effort shifts increasingly toward problem formulation and strategic oversight. Note: This paper is an extension of our prior work [14]. It adds extensive evaluation across multiple ADRS frameworks and provides deeper analysis and insights into best practices.
Abstract（参考訳）: 人工知能(AI)は、新しいソリューションの発見を自動化することによって、研究プロセスを変革し始めている。このシフトは、AI駆動のアプローチが候補ソリューションを検証するために必要な信頼性の高い検証器の可用性に依存する。システム性能の改善に焦点を当てた研究はこのパラダイムに特に適しているのは、システム性能の問題がこのような検証を自然に認めるからである。この繰り返しサイクルをAI駆動システム研究(ADRS)と呼ぶ。いくつかのオープンソースのADRSインスタンス(OpenEvolve、GEPA、S ShinkaEvolve)を使用して、ADRSの生成したソリューションがヒトの最先端設計にマッチしたり、あるいは性能を上回ることのできる10のケーススタディ(マルチリージョンクラウドスケジューリング、Mix-of-expertsロードバランシング、LLMベースのSQL、トランザクションスケジューリングなど)を実証する。これらの知見に基づき,ADRSを効果的に活用するためのベストプラクティス(即時仕様のレベル,フィードバックの量,堅牢な評価)を概説し,今後の研究の方向性とその意義について論じる。システム研究全般にADRSを適用するための普遍的なレシピをまだ持っていませんが、予備的な発見とともに、我々が特定した課題とともに、研究者の努力が問題定式化や戦略的監視へとシフトするにつれて、将来の作業に有意義なガイダンスを提供することを期待しています。注:本論文はこれまでの研究の延長である[14]。複数のADRSフレームワークに広範な評価を追加し、ベストプラクティスに関するより深い分析と洞察を提供する。

論文の概要: Let the Barbarians In: How AI Can Accelerate Systems Performance Research

関連論文リスト