Fugu-MT 論文翻訳(概要): ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation

論文の概要: ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation

arxiv url: http://arxiv.org/abs/2605.12857v1
Date: Wed, 13 May 2026 01:04:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:27.744614
Title: ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation
Title（参考訳）: ChipMATE:強化RTL生成のための強化学習によるマルチエージェントトレーニング
Authors: Zhongkai Yu, Yichen Lin, Chenyang Zhou, Yuwei Zhang, Kun Zhou, Junxia Cui, Haotian Ye, Zhengding Hu, Zaifeng Pan, Ruiyi Wang, Yujie Zhao, Hejia Zhang, Jingbo Shang, Jishen Zhao, Yufei Ding,
Abstract要約: ChipMATEは、RTL生成のための最初の自己学習型マルチエージェントフレームワークである。 ChipMATEは産業的な実践に触発され、VerilogエージェントとPythonのリファレンスモデルエージェントをペアにし、相互に出力を検証する。 ChipMATEは、VerilogEval V2で75.0%と80.1%パス@1を4Bと9Bベースモデルで達成している。
参考スコア（独自算出の注目度）: 55.947962672433675
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Existing API-based agentic systems for RTL code generation are fundamentally misaligned with industrial practice: they assume a golden testbench is available at generation time, rely on closed-source APIs incompatible with chip vendors' air-gapped security requirements, and cannot be trained on vendors' proprietary RTL codebases, leaving valuable internal data unused. Recent self-trained models address the deployment constraint but remain single-turn generators that overlook the critical role of verification in real industrial flows. To bridge these gaps, we present ChipMATE, the first self-trained multi-agent framework for RTL generation. Inspired by industrial practice where correctness emerges from cross-comparison between independently written RTL modules and reference models, ChipMATE pairs a Verilog agent with a Python reference-model agent that mutually verify each other's outputs without any golden oracle. We design a backtrack-based inference workflow to prevent error propagation across turns, and a two-stage training pipeline that first trains each agent individually to saturate its code-generation capability, then trains the team jointly to collaborate effectively. To support the training, we further build a hybrid data-generation framework that produces 64.4K high-quality reference model training samples. ChipMATE achieves 75.0\% and 80.1\% pass@1 on VerilogEval V2 with 4B and 9B base models, outperforming all existing self-trained models and even DeepSeek V4 with 1600B parameters. Our code and model weights are publicly available in https://github.com/zhongkaiyu/ChipMATE.
Abstract（参考訳）: RTLコード生成のための既存のAPIベースのエージェントシステムは、基本的に産業的なプラクティスと不一致である。彼らは、ゴールデンテストベンチが世代毎に利用可能であり、チップベンダーの空調されたセキュリティ要件と互換性のないクローズドソースAPIに依存しており、ベンダー独自のRTLコードベースでトレーニングできないため、貴重な内部データが使われていないと仮定している。近年の自己学習型モデルでは, 実際の産業フローにおける検証の重要な役割を見落としている単一ターン発生器の配置制約に対処している。これらのギャップを埋めるために、RTL生成のための最初の自己学習型マルチエージェントフレームワークであるChipMATEを提案する。 ChipMATEは、独立に書かれたRTLモジュールと参照モデル間の相互比較から正しさが生じる産業プラクティスに触発され、Pythonの参照モデルエージェントとVerilogエージェントをペアにし、黄金のオラクルを使わずに互いの出力を相互に検証する。我々は、ターン間のエラー伝搬を防止するために、バックトラックベースの推論ワークフローを設計し、まず各エージェントを個別に訓練し、コード生成能力を飽和させ、チームと共同で協調して効果的に協力する2段階のトレーニングパイプラインを設計する。トレーニングを支援するために,64.4Kの高品質参照モデルトレーニングサンプルを生成するハイブリッドデータ生成フレームワークをさらに構築する。 ChipMATEは、4Bおよび9Bベースモデルを持つVerilogEval V2上で75.0\%と80.1\%のpass@1を達成する。コードとモデルの重み付けはhttps://github.com/zhongkaiyu/ChipMATE.comで公開されています。

論文の概要: ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation

関連論文リスト