Fugu-MT 論文翻訳(概要): Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?

論文の概要: Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?

arxiv url: http://arxiv.org/abs/2603.25719v1
Date: Thu, 26 Mar 2026 17:57:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-27 20:52:48.417393
Title: Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?
Title（参考訳）: 高次合成のためのエージェントファクター:汎用符号化エージェントはどこまでハードウェア最適化に使えるのか?
Authors: Abhishek Bhandwaldar, Mihir Choudhury, Ruchir Puri, Akash Srivastava,
Abstract要約: 本稿では,汎用符号化エージェントが高レベルのアルゴリズム仕様からハードウェア設計をいかに最適化できるかを実証研究する。複数の自律的最適化エージェントの構築と調整を行う2段階パイプラインであるエージェントファクトリを導入する。 AMD Vitis HLS を用いた Claude Code (Opus4.5/4.6) を用いた HLS-Eval と Rodinia-HLS の 12 個のカーネルに対するアプローチの評価を行った。
参考スコア（独自算出の注目度）: 8.899459735174174
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We present an empirical study of how far general-purpose coding agents -- without hardware-specific training -- can optimize hardware designs from high-level algorithmic specifications. We introduce an agent factory, a two-stage pipeline that constructs and coordinates multiple autonomous optimization agents. In Stage~1, the pipeline decomposes a design into sub-kernels, independently optimizes each using pragma and code-level transformations, and formulates an Integer Linear Program (ILP) to assemble globally promising configurations under an area constraint. In Stage~2, it launches $N$ expert agents over the top ILP solutions, each exploring cross-function optimizations such as pragma recombination, loop fusion, and memory restructuring that are not captured by sub-kernel decomposition. We evaluate the approach on 12 kernels from HLS-Eval and Rodinia-HLS using Claude Code (Opus~4.5/4.6) with AMD Vitis HLS. Scaling from 1 to 10 agents yields a mean $8.27\times$ speedup over baseline, with larger gains on harder benchmarks: streamcluster exceeds $20\times$ and kmeans reaches approximately $10\times$. Across benchmarks, agents consistently rediscover known hardware optimization patterns without domain-specific training, and the best designs often do not originate from top-ranked ILP candidates, indicating that global optimization exposes improvements missed by sub-kernel search. These results establish agent scaling as a practical and effective axis for HLS optimization.
Abstract（参考訳）: ハードウェア固有のトレーニングを伴わない汎用コーディングエージェントが、ハイレベルなアルゴリズム仕様からハードウェア設計をいかに最適化できるかを実証研究する。複数の自律最適化エージェントの構築と調整を行う2段階パイプラインであるエージェントファクトリを導入する。 Stage~1では、パイプラインはサブカーネルに設計を分解し、プラグマとコードレベルの変換を使用してそれぞれを独立に最適化し、Integer Linear Program (ILP) を定式化し、領域制約の下でグローバルに期待できる構成を組み立てる。 Stage~2では、上位のILPソリューション上でN$のエキスパートエージェントを起動し、それぞれがサブカーネル分解によってキャプチャされないプラグマ再結合、ループ融合、メモリ再構成などのクロスファンクショナル最適化を探索する。 AMD Vitis HLS を用いた Claude Code (Opus~4.5/4.6) を用いた HLS-Eval と Rodinia-HLS の 12 個のカーネルに対するアプローチの評価を行った。 1から10のエージェントへのスケーリングでは、ベースラインよりも平均8.27\times$のスピードアップが得られ、より厳しいベンチマークでは大きな利得が得られている:ストリームクラスタは20\times$を超え、kmeansはおよそ10\times$に達する。ベンチマーク全体を通じて、エージェントはドメイン固有のトレーニングなしで既知のハードウェア最適化パターンを再発見し、最良の設計は上位のICP候補から派生しないことが多い。これらの結果から,HLS最適化の実践的かつ効果的な軸としてエージェントスケーリングが確立された。

論文の概要: Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?

関連論文リスト