Fugu-MT 論文翻訳(概要): InCoder-32B: Code Foundation Model for Industrial Scenarios

論文の概要: InCoder-32B: Code Foundation Model for Industrial Scenarios

arxiv url: http://arxiv.org/abs/2603.16790v1
Date: Tue, 17 Mar 2026 17:01:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.436962
Title: InCoder-32B: Code Foundation Model for Industrial Scenarios
Title（参考訳）: InCoder-32B:産業シナリオのためのコード基盤モデル
Authors: Jian Yang, Wei Zhang, Jiajun Wu, Junhang Cheng, Shawn Guo, Haowen Wang, Weicheng Gu, Yaxin Du, Joseph Li, Fanglin Xu, Yizhi Li, Lin Jing, Yuanbo Wang, Yuhan Gao, Ruihao Gong, Chuan Hao, Ran Tao, Aishan Liu, Tuney Zheng, Ganqu Cui, Zhoujun Li, Mingjie Tang, Chenghua Lin, Wayne Xin Zhao, Xianglong Liu, Ming Zhou, Bryan Dai, Weifeng Lv,
Abstract要約: InCoder-32B(Industrial-Coder-32B)を導入し,チップ設計,GPUカーネル最適化,組み込みシステム,コンパイラ最適化,3Dモデリングなどのコードインテリジェンスを統合する。 InCoder-32Bは、産業領域にまたがる強力なオープンソースベースラインを確立しながら、一般的なタスクにおいて高い競争力を発揮する。
参考スコア（独自算出の注目度）: 95.28947955949296
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent code large language models have achieved remarkable progress on general programming tasks. Nevertheless, their performance degrades significantly in industrial scenarios that require reasoning about hardware semantics, specialized language constructs, and strict resource constraints. To address these challenges, we introduce InCoder-32B (Industrial-Coder-32B), the first 32B-parameter code foundation model unifying code intelligence across chip design, GPU kernel optimization, embedded systems, compiler optimization, and 3D modeling. By adopting an efficient architecture, we train InCoder-32B from scratch with general code pre-training, curated industrial code annealing, mid-training that progressively extends context from 8K to 128K tokens with synthetic industrial reasoning data, and post-training with execution-grounded verification. We conduct extensive evaluation on 14 mainstream general code benchmarks and 9 industrial benchmarks spanning 4 specialized domains. Results show InCoder-32B achieves highly competitive performance on general tasks while establishing strong open-source baselines across industrial domains.
Abstract（参考訳）: 最近のコード大言語モデルは、一般的なプログラミングタスクにおいて顕著な進歩を遂げている。それでも、ハードウェアのセマンティクス、特殊な言語構成、厳格なリソース制約に関する推論を必要とする産業シナリオでは、パフォーマンスが著しく低下する。 InCoder-32B(Industrial-Coder-32B)は、チップ設計、GPUカーネル最適化、組み込みシステム、コンパイラ最適化、および3Dモデリングにおけるコードインテリジェンスを統合する最初の32Bパラメータコード基盤モデルである。効率的なアーキテクチャを採用することで、インコーダ-32Bをスクラッチからトレーニングし、一般的なコード事前トレーニング、キュレートされたインダストリアルコードアニール、コンテクストを8Kから128Kのトークンに段階的に拡張するミドルトレーニング、そして実行地上検証でトレーニングします。 4つの専門ドメインにまたがる14の主流の汎用コードベンチマークと9の産業ベンチマークに対して、広範な評価を行う。結果から,InCoder-32Bは,産業領域にまたがる強力なオープンソースベースラインを確立しつつ,一般的なタスクにおいて高い競争力を発揮することが示された。

論文の概要: InCoder-32B: Code Foundation Model for Industrial Scenarios

関連論文リスト