Fugu-MT 論文翻訳(概要): Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding

論文の概要: Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding

arxiv url: http://arxiv.org/abs/2510.05788v1
Date: Tue, 07 Oct 2025 11:09:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-08 17:57:08.217988
Title: Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding
Title（参考訳）: Mellum: マルチファイルプロジェクト理解によるIDEのコンテクストコード補完
Authors: Nikita Pavlichenko, Iurii Nazarov, Ivan Dolgov, Ekaterina Garanina, Dmitry Ustalov, Ivan Bondyrev, Kseniia Lysaniuk, Evgeniia Vu, Kirill Chekmenev, Joseph Shtok, Yaroslav Golubev, Anton Semenkin, Uladzislau Sazanovich,
Abstract要約: Mellumモデルは、4Bパラメータを持ち、Llamaスタイルのアーキテクチャを採用し、寛容にライセンスされた多言語コードの4Tトークンで事前訓練されている。コンテクスト化されたインエディターコンプリートを生成するためのエンドツーエンド産業パイプラインについて述べる。
参考スコア（独自算出の注目度）: 4.90964288676588
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present the Mellum models family, open-weight code completion models designed for interactive use in JetBrains IDEs. Mellums have 4B parameters, adopt a Llama-style architecture, and are pre-trained on ~4T tokens of permissively licensed, multi-language code. Our studies show that (i) careful data curation and staged training significantly improve the model's quality, (ii) editor-critical capabilities such as context packing are necessary for high-quality suggestions, and (iii) a compact, task-focused model can meet the cost and latency constraints of interactive completion. In the paper, we describe an end-to-end industrial pipeline for producing contextualized in-editor completion: disciplined data governance, multi-stage training that includes fill-in-the-middle and project context via supervised fine-tuning, and alignment via direct preference optimization using feedback from real-world scenarios. Our quality evaluations include both large-scale offline benchmarks and online telemetry from production deployments in JetBrains IDEs. Mellums are released under the Apache-2.0 license on HuggingFace, with a public model card providing a reproducible reference for practitioners. Our experience offers a pragmatic blueprint for taking a focused, open model from a research prototype to at scale production for hundreds of thousands of users.
Abstract（参考訳）: JetBrains IDEのインタラクティブな使用のために設計された,オープンウェイトなコード補完モデルであるMellumモデルファミリを提示する。 Mellum は 4B パラメータを持ち、Llama スタイルのアーキテクチャを採用し、寛容にライセンスされた多言語コードの ~4T トークンで事前訓練されている。私たちの研究は一慎重なデータキュレーション及び段階訓練により、モデルの質が著しく向上する。 (II)高品質な提案には,コンテキストパッキングなどのエディタクリティカルな機能が必要である。三) 対話的な完了のコストと遅延の制約を満たす、コンパクトなタスク中心のモデル。本稿では,コンテクスト化されたエディター内補完を生成するためのエンドツーエンドの産業パイプラインについて述べる。規律付きデータガバナンス,教師付き微調整による中途半端とプロジェクトコンテキストを含む多段階トレーニング,実世界のシナリオからのフィードバックによる直接的な選好最適化によるアライメント。私たちの品質評価には、大規模オフラインベンチマークとJetBrains IDEのプロダクションデプロイメントからのオンラインテレメトリの両方が含まれています。 MellumsはHuggingFaceのApache-2.0ライセンスでリリースされており、実践者への再現可能な参照を提供する公開モデルカードが提供されている。私たちの経験は、研究プロトタイプから数十万人のユーザのための大規模生産まで、集中型でオープンなモデルを実現するための実践的な青写真を提供しています。

論文の概要: Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding

関連論文リスト