Fugu-MT 論文翻訳(概要): DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

論文の概要: DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

arxiv url: http://arxiv.org/abs/2603.21430v1
Date: Sun, 22 Mar 2026 22:39:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:39.414002
Title: DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation
Title（参考訳）: DomAgent: ドメイン特化コード生成のための知識グラフの活用とケースベースの推論
Authors: Shuai Wang, Dhasarathy Parthasarathy, Robert Feldt, Yinan Yu,
Abstract要約: DomAgentは大規模言語モデル(LLM)のための自律型コーディングエージェントである DomRetrieverは、人間がドメイン固有の知識を学ぶ方法をエミュレートする、新しい検索モジュールだ。実験の結果,DomAgentはドメイン固有のコード生成を大幅に向上させることがわかった。
参考スコア（独自算出の注目度）: 10.633307460813523
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have shown impressive capabilities in code generation. However, because most LLMs are trained on public domain corpora, directly applying them to real-world software development often yields low success rates, as these scenarios frequently require domain-specific knowledge. In particular, domain-specific tasks usually demand highly specialized solutions, which are often underrepresented or entirely absent in the training data of generic LLMs. To address this challenge, we propose DomAgent, an autonomous coding agent that bridges this gap by enabling LLMs to generate domain-adapted code through structured reasoning and targeted retrieval. A core component of DomAgent is DomRetriever, a novel retrieval module that emulates how humans learn domain-specific knowledge, by combining conceptual understanding with experiential examples. It dynamically integrates top-down knowledge-graph reasoning with bottom-up case-based reasoning, enabling iterative retrieval and synthesis of structured knowledge and representative cases to ensure contextual relevance and broad task coverage. DomRetriever can operate as part of DomAgent or independently with any LLM for flexible domain adaptation. We evaluate DomAgent on an open benchmark dataset in the data science domain (DS-1000) and further apply it to real-world truck software development tasks. Experimental results show that DomAgent significantly enhances domain-specific code generation, enabling small open-source models to close much of the performance gap with large proprietary LLMs in complex, real-world applications. The code is available at: https://github.com/Wangshuaiia/DomAgent.
Abstract（参考訳）: 大規模言語モデル(LLM)は、コード生成において印象的な機能を示している。しかしながら、ほとんどのLLMはパブリックドメインのコーパスでトレーニングされているため、実際のソフトウェア開発に直接適用すると、ドメイン固有の知識を必要とすることが多いため、成功率が低い場合が多い。特に、ドメイン固有のタスクは、通常高度に専門化されたソリューションを必要とするが、ジェネリックLLMのトレーニングデータには、しばしば不足しているか、完全に欠落している。この課題に対処するために、構造化推論とターゲット検索を通じてLLMがドメイン適応コードを生成することで、このギャップを埋める自律的符号化エージェントであるDomAgentを提案する。 DomAgentのコアコンポーネントであるDomRetrieverは、概念的理解と経験的な例を組み合わせることで、人間がドメイン固有の知識を学ぶ方法をエミュレートする新しい検索モジュールである。トップダウンの知識グラフ推論とボトムアップのケースベースの推論を動的に統合し、構造化された知識と代表的事例の反復的検索と合成を可能にし、文脈的関連性と幅広いタスクカバレッジを確保する。 DomRetrieverはDomAgentの一部として、あるいは任意のLLMと独立して、柔軟なドメイン適応を行うことができる。我々はDomAgentをデータサイエンス領域(DS-1000)のオープンなベンチマークデータセットで評価し、実際のトラックソフトウェア開発タスクに適用する。実験結果から,DomAgentはドメイン固有のコード生成を著しく向上させ,複雑な実世界のアプリケーションにおいて,小さなオープンソースモデルでLLMの大規模プロプライエタリな性能ギャップの大部分を埋めることができた。コードは、https://github.com/Wangshuaiia/DomAgent.comで入手できる。

論文の概要: DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

関連論文リスト