Fugu-MT 論文翻訳(概要): Beyond the Prompt: Assessing Domain Knowledge Strategies for High-Dimensional LLM Optimization in Software Engineering

論文の概要: Beyond the Prompt: Assessing Domain Knowledge Strategies for High-Dimensional LLM Optimization in Software Engineering

arxiv url: http://arxiv.org/abs/2602.02752v1
Date: Mon, 02 Feb 2026 20:07:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-04 18:37:15.059941
Title: Beyond the Prompt: Assessing Domain Knowledge Strategies for High-Dimensional LLM Optimization in Software Engineering
Title（参考訳）: プロンプトを超えて - ソフトウェア工学における高次元LLM最適化のためのドメイン知識戦略の評価
Authors: Srinath Srinivasan, Tim Menzies,
Abstract要約: 私たちは、ドメイン知識を生成するために、人間と人工知能の戦略を比較します。構造化知識の統合により、LLMが高次元最適化のための効果的なウォームスタートを生成することができるかどうかを決定するために、4つの異なるアーキテクチャを体系的に評価する。
参考スコア（独自算出の注目度）: 4.168157981135697
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Background/Context: Large Language Models (LLMs) demonstrate strong performance on low-dimensional software engineering optimization tasks ($\le$11 features) but consistently underperform on high-dimensional problems where Bayesian methods dominate. A fundamental gap exists in understanding how systematic integration of domain knowledge (whether from humans or automated reasoning) can bridge this divide. Objective/Aim: We compare human versus artificial intelligence strategies for generating domain knowledge. We systematically evaluate four distinct architectures to determine if structured knowledge integration enables LLMs to generate effective warm starts for high-dimensional optimization. Method: We evaluate four approaches on MOOT datasets stratified by dimensionality: (1) Human-in-the-Loop Domain Knowledge Prompting (H-DKP), utilizing asynchronous expert feedback loops; (2) Adaptive Multi-Stage Prompting (AMP), implementing sequential constraint identification and validation; (3) Dimension-Aware Progressive Refinement (DAPR), conducting optimization in progressively expanding feature subspaces; and (4) Hybrid Knowledge-Model Approach (HKMA), synthesizing statistical scouting (TPE) with RAG-enhanced prompting. Performance is quantified via Chebyshev distance to optimal solutions and ranked using Scott-Knott clustering against an established baseline for LLM generated warm starts. Note that all human studies conducted as part of this study will comply with the policies of our local Institutional Review Board.
Abstract（参考訳）: 背景/背景: 大規模言語モデル(LLM)は、低次元のソフトウェア工学最適化タスク($11の機能)で強いパフォーマンスを示すが、ベイズ的手法が支配的な高次元問題では一貫してパフォーマンスが低い。ドメイン知識の体系的な統合(人間か自動推論か)がいかにしてこの分割を橋渡しできるかを理解するのに、根本的なギャップがある。 Objective/Aim: ドメイン知識を生成するための人間と人工知能の戦略を比較します。構造化知識の統合により、LLMが高次元最適化のための効果的なウォームスタートを生成することができるかどうかを決定するために、4つの異なるアーキテクチャを体系的に評価する。方法: 提案手法は,(1)H-DKP(Human-in-the-Loop Domain Knowledge Prompting),(2)Adaptive Multi-Stage Prompting(AMP),(3)Dimension-Aware Progressive Refinement(DAPR),(4)HKMA(Hybrid Knowledge-Model Approach),(4)RAG-enhanced prompting(TPE)の合成である。性能はチェビシェフ距離から最適解まで定量化され、Scott-Knottクラスタリングを用いて、LLM生成したウォームスタートの確立されたベースラインに対してランク付けされる。この研究の一環として実施されたすべての人間研究は、当社の地方機関審査委員会の方針に従うことに留意されたい。

論文の概要: Beyond the Prompt: Assessing Domain Knowledge Strategies for High-Dimensional LLM Optimization in Software Engineering

関連論文リスト