Fugu-MT Paper Translation (Abstract): Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

Paper overview: Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

arxiv url: http://arxiv.org/abs/2603.11082v1
Date: Tue, 10 Mar 2026 23:49:09 GMT
Status: Translation complete
Last updated in system: 2026-03-13 14:46:25.503933
Title: Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain
Authors: Yen-Ku Liu, Yun-Cheng Tsai
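Abstract summary: Introduces QoT, a quality-driven inference-time scaffold that turns a user goal into an ordered sequence of engineering steps. QoT is evaluated across three representative backend engineering domains: API Design, Data Communication, and File Systems.
Reference score (independently computed attention score): 0.0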
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advances in large language models (LLMs) have accelerated AI-assisted software development, yet practical deployment remains constrained by incomplete implementations, weak modularization, and inconsistent security practices. We introduce Questions-of-Thoughts (QoT), a quality-driven inference-time scaffold that turns a user goal into (i) an ordered sequence of engineering steps and (ii) stepwise self-questioning to verify constraints and reduce omission errors, while maintaining a lightweight reasoning record that stabilizes subsequent design decisions. We evaluate QoT across three representative backend engineering domains: API Design, Data Communication, and File Systems. Each task requires multi-module decomposition and exposes standard failure modes in LLM-generated systems. To enable data-driven comparison, we score generated artifacts using an ISO/IEC-inspired quality rubric that measures Scalability, Completeness, Modularity, and Security. We report domain-wise gains as the change in total quality score, defined as the QoT score minus the NoQoT score. Results show capacity-dependent improvements: QoT yields consistent quality improvements for larger models and more complex domains, while smaller models may exhibit trade-offs under tight context and planning budgets. We release an open artifact with prompts, scoring guidelines, raw generations, and scripts that reproduce the reported tables and figures to support applied AI and data analytics research.
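The gain described in the abstract (QoT score minus NoQoT score) can be sketched as follows. This is a minimal illustration, not the authors' released scripts: it assumes the total quality score is an unweighted sum of the four rubric dimensions, and the names `total_score`, `qot_gain`, and the example values are hypothetical placeholders rather than results from the paper.

```python
from typing import Dict

# The four rubric dimensions named in the abstract.
DIMENSIONS = ("Scalability", "Completeness", "Modularity", "Security")


def total_score(scores: Dict[str, float]) -> float:
    """Sum per-dimension rubric scores (ASSUMPTION: unweighted sum)."""
    missing = [d for d in DIMENSIONS if d not in scores]
    if missing:
        raise ValueError(f"missing rubric dimensions: {missing}")
    return sum(scores[d] for d in DIMENSIONS)


def qot_gain(qot: Dict[str, float], no_qot: Dict[str, float]) -> float:
    """Gain as defined in the abstract: QoT total minus NoQoT total."""
    return total_score(qot) - total_score(no_qot)


# Made-up placeholder values for illustration only, NOT results from the paper.
example_qot = {"Scalability": 3, "Completeness": 4, "Modularity": 4, "Security": 3}
example_no_qot = {"Scalability": 3, "Completeness": 3, "Modularity": 2, "Security": 3}

if __name__ == "__main__":
    print(qot_gain(example_qot, example_no_qot))
```

A positive value means the QoT run scored higher on the rubric than the NoQoT baseline; a negative value corresponds to the trade-offs the abstract mentions for smaller models.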
