Fugu-MT 論文翻訳(概要): PromptFlow: Training Prompts Like Neural Networks

論文の概要: PromptFlow: Training Prompts Like Neural Networks

arxiv url: http://arxiv.org/abs/2510.12246v1
Date: Tue, 14 Oct 2025 07:56:12 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-15 19:02:32.232086
Title: PromptFlow: Training Prompts Like Neural Networks
Title（参考訳）: PromptFlow: ニューラルネットワークのようなプロンプトのトレーニング
Authors: Jingyi Wang, Hongyuan Zhu, Ye Niu, Yunhui Deng,
Abstract要約: 大規模言語モデル(LLM)は自然言語処理(NLP)タスクに大きな影響を与えている。プロンプトエンジニアリングの最近の進歩は、広範囲な再訓練に代わる有望な代替手段を提供する。本稿では,メタプロンプト,演算子,最適化,評価器にインスパイアされたモジュール型トレーニングフレームワークであるPromptFlowを提案する。
参考スコア（独自算出の注目度）: 17.90494213352502
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) have demonstrated profound impact on Natural Language Processing (NLP) tasks. However, their effective deployment across diverse domains often require domain-specific adaptation strategies, as generic models may underperform when faced with specialized data distributions. Recent advances in prompt engineering (PE) offer a promising alternative to extensive retraining by refining input instructions to align LLM outputs with task objectives. This paradigm has emerged as a rapid and versatile approach for model fine-tuning. Despite its potential, manual prompt design remains labor-intensive and heavily depends on specialized expertise, often requiring iterative human effort to achieve optimal formulations. To address this limitation, automated prompt engineering methodologies have been developed to systematically generate task-specific prompts. However, current implementations predominantly employ static update rules and lack mechanisms for dynamic strategy selection, resulting in suboptimal adaptation to varying NLP task requirements. Furthermore, most methods treat and update the whole prompts at each step, without considering editing prompt sections at a finer granularity. At last, in particular, the problem of how to recycle experience in LLM is still underexplored. To this end, we propose the PromptFlow, a modular training framework inspired by TensorFlow, which integrates meta-prompts, operators, optimization, and evaluator. Our framework can be equipped with the latest optimization methods and autonomously explores optimal prompt refinement trajectories through gradient-based meta-learning, requiring minimal task-specific training data. Specifically, we devise a reinforcement learning method to recycle experience for LLM in the PE process. Finally, we conduct extensive experiments on various datasets, and demonstrate the effectiveness of PromptFlow.
Abstract（参考訳）: 大規模言語モデル(LLM)は自然言語処理(NLP)タスクに大きな影響を与えている。しかしながら、様々なドメインにまたがる効果的なデプロイメントにはドメイン固有の適応戦略が必要になることが多い。近年のプロンプトエンジニアリング(PE)の進歩は、LLM出力をタスク目標に合わせるために入力命令を精算することで、広範囲なリトレーニングに代わる有望な代替手段を提供する。このパラダイムは、モデル微調整のための迅速で汎用的なアプローチとして登場した。その可能性にもかかわらず、手動のプロンプトデザインは労働集約的であり、しばしば最適な定式化を達成するために反復的な人間の努力を必要とする特別な専門知識に大きく依存している。この制限に対処するため、タスク固有のプロンプトを体系的に生成する自動プロンプトエンジニアリング手法が開発されている。しかし、現在の実装では静的な更新規則と動的戦略選択のメカニズムが欠如しており、それによって様々なNLPタスク要求に最適化される。さらに、ほとんどのメソッドは、プロンプトセクションをより細かい粒度で編集することなく、各ステップでプロンプト全体を処理し、更新する。最終的に、LLMでの体験をリサイクルする方法の問題は、まだ未解決のままである。この目的のために、TensorFlowにインスパイアされたモジュール型トレーニングフレームワークであるPromptFlowを提案し、メタプロンプト、演算子、最適化、評価器を統合する。我々のフレームワークには最新の最適化手法が組み込まれており、グラデーションベースのメタラーニングを通じて最適な即時改善軌道を自律的に探索し、最小限のタスク固有のトレーニングデータを必要とする。具体的には, PE プロセスにおける LLM のリサイクル経験を再現するための強化学習法を提案する。最後に,様々なデータセットについて広範な実験を行い,PromptFlowの有効性を実証する。

論文の概要: PromptFlow: Training Prompts Like Neural Networks

関連論文リスト