Fugu-MT 論文翻訳(概要): GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

論文の概要: GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

arxiv url: http://arxiv.org/abs/2412.09722v1
Date: Thu, 12 Dec 2024 20:59:43 GMT
ステータス: 翻訳完了
システム内更新日: 2024-12-16 15:37:48.765033
Title: GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Title（参考訳）: GReaTer: 推論よりもグラディエンスにより、より小さな言語モデルがプロンプト最適化を強力にする
Authors: Sarkar Snigdha Sarathi Das, Ryo Kamoi, Bo Pang, Yusen Zhang, Caiming Xiong, Rui Zhang,
Abstract要約: 本稿では,タスク固有の推論よりも勾配情報を直接組み込む新しいプロンプト最適化手法であるGReaTerを紹介する。 GReaTerはタスク損失勾配を利用して、オープンソースの軽量言語モデルのためのプロンプトの自己最適化を可能にする。 GReaTerは、従来の最先端のプロンプト最適化手法を一貫して上回っている。
参考スコア（独自算出の注目度）: 52.17222304851524
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The effectiveness of large language models (LLMs) is closely tied to the design of prompts, making prompt optimization essential for enhancing their performance across a wide range of tasks. Many existing approaches to automating prompt engineering rely exclusively on textual feedback, refining prompts based solely on inference errors identified by large, computationally expensive LLMs. Unfortunately, smaller models struggle to generate high-quality feedback, resulting in complete dependence on large LLM judgment. Moreover, these methods fail to leverage more direct and finer-grained information, such as gradients, due to operating purely in text space. To this end, we introduce GReaTer, a novel prompt optimization technique that directly incorporates gradient information over task-specific reasoning. By utilizing task loss gradients, GReaTer enables self-optimization of prompts for open-source, lightweight language models without the need for costly closed-source LLMs. This allows high-performance prompt optimization without dependence on massive LLMs, closing the gap between smaller models and the sophisticated reasoning often needed for prompt refinement. Extensive evaluations across diverse reasoning tasks including BBH, GSM8k, and FOLIO demonstrate that GReaTer consistently outperforms previous state-of-the-art prompt optimization methods, even those reliant on powerful LLMs. Additionally, GReaTer-optimized prompts frequently exhibit better transferability and, in some cases, boost task performance to levels comparable to or surpassing those achieved by larger language models, highlighting the effectiveness of prompt optimization guided by gradients over reasoning. Code of GReaTer is available at https://github.com/psunlpgroup/GreaTer.
Abstract（参考訳）: 大規模言語モデル(LLM)の有効性はプロンプトの設計と密接に結びついているので、幅広いタスクでパフォーマンスを向上させるために、迅速な最適化が不可欠である。プロンプトエンジニアリングを自動化するための既存の多くのアプローチは、テキストフィードバックにのみ依存しており、大規模で計算コストのかかるLLMによって特定される推論エラーのみに基づいてプロンプトを精査する。残念なことに、小さなモデルは高品質なフィードバックを生成するのに苦労し、結果として大きなLCM判断に完全に依存する。さらに、これらの手法はテキスト空間内で純粋に動作するため、勾配のようなより直接的できめ細かい情報を利用することができない。この目的のために,タスク固有の推論よりも勾配情報を直接組み込む新しいプロンプト最適化手法であるGReaTerを紹介する。タスク損失勾配を利用することで、GReaTerは、低コストでクローズドソースのLLMを必要としない、オープンソースの軽量言語モデルのプロンプトの自己最適化を可能にする。これにより、大規模なLLMに依存せずに高速なプロンプト最適化が可能となり、より小さなモデルと迅速な改良に必要な洗練された推論の間のギャップを埋めることができた。 BBH、GSM8k、FOLIOを含む様々な推論タスクに対する広範囲な評価は、GReaTerが従来の最先端のプロンプト最適化手法よりも一貫して優れていることを示した。さらに、GReaTerに最適化されたプロンプトは、しばしばより良い転送可能性を示し、場合によっては、より大きな言語モデルによって達成されたものと同等以上のレベルまでタスクパフォーマンスを高め、推論よりも勾配によって導かれるプロンプト最適化の有効性を強調している。 GReaTerのコードはhttps://github.com/psunlpgroup/GreaTerで入手できる。

論文の概要: GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

関連論文リスト