Fugu-MT 論文翻訳(概要): LlamaRestTest: Effective REST API Testing with Small Language Models

論文の概要: LlamaRestTest: Effective REST API Testing with Small Language Models

arxiv url: http://arxiv.org/abs/2501.08598v2
Date: Thu, 03 Apr 2025 19:42:32 GMT
ステータス: 翻訳完了
システム内更新日: 2025-04-07 16:30:18.030539
Title: LlamaRestTest: Effective REST API Testing with Small Language Models
Title（参考訳）: LlamaRestTest: 小規模な言語モデルによる効果的なREST APIテスト
Authors: Myeongsoo Kim, Saurabh Sinha, Alessandro Orso,
Abstract要約: LlamaRestTestは、2つのLLM(Large Language Models)を使って現実的なテストインプットを生成する新しいアプローチである。私たちは、GPTを使った仕様強化ツールであるRESTGPTなど、最先端のREST APIテストツールに対して、これを評価しています。私たちの研究は、REST APIテストにおいて、小さな言語モデルは、大きな言語モデルと同様に、あるいは、より良く機能することができることを示しています。
参考スコア（独自算出の注目度）: 50.058600784556816
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Modern web services rely heavily on REST APIs, typically documented using the OpenAPI specification. The widespread adoption of this standard has resulted in the development of many black-box testing tools that generate tests based on OpenAPI specifications. Although Large Language Models (LLMs) have shown promising test-generation abilities, their application to REST API testing remains mostly unexplored. We present LlamaRestTest, a novel approach that employs two custom LLMs-created by fine-tuning and quantizing the Llama3-8B model using mined datasets of REST API example values and inter-parameter dependencies-to generate realistic test inputs and uncover inter-parameter dependencies during the testing process by analyzing server responses. We evaluated LlamaRestTest on 12 real-world services (including popular services such as Spotify), comparing it against RESTGPT, a GPT-powered specification-enhancement tool, as well as several state-of-the-art REST API testing tools, including RESTler, MoRest, EvoMaster, and ARAT-RL. Our results demonstrate that fine-tuning enables smaller models to outperform much larger models in detecting actionable parameter-dependency rules and generating valid inputs for REST API testing. We also evaluated different tool configurations, ranging from the base Llama3-8B model to fine-tuned versions, and explored multiple quantization techniques, including 2-bit, 4-bit, and 8-bit integer formats. Our study shows that small language models can perform as well as, or better than, large language models in REST API testing, balancing effectiveness and efficiency. Furthermore, LlamaRestTest outperforms state-of-the-art REST API testing tools in code coverage achieved and internal server errors identified, even when those tools use RESTGPT-enhanced specifications.
Abstract（参考訳）: 現代のWebサービスはREST APIに大きく依存しており、通常はOpenAPI仕様を使ってドキュメント化されている。この標準が広く採用されているため、OpenAPI仕様に基づいたテストを生成するブラックボックステストツールが多数開発されている。 Large Language Models (LLMs) は有望なテスト生成能力を示しているが、REST API テストへのアプリケーションはほとんど探索されていない。 LlamaRestTestは、REST APIのサンプル値のマイニングデータセットとパラメータ間の依存関係を使って、Llama3-8Bモデルを微調整し、定量化し、2つのカスタムLLMを利用する新しいアプローチである。私たちはLlamaRestTestを12の現実世界のサービス(Spotifyなどのポピュラーなサービスを含む)で評価し、GPTを使った仕様拡張ツールであるRESTGPTと、RESTler、MoRest、EvoMaster、ARAT-RLといった最先端のREST APIテストツールを比較しました。我々の結果は、ファインチューニングにより、アクション可能なパラメータ依存ルールを検出し、REST APIテストに有効なインプットを生成することで、より小さなモデルがはるかに大きなモデルを上回ることを示しています。また,基礎となるLlama3-8Bモデルから微調整バージョンまで,さまざまなツール構成を評価し,2ビット,4ビット,8ビットの整数形式を含む複数の量子化手法について検討した。私たちの研究は、REST APIのテスト、効率と効率のバランスをとることで、小さな言語モデルでも、大きな言語モデルでも、それよりも、あるいはそれ以上に、機能できることを示しています。さらに、LlamaRestTestは、コードカバレッジの達成と内部サーバエラーの特定において、最先端のREST APIテストツールよりも優れています。

論文の概要: LlamaRestTest: Effective REST API Testing with Small Language Models

関連論文リスト