Fugu-MT Paper Translation (Abstract): Adversarial SQL Injection Generation with LLM-Based Architectures

Paper abstract: Adversarial SQL Injection Generation with LLM-Based Architectures

arxiv url: http://arxiv.org/abs/2605.11188v1
Date: Mon, 11 May 2026 19:52:44 GMT
Status: Translation complete
Last updated in system: 2026-05-13 21:48:56.394529
Title: Adversarial SQL Injection Generation with LLM-Based Architectures
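Title (reference translation): Adversarial SQL Injection Generation with LLM-Based Architectures
Authors: Ali Karakoc, H. Birkan Yilmaz
Abstract summary: SQL injection attacks remain among the serious attacks ranked in the Open Worldwide Application Security Project (OWASP) Top 10 threats. Advances in Artificial Intelligence (AI), especially in Large Language Models (LLMs), have created an opportunity to automate adversarial attack tests that measure defense mechanisms. We introduce two novel LLM-based systems, Retrieval Augmented Generation for Adversarial SQLi (RADAGAS) and Reflective Chain-of-Thought SQLi (RefleXQLi). Our tests consist of 240 experiments that generate 240,000 payloads and perform 2.2 million tests against Web Application Firewalls (WAFs).
Reference score (independently computed attention score): 0.15122029613508317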
License: http://creativecommons.org/licenses/by/4.0/
Abstract: SQL injection (SQLi) attacks remain among the serious attacks ranked in the Open Worldwide Application Security Project (OWASP) Top 10 threats. Advances in Artificial Intelligence (AI), especially in Large Language Models (LLMs), have created an opportunity to automate adversarial attack tests that measure defense mechanisms. In this paper, we aim to provide a comprehensive evaluation of use cases that utilize LLMs for adversarial SQL injection generation. We introduce two novel LLM-based systems, Retrieval Augmented Generation for Adversarial SQLi (RADAGAS) and Reflective Chain-of-Thought SQLi (RefleXQLi), and compare them with existing baselines against 10 Web Application Firewalls (WAFs) and one execution-based MySQL validator. For a comprehensive test, we used six rule-based open-source WAFs (ModSecurity PL1-3, Coraza PL1-3), two AI/ML-based WAFs (WAF-Brain, CNN-WAF), and two commercial WAFs (AWS WAF and Cloudflare WAF). For the LLMs, we used GPT-4o, Claude 3.7 Sonnet, and DeepSeek R1. Our tests consist of 240 experiments that generate 240,000 payloads and perform 2.2 million tests against WAFs. Our evaluation reveals that RADAGAS-GPT4o outperforms other baseline models with a 22.73% bypass rate. The proposed RADAGAS variants are highly successful on AI/ML-based WAFs (92.49% on WAF-Brain by RADAGAS-DeepSeek, 80.48% on CNN-WAF by RADAGAS-Claude), but struggle to bypass rule-based WAFs (0-5.70% on ModSecurity and Coraza). Another observation is that generating less diverse payloads achieves more bypasses; however, such payloads show poor results if the initially chosen payload is not successful. Our findings provide a comprehensive view of using LLM-based approaches in security testing.
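Abstract (reference translation): SQL injection (SQLi) attacks are still one of the serious attacks listed in the Open Worldwide Application Security Project (OWASP) Top 10 threats. Today, progress in Artificial Intelligence (AI), and in Large Language Models (LLMs) in particular, has opened the way to automating adversarial attack tests that measure defense mechanisms. This paper aims at a comprehensive evaluation of use cases in which LLMs generate adversarial SQL injections. It introduces two new LLM-based systems, Retrieval Augmented Generation for Adversarial SQLi (RADAGAS) and Reflective Chain-of-Thought SQLi (RefleXQLi), and compares them with existing baselines against 10 Web Application Firewalls (WAFs) and one execution-based MySQL validator. The test set covers six rule-based open-source WAFs (ModSecurity PL1-3, Coraza PL1-3), two AI/ML-based WAFs (WAF-Brain, CNN-WAF), and two commercial WAFs (AWS WAF, Cloudflare WAF). The LLMs used are GPT-4o, Claude 3.7 Sonnet, and DeepSeek R1. The tests consist of 240 experiments that generate 240,000 payloads and run 2.2 million tests against WAFs. The evaluation shows that RADAGAS-GPT4o outperforms other baseline models with a bypass rate of 22.73%. The proposed RADAGAS variants are highly successful against AI/ML-based WAFs (92.49% on WAF-Brain with RADAGAS-DeepSeek, 80.48% on CNN-WAF with RADAGAS-Claude) but struggle to bypass rule-based WAFs (0-5.70% on ModSecurity and Coraza). In addition, generating less diverse payloads achieves more bypasses, but the results are poor when the initially chosen payload does not succeed. These findings give a comprehensive view of using LLM-based approaches in security testing.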


Related papers