Fugu-MT 論文翻訳(概要): MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction

論文の概要: MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction

arxiv url: http://arxiv.org/abs/2509.23459v1
Date: Sat, 27 Sep 2025 19:07:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.238897
Title: MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction
Title（参考訳）: MaskSQL: 抽象化によるLLMベースのテキストからSQLへのプライバシ保護
Authors: Sepideh Abedini, Shubhankar Mohapatra, D. B. Emerson, Masoumeh Shafieinejad, Jesse C. Cresswell, Xi He,
Abstract要約: 大規模言語モデル(LLM)は推論を必要とするタスクに対して有望なパフォーマンスを示す。最先端のLCMもプロプライエタリであり、コストがかかり、リソース集約であり、ローカルなデプロイメントを非現実的にしている。プライバシー保護機構として抽象化を利用するテキスト・ツー・アクション・フレームワークであるMaskを紹介する。
参考スコア（独自算出の注目度）: 9.405530537180129
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have shown promising performance on tasks that require reasoning, such as text-to-SQL, code generation, and debugging. However, regulatory frameworks with strict privacy requirements constrain their integration into sensitive systems. State-of-the-art LLMs are also proprietary, costly, and resource-intensive, making local deployment impractical. Consequently, utilizing such LLMs often requires sharing data with third-party providers, raising privacy concerns and risking noncompliance with regulations. Although fine-tuned small language models (SLMs) can outperform LLMs on certain tasks and be deployed locally to mitigate privacy concerns, they underperform on more complex tasks such as text-to-SQL translation. In this work, we introduce MaskSQL, a text-to-SQL framework that utilizes abstraction as a privacy protection mechanism to mask sensitive information in LLM prompts. Unlike redaction, which removes content entirely, or generalization, which broadens tokens, abstraction retains essential information while discarding unnecessary details, striking an effective privacy-utility balance for the text-to-SQL task. Moreover, by providing mechanisms to control the privacy-utility tradeoff, MaskSQL facilitates adoption across a broader range of use cases. Our experimental results show that MaskSQL outperforms leading SLM-based text-to-SQL models and achieves performance approaching state-of-the-art LLM-based models, while preserving privacy.
Abstract（参考訳）: 大規模言語モデル(LLM)は、テキストからSQL、コード生成、デバッグなど、推論を必要とするタスクで有望なパフォーマンスを示している。しかし、厳格なプライバシー要件を持つ規制フレームワークは、機密システムへの統合を制限している。最先端のLLMもプロプライエタリであり、コストがかかり、リソース集約であり、ローカルなデプロイメントを非現実的にしている。そのため、このようなLCMを利用することで、サードパーティプロバイダとデータを共有し、プライバシー上の懸念を高め、規制に従わないリスクを負うことが少なくない。微調整された小さな言語モデル(SLM)は特定のタスクにおいてLLMよりも優れ、プライバシー上の懸念を軽減するためにローカルにデプロイできるが、テキストからSQLへの変換のようなより複雑なタスクではパフォーマンスが劣る。本研究では,LLMプロンプト内の機密情報を隠蔽するプライバシ保護機構として抽象化を利用するテキスト-SQLフレームワークであるMaskSQLを紹介する。コンテンツを完全に削除するリアクションやトークンを広げる一般化とは異なり、抽象化は不要な詳細を破棄しながら必須情報を保持し、テキストからSQLタスクの効果的なプライバシとユーティリティのバランスを損なう。さらに、プライバシとユーティリティのトレードオフを制御するメカニズムを提供することで、MaskSQLは幅広いユースケースにまたがって採用を促進する。実験の結果、MaskSQLはSLMベースのテキスト-SQLモデルよりも優れており、プライバシを保ちながら最先端のLCMベースのモデルに近づくパフォーマンスを実現していることがわかった。

論文の概要: MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction

関連論文リスト