Fugu-MT 論文翻訳(概要): SecureReviewer: Enhancing Large Language Models for Secure Code Review through Secure-aware Fine-tuning

論文の概要: SecureReviewer: Enhancing Large Language Models for Secure Code Review through Secure-aware Fine-tuning

arxiv url: http://arxiv.org/abs/2510.26457v1
Date: Thu, 30 Oct 2025 13:06:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-31 16:05:09.81439
Title: SecureReviewer: Enhancing Large Language Models for Secure Code Review through Secure-aware Fine-tuning
Title（参考訳）: SecureReviewer: セキュアなコードレビューのための大規模言語モデルの実現
Authors: Fang Liu, Simiao Liu, Yinghao Zhu, Xiaoli Lian, Li Zhang,
Abstract要約: コードレビュー中にセキュリティ関連の問題を特定し解決するためにSecureReviewerを提案する。まず、セキュアなコードレビュー機能をトレーニングし評価するためのデータセットを構築します。我々は、ドメイン固有のセキュリティ知識に生成されたコメントを基盤とするRAG技術を統合する。
参考スコア（独自算出の注目度）: 8.229920162000369
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Identifying and addressing security issues during the early phase of the development lifecycle is critical for mitigating the long-term negative impacts on software systems. Code review serves as an effective practice that enables developers to check their teammates' code before integration into the codebase. To streamline the generation of review comments, various automated code review approaches have been proposed, where LLM-based methods have significantly advanced the capabilities of automated review generation. However, existing models primarily focus on general-purpose code review, their effectiveness in identifying and addressing security-related issues remains underexplored. Moreover, adapting existing code review approaches to target security issues faces substantial challenges, including data scarcity and inadequate evaluation metrics. To address these limitations, we propose SecureReviewer, a new approach designed for enhancing LLMs' ability to identify and resolve security-related issues during code review. Specifically, we first construct a dataset tailored for training and evaluating secure code review capabilities. Leveraging this dataset, we fine-tune LLMs to generate code review comments that can effectively identify security issues and provide fix suggestions with our proposed secure-aware fine-tuning strategy. To mitigate hallucination in LLMs and enhance the reliability of their outputs, we integrate the RAG technique, which grounds the generated comments in domain-specific security knowledge. Additionally, we introduce SecureBLEU, a new evaluation metric designed to assess the effectiveness of review comments in addressing security issues. Experimental results demonstrate that SecureReviewer outperforms state-of-the-art baselines in both security issue detection accuracy and the overall quality and practical utility of generated review comments.
Abstract（参考訳）: 開発ライフサイクルの初期段階におけるセキュリティ問題の特定と対処は、ソフトウェアシステムに対する長期的なネガティブな影響を軽減するために重要である。コードレビューは、開発者がコードベースに統合される前にチームメイトのコードをチェックできる効果的なプラクティスである。レビューコメントの生成を効率化するために,LSMベースの手法が自動レビュー生成の能力を大幅に向上させた,さまざまな自動コードレビューアプローチが提案されている。しかし、既存のモデルは、主に汎用コードレビューに焦点を当てており、セキュリティ関連の問題を特定し、対処する上での有効性は未検討のままである。さらに、既存のコードレビューアプローチをセキュリティ問題に適応させるには、データ不足や不適切な評価指標など、重大な課題に直面します。これらの制限に対処するために、コードレビュー中にセキュリティ関連の問題を識別および解決するLLMの能力を高めるために設計された新しいアプローチであるSecureReviewerを提案する。具体的には、まず、セキュアなコードレビュー機能をトレーニングし、評価するためのデータセットを構築します。このデータセットを活用することで、LLMを微調整してコードレビューコメントを生成し、セキュリティ上の問題を効果的に識別し、提案したセキュアな微調整戦略による修正提案を提供します。 LLMにおける幻覚を緩和し、その出力の信頼性を高めるために、ドメイン固有のセキュリティ知識において生成されたコメントを基盤とするRAG技術を統合する。さらに、セキュリティ問題に対処する際のレビューコメントの有効性を評価するために設計された新しい評価指標SecureBLEUを紹介する。実験の結果,SecureReviewerは,セキュリティ問題検出精度と,生成したレビューコメントの全体的な品質と実用性の両方において,最先端のベースラインを上回っていることがわかった。

論文の概要: SecureReviewer: Enhancing Large Language Models for Secure Code Review through Secure-aware Fine-tuning

関連論文リスト