Fugu-MT 論文翻訳(概要): A Large-Scale Empirical Study of AI-Generated Code in Real-World Repositories

論文の概要: A Large-Scale Empirical Study of AI-Generated Code in Real-World Repositories

arxiv url: http://arxiv.org/abs/2603.27130v2
Date: Fri, 03 Apr 2026 07:17:25 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 12:42:34.075353
Title: A Large-Scale Empirical Study of AI-Generated Code in Real-World Repositories
Title（参考訳）: 実世界のリポジトリにおけるAI生成コードの大規模実証的研究
Authors: Tianhao Mao, Dongfang Zhao, Haixu Tang, Xiaofeng Wang, Hang Zhang,
Abstract要約: 実世界のリポジトリから収集したAI生成コードについて,大規模な実証的研究を行った。複雑度,構造特性,欠陥関連指標などのコードレベルの特性について検討する。我々の研究は、AI支援開発が従来の人間主導開発とどのように異なるかを強調した。
参考スコア（独自算出の注目度）: 11.883159173351743
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) are increasingly used in software development, generating code that ranges from short snippets to substantial project components. As AI-generated code becomes more common in real-world repositories, it is important to understand how it differs from human-written code and how AI assistance may influence development practices. However, existing studies have largely relied on small-scale or controlled settings, leaving a limited understanding of AI-generated code in the wild. In this work, we present a large-scale empirical study of AI-generated code collected from real-world repositories. We examine both code-level properties, including complexity, structural characteristics, and defect-related indicators, and commit-level characteristics, such as commit size, activity patterns, and post-commit evolution. To support this study, we develop a detection pipeline that combines heuristic filtering with LLM-based classification to identify AI-generated code and construct a large-scale dataset for analysis. Our study provides a comprehensive view of the characteristics of AI-generated code in practice and highlights how AI-assisted development differs from conventional human-driven development. These findings contribute to a better understanding of the real-world impact of AI-assisted programming and offer an empirical basis for future research on AI-generated software.
Abstract（参考訳）: 大規模言語モデル(LLM)は、短いスニペットから実質的なプロジェクトコンポーネントまで、様々なコードを生成するソフトウェア開発でますます使われている。 AI生成コードが現実のリポジトリで一般的になるにつれて、人間が書いたコードとどのように違うのか、AIアシストが開発プラクティスにどのように影響するかを理解することが重要です。しかし、既存の研究は、主に小規模または制御された設定に依存しており、AI生成コードの理解は限られている。本研究では,実世界のリポジトリから収集したAI生成コードについて,大規模な実証的研究を行う。複雑度,構造特性,欠陥関連指標などのコードレベルの特性とコミットサイズ,アクティビティパターン,コミット後の進化といったコミットレベルの特性について検討した。この研究を支援するために、ヒューリスティックフィルタリングとLLMに基づく分類を組み合わせた検出パイプラインを開発し、AI生成コードを特定し、分析のための大規模データセットを構築する。本研究は,AI生成コードの特徴を包括的に把握し,AI支援開発が従来の人間による開発とどのように異なるかを明らかにする。これらの発見は、AI支援プログラミングの現実世界への影響をよりよく理解し、AI生成ソフトウェアに関する将来の研究の実証的な基礎を提供する。

論文の概要: A Large-Scale Empirical Study of AI-Generated Code in Real-World Repositories

関連論文リスト