Fugu-MT 論文翻訳(概要): Hamster: A Large-Scale Study and Characterization of Developer-Written Tests

論文の概要: Hamster: A Large-Scale Study and Characterization of Developer-Written Tests

arxiv url: http://arxiv.org/abs/2509.26204v1
Date: Tue, 30 Sep 2025 13:08:23 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-01 17:09:04.544173
Title: Hamster: A Large-Scale Study and Characterization of Developer-Written Tests
Title（参考訳）: Hamster: 開発者によるテストの大規模調査と評価
Authors: Rangeet Pan, Tyler Stennett, Raju Pavuluri, Nate Levin, Alessandro Orso, Saurabh Sinha,
Abstract要約: 我々はJavaアプリケーションの開発者によるテストについて調査し、オープンソースリポジトリから170万のテストケースをカバーした。この結果から,開発者によるテストの大部分は,現在のATGツールの能力以上の特性を示すことがわかった。私たちは、現在のツール機能と開発者のテストプラクティスに対するより効果的なツールサポートのギャップを埋めるのに役立つ有望な研究方向を特定します。
参考スコア（独自算出の注目度）: 44.65515600399573
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Automated test generation (ATG), which aims to reduce the cost of manual test suite development, has been investigated for decades and has produced countless techniques based on a variety of approaches: symbolic analysis, search-based, random and adaptive-random, learning-based, and, most recently, large-language-model-based approaches. However, despite this large body of research, there is still a gap in our understanding of the characteristics of developer-written tests and, consequently, in our assessment of how well ATG techniques and tools can generate realistic and representative tests. To bridge this gap, we conducted an extensive empirical study of developer-written tests for Java applications, covering 1.7 million test cases from open-source repositories. Our study is the first of its kind in studying aspects of developer-written tests that are mostly neglected in the existing literature, such as test scope, test fixtures and assertions, types of inputs, and use of mocking. Based on the characterization, we then compare existing tests with those generated by two state-of-the-art ATG tools. Our results highlight that a vast majority of developer-written tests exhibit characteristics that are beyond the capabilities of current ATG tools. Finally, based on the insights gained from the study, we identify promising research directions that can help bridge the gap between current tool capabilities and more effective tool support for developer testing practices. We hope that this work can set the stage for new advances in the field and bring ATG tools closer to generating the types of tests developers write.
Abstract（参考訳）: 手動テストスイートの開発コストを削減することを目的とした自動テスト生成(ATG)は、何十年にもわたって研究され、記号解析、検索ベース、ランダムおよび適応ランダム、学習ベース、そして最近では、大規模言語モデルに基づくアプローチなど、様々なアプローチに基づいて数え切れないほどの技術を生み出してきた。しかし、この大規模な研究にもかかわらず、開発者によるテストの特徴に対する理解にはまだギャップがあり、その結果、ATGの技術やツールが現実的で代表的なテストを生成することができるかを評価する上では、まだギャップがある。このギャップを埋めるため、私たちはJavaアプリケーションの開発者によるテストに関する広範な実証的研究を行い、オープンソースリポジトリから170万のテストケースをカバーしました。私たちの研究は、テストスコープ、テストフィクスチャとアサーション、入力の種類、モックの使用など、既存の文献にほとんど無視されている開発者によるテストの側面を研究する上で、初めてのものです。評価結果に基づき、既存のテストと2つの最先端ATGツールで生成されたテストを比較する。私たちの結果は、開発者によるテストの大部分は、現在のATGツールの能力を超える特性を示しています。最後に、この調査から得られた洞察に基づいて、現在のツール機能と開発者のテストプラクティスに対するより効果的なツールサポートのギャップを埋めるのに役立つ有望な研究方向を特定します。この作業によって、この分野における新たな進歩のステージが整い、ATGツールを開発者が書くテストのタイプに近づけることを期待しています。

論文の概要: Hamster: A Large-Scale Study and Characterization of Developer-Written Tests

関連論文リスト