Fugu-MT 論文翻訳(概要): JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks

論文の概要: JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks

arxiv url: http://arxiv.org/abs/2510.18013v2
Date: Sat, 25 Oct 2025 07:59:51 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 13:14:10.589955
Title: JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks
Title（参考訳）: JunoBench: Python機械学習のJupyterノートブックにおけるクレーシェのベンチマークデータセット
Authors: Yiran Wang, José Antonio Hernández López, Ulf Nilsson, Dániel Varró,
Abstract要約: JunoBenchは、Pythonベースの機械学習ノートブックにおける実世界のクラッシュのベンチマークデータセットである。 JunoBenchには111のキュレーションと再現可能なクラッシュがあり、それぞれに検証可能な修正が備わっている。 JunoBenchは、クラッシュと修正を確実に再現できる統一された実行環境を提供する。
参考スコア（独自算出の注目度）: 4.768285672660128
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Jupyter notebooks are widely used for machine learning (ML) prototyping. Yet few debugging tools are designed for ML code in notebooks, potentially due to the lack of benchmarks. We introduce JunoBench, the first benchmark dataset of real-world crashes in Python-based ML notebooks. JunoBench has 111 curated and reproducible crashes from public Kaggle notebooks, each paired with a verifiable fix, ranging over popular ML libraries, including TensorFlow/Keras, PyTorch, Scikit-learn, Pandas, and NumPy, as well as notebook-specific out-of-order execution issue. To support reproducibility and ease of use, JunoBench offers a unified execution environment where crashes and fixes can be reliably reproduced. By providing realistic crashes and their resolutions, JunoBench facilitates bug detection, localization, diagnosis, and repair tailored to the interactive and iterative nature of notebook-based ML development.
Abstract（参考訳）: Jupyterノートは機械学習(ML)プロトタイピングに広く使われている。しかし、ノートブックのMLコード用に設計されたデバッグツールはほとんどない。 JunoBenchは、PythonベースのMLノートブックにおける実世界のクラッシュのベンチマークデータセットである。 JunoBenchには、公開Kaggleノートブックから111のキュレーションと再現可能なクラッシュがあり、それぞれが検証可能な修正とペアリングされており、TensorFlow/Keras、PyTorch、Scikit-learn、Pandas、NumPyなど、一般的なMLライブラリにまたがっている。再現性と使いやすさをサポートするために、JunoBenchは、クラッシュと修正を確実に再現できる統一された実行環境を提供する。現実的なクラッシュとその解決を提供することで、JunoBenchはノートブックベースのML開発におけるインタラクティブで反復的な性質に合わせて、バグ検出、ローカライゼーション、診断、修復を容易にする。

論文の概要: JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks

関連論文リスト