Fugu-MT Paper Translation (Abstract): JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks

Paper overview: JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks

arxiv url: http://arxiv.org/abs/2510.18013v3
Date: Mon, 10 Nov 2025 13:52:00 GMT
Status: Translation complete
Last updated in system: 2025-11-11 19:11:14.357343
Title: JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks
Title (reference translation): JunoBench: A benchmark dataset of crashes in Jupyter notebooks for Python machine learning
Authors: Yiran Wang, José Antonio Hernández López, Ulf Nilsson, Dániel Varró
Abstract summary: JunoBench is a benchmark dataset of real-world crashes in Python-based ML notebooks. It includes 111 curated and reproducible crashes.
Reference score (independently computed attention score): 4.768285672660128
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Jupyter notebooks are widely used for machine learning (ML) prototyping. Yet, few debugging tools are designed for ML code in notebooks, partly due to the lack of benchmarks. We introduce JunoBench, the first benchmark dataset of real-world crashes in Python-based ML notebooks. JunoBench includes 111 curated and reproducible crashes with verified fixes from public Kaggle notebooks, covering popular ML libraries (e.g., TensorFlow/Keras, PyTorch, Scikit-learn) and notebook-specific out-of-order execution errors. JunoBench ensures reproducibility and ease of use through a unified environment that reliably reproduces all crashes. By providing realistic crashes, their resolutions, richly annotated labels of crash characteristics, and natural-language diagnostic annotations, JunoBench facilitates research on bug detection, localization, diagnosis, and repair in notebook-based ML development.
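Abstract (reference translation): Jupyter notebooks are widely used for machine learning (ML) prototyping. However, few debugging tools are designed for ML code in notebooks, partly because of the lack of benchmarks. JunoBench is a benchmark dataset of real-world crashes in Python-based ML notebooks. It includes 111 curated, reproducible crashes with fixes from public Kaggle notebooks, covering popular ML libraries (e.g., TensorFlow/Keras, PyTorch, Scikit-learn) and notebook-specific out-of-order execution errors. JunoBench ensures reproducibility and ease of use through a unified environment that reliably reproduces all crashes. By providing realistic crashes, their resolutions, richly annotated labels of crash characteristics, and natural-language diagnostic annotations, JunoBench facilitates research on bug detection, localization, diagnosis, and repair in notebook-based ML development.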
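
The abstract mentions notebook-specific out-of-order execution errors. As a rough illustration only (this is not taken from JunoBench; the cell contents and the `run_cells` helper are hypothetical), the sketch below models two notebook cells as source strings that share one namespace, the way cells share a kernel's global state, and shows how running them in the wrong order can crash.

```python
# Hypothetical notebook cells, modeled as source strings that share one
# namespace, much like cells share a kernel's global state.
cells = {
    1: "data = [1.0, 2.0, 3.0]",
    2: "mean = sum(data) / len(data)",
}

def run_cells(order):
    """Execute cells in the given order; return the crash type name, or None."""
    namespace = {}
    for cell_id in order:
        try:
            exec(cells[cell_id], namespace)
        except Exception as exc:
            # Report only the exception class, e.g. "NameError".
            return type(exc).__name__
    return None

in_order = run_cells([1, 2])      # defining cell runs first: no crash
out_of_order = run_cells([2, 1])  # cell 2 runs before `data` exists
print(in_order, out_of_order)
```

In this toy setup the second ordering fails because cell 2 reads `data` before cell 1 has defined it; real notebook crashes of this kind can be subtler, for example when a cell is re-run after a later cell has overwritten a variable.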

Related papers