Deepchecks: A Library for Testing and Validating Machine Learning Models
and Data
- URL: http://arxiv.org/abs/2203.08491v1
- Date: Wed, 16 Mar 2022 09:37:22 GMT
- Title: Deepchecks: A Library for Testing and Validating Machine Learning Models
and Data
- Authors: Shir Chorev, Philip Tannor, Dan Ben Israel, Noam Bressler, Itay
Gabbay, Nir Hutnik, Jonatan Liberman, Matan Perlmutter, Yurii Romanyshyn,
Lior Rokach
- Abstract summary: Deepchecks is a Python library for comprehensively validating machine learning models and data.
Our goal is to provide an easy-to-use library comprising of many checks related to various types of issues.
- Score: 8.876608553825227
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents Deepchecks, a Python library for comprehensively
validating machine learning models and data. Our goal is to provide an
easy-to-use library comprising of many checks related to various types of
issues, such as model predictive performance, data integrity, data distribution
mismatches, and more. The package is distributed under the GNU Affero General
Public License (AGPL) and relies on core libraries from the scientific Python
ecosystem: scikit-learn, PyTorch, NumPy, pandas, and SciPy. Source code,
documentation, examples, and an extensive user guide can be found at
\url{https://github.com/deepchecks/deepchecks} and
\url{https://docs.deepchecks.com/}.
Related papers
- $\texttt{skwdro}$: a library for Wasserstein distributionally robust machine learning [6.940992962425166]
skwdro is a Python library for training robust machine learning models.
It features both scikit-learn compatible estimators for popular objectives, as well as a wrapper for PyTorch modules.
arXiv Detail & Related papers (2024-10-28T17:16:00Z) - pyvene: A Library for Understanding and Improving PyTorch Models via
Interventions [79.72930339711478]
$textbfpyvene$ is an open-source library that supports customizable interventions on a range of different PyTorch modules.
We show how $textbfpyvene$ provides a unified framework for performing interventions on neural models and sharing the intervened upon models with others.
arXiv Detail & Related papers (2024-03-12T16:46:54Z) - Causal-learn: Causal Discovery in Python [53.17423883919072]
Causal discovery aims at revealing causal relations from observational data.
$textitcausal-learn$ is an open-source Python library for causal discovery.
arXiv Detail & Related papers (2023-07-31T05:00:35Z) - scikit-fda: A Python Package for Functional Data Analysis [0.0]
scikit-fda is a Python package for Functional Data Analysis (FDA)
It provides a comprehensive set of tools for representation, preprocessing, and exploratory analysis of functional data.
arXiv Detail & Related papers (2022-11-04T16:34:03Z) - PyGOD: A Python Library for Graph Outlier Detection [56.33769221859135]
PyGOD is an open-source library for detecting outliers in graph data.
It supports a wide array of leading graph-based methods for outlier detection.
PyGOD is released under a BSD 2-Clause license at https://pygod.org and at the Python Package Index (PyPI)
arXiv Detail & Related papers (2022-04-26T06:15:21Z) - PyHHMM: A Python Library for Heterogeneous Hidden Markov Models [63.01207205641885]
PyHHMM is an object-oriented Python implementation of Heterogeneous-Hidden Markov Models (HHMMs)
PyHHMM emphasizes features not supported in similar available frameworks: a heterogeneous observation model, missing data inference, different model order selection criterias, and semi-supervised training.
PyHHMM relies on the numpy, scipy, scikit-learn, and seaborn Python packages, and is distributed under the Apache-2.0 License.
arXiv Detail & Related papers (2022-01-12T07:32:36Z) - Solo-learn: A Library of Self-supervised Methods for Visual
Representation Learning [83.02597612195966]
solo-learn is a library of self-supervised methods for visual representation learning.
Implemented in Python, using Pytorch and Pytorch lightning, the library fits both research and industry needs.
arXiv Detail & Related papers (2021-08-03T22:19:55Z) - DoubleML -- An Object-Oriented Implementation of Double Machine Learning
in Python [1.4911092205861822]
DoubleML is an open-source Python library implementing the double machine learning framework of Chernozhukov et al.
It contains functionalities for valid statistical inference on causal parameters when the estimation of parameters is based on machine learning methods.
The package is distributed under the MIT license and relies on core libraries from the scientific Python ecosystem.
arXiv Detail & Related papers (2021-04-07T16:16:39Z) - mvlearn: Multiview Machine Learning in Python [103.55817158943866]
mvlearn is a Python library which implements the leading multiview machine learning methods.
The package can be installed from Python Package Index (PyPI) and the conda package manager.
arXiv Detail & Related papers (2020-05-25T02:35:35Z) - giotto-tda: A Topological Data Analysis Toolkit for Machine Learning and
Data Exploration [4.8353738137338755]
giotto-tda is a Python library that integrates high-performance topological data analysis with machine learning.
The library's ability to handle various types of data is rooted in a wide range of preprocessing techniques.
arXiv Detail & Related papers (2020-04-06T10:53:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.