Surfboard: Audio Feature Extraction for Modern Machine Learning
- URL: http://arxiv.org/abs/2005.08848v1
- Date: Mon, 18 May 2020 16:20:20 GMT
- Title: Surfboard: Audio Feature Extraction for Modern Machine Learning
- Authors: Raphael Lenain, Jack Weston, Abhishek Shivkumar, Emil Fristed
- Abstract summary: Surfboard is an open-source Python library for extracting audio features with application to the medical domain.
It builds on state-of-the-art audio analysis packages and offers multiprocessing support for processing large workloads.
We illustrate Surfboard's application to a Parkinson's disease classification task, highlighting common pitfalls in existing research.
- Score: 11.527421282223948
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce Surfboard, an open-source Python library for extracting audio
features with application to the medical domain. Surfboard is written with the
aim of addressing pain points of existing libraries and facilitating joint use
with modern machine learning frameworks. The package can be accessed both
programmatically in Python and via its command line interface, allowing it to
be easily integrated within machine learning workflows. It builds on
state-of-the-art audio analysis packages and offers multiprocessing support for
processing large workloads. We review similar frameworks and describe
Surfboard's architecture, including the clinical motivation for its features.
Using the mPower dataset, we illustrate Surfboard's application to a
Parkinson's disease classification task, highlighting common pitfalls in
existing research. The source code is opened up to the research community to
facilitate future audio research in the clinical domain.
Related papers
- Deep Fast Machine Learning Utils: A Python Library for Streamlined Machine Learning Prototyping [0.0]
The Deep Fast Machine Learning Utils (DFMLU) library provides tools designed to automate and enhance aspects of machine learning processes.
DFMLU offers functionalities that support model development and data handling.
This manuscript presents an overview of DFMLU's functionalities, providing Python examples for each tool.
arXiv Detail & Related papers (2024-09-14T21:39:17Z) - pyvene: A Library for Understanding and Improving PyTorch Models via
Interventions [79.72930339711478]
$textbfpyvene$ is an open-source library that supports customizable interventions on a range of different PyTorch modules.
We show how $textbfpyvene$ provides a unified framework for performing interventions on neural models and sharing the intervened upon models with others.
arXiv Detail & Related papers (2024-03-12T16:46:54Z) - Open and reusable deep learning for pathology with WSInfer and QuPath [2.554961754124123]
We introduce WSInfer, a new, open-source software ecosystem designed to make deep learning for pathology more streamlined and accessible.
WSInfer comprises three main elements: 1) a Python package to efficiently apply patch-based deep learning inference to whole slide images; 2) a QuPath extension that provides an alternative inference engine through user-friendly and interactive software, and 3) a model zoo, which enables pathology models and metadata to be easily shared in a standardized form.
arXiv Detail & Related papers (2023-09-08T22:47:23Z) - SequeL: A Continual Learning Library in PyTorch and JAX [50.33956216274694]
SequeL is a library for Continual Learning that supports both PyTorch and JAX frameworks.
It provides a unified interface for a wide range of Continual Learning algorithms, including regularization-based approaches, replay-based approaches, and hybrid approaches.
We release SequeL as an open-source library, enabling researchers and developers to easily experiment and extend the library for their own purposes.
arXiv Detail & Related papers (2023-04-21T10:00:22Z) - pymdp: A Python library for active inference in discrete state spaces [52.85819390191516]
pymdp is an open-source package for simulating active inference in Python.
We provide the first open-source package for simulating active inference with POMDPs.
arXiv Detail & Related papers (2022-01-11T12:18:44Z) - WAX-ML: A Python library for machine learning and feedback loops on
streaming data [0.0]
WAX-ML is a research-oriented Python library.
It provides tools to design powerful machine learning algorithms.
It strives to complement JAX with tools dedicated to time series.
arXiv Detail & Related papers (2021-06-11T17:42:02Z) - SpeechBrain: A General-Purpose Speech Toolkit [73.0404642815335]
SpeechBrain is an open-source and all-in-one speech toolkit.
It is designed to facilitate the research and development of neural speech processing technologies.
It achieves competitive or state-of-the-art performance in a wide range of speech benchmarks.
arXiv Detail & Related papers (2021-06-08T18:22:56Z) - PyHealth: A Python Library for Health Predictive Models [53.848478115284195]
PyHealth is an open-source Python toolbox for developing various predictive models on healthcare data.
The data preprocessing module enables the transformation of complex healthcare datasets into machine learning friendly formats.
The predictive modeling module provides more than 30 machine learning models, including established ensemble trees and deep neural network-based approaches.
arXiv Detail & Related papers (2021-01-11T22:02:08Z) - Plato Dialogue System: A Flexible Conversational AI Research Platform [64.82999992143448]
Plato is a flexible Conversational AI platform written in Python that supports any kind of conversational agent architecture.
Plato has been designed to be easy to understand and debug and is agnostic to the underlying learning frameworks that train each component.
arXiv Detail & Related papers (2020-01-17T18:27:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.