PyTorch Tabular: A Framework for Deep Learning with Tabular Data
- URL: http://arxiv.org/abs/2104.13638v1
- Date: Wed, 28 Apr 2021 08:50:08 GMT
- Title: PyTorch Tabular: A Framework for Deep Learning with Tabular Data
- Authors: Manu Joseph
- Abstract summary: PyTorch Tabular is a new deep learning library built on top of PyTorch and PyTorch Lightning.
It works on pandas dataframes directly.
Many SOTA models like NODE and TabNet are already integrated and implemented in the library with a unified API.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In spite of showing unreasonable effectiveness in modalities like Text and
Image, Deep Learning has always lagged Gradient Boosting in tabular data - both
in popularity and performance. But recently there have been newer models
created specifically for tabular data, which is pushing the performance bar.
But popularity is still a challenge because there is no easy, ready-to-use
library like Sci-Kit Learn for deep learning. PyTorch Tabular is a new deep
learning library which makes working with Deep Learning and tabular data easy
and fast. It is a library built on top of PyTorch and PyTorch Lightning and
works on pandas dataframes directly. Many SOTA models like NODE and TabNet are
already integrated and implemented in the library with a unified API. PyTorch
Tabular is designed to be easily extensible for researchers, simple for
practitioners, and robust in industrial deployments.
Related papers
- PyPulse: A Python Library for Biosignal Imputation [58.35269251730328]
We introduce PyPulse, a Python package for imputation of biosignals in both clinical and wearable sensor settings.
PyPulse's framework provides a modular and extendable framework with high ease-of-use for a broad userbase, including non-machine-learning bioresearchers.
We released PyPulse under the MIT License on Github and PyPI.
arXiv Detail & Related papers (2024-12-09T11:00:55Z) - Drawing Pandas: A Benchmark for LLMs in Generating Plotting Code [1.5999407512883512]
This paper introduces the human-curated PandasPlotBench dataset.
It is designed to evaluate language models' effectiveness as assistants in visual data exploration.
arXiv Detail & Related papers (2024-12-03T19:05:37Z) - PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning [52.92232950707439]
We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data.
We demonstrate the usefulness of PyTorch Frame by implementing diverse models in a modular way.
We integrate PyTorch Frame with PyTorch Geometric, a PyTorch library for Graph Neural Networks (GNNs), to perform end-to-end learning over relational databases.
arXiv Detail & Related papers (2024-03-31T19:15:09Z) - TabuLa: Harnessing Language Models for Tabular Data Synthesis [4.539846270369207]
Tabula is a tabular data synthesizer that leverages the structure of large language models (LLMs)
Unlike state-of-the-art (SOTA) LLMs, Tabula discards the pre-trained weights originally designed for natural language tasks.
experiments show that Tabula achieves superior synthetic data utility compared to current SOTA methods.
arXiv Detail & Related papers (2023-10-19T13:50:56Z) - Causal-learn: Causal Discovery in Python [53.17423883919072]
Causal discovery aims at revealing causal relations from observational data.
$textitcausal-learn$ is an open-source Python library for causal discovery.
arXiv Detail & Related papers (2023-07-31T05:00:35Z) - torchgfn: A PyTorch GFlowNet library [56.071033896777784]
torchgfn is a PyTorch library that aims to address this need.
It provides users with a simple API for environments and useful abstractions for samplers and losses.
arXiv Detail & Related papers (2023-05-24T00:20:59Z) - PyGOD: A Python Library for Graph Outlier Detection [56.33769221859135]
PyGOD is an open-source library for detecting outliers in graph data.
It supports a wide array of leading graph-based methods for outlier detection.
PyGOD is released under a BSD 2-Clause license at https://pygod.org and at the Python Package Index (PyPI)
arXiv Detail & Related papers (2022-04-26T06:15:21Z) - PyTorchVideo: A Deep Learning Library for Video Understanding [71.89124881732015]
PyTorchVideo is an open-source deep-learning library for video understanding tasks.
It covers a full stack of video understanding tools including multimodal data loading, transformations, and models.
The library is based on PyTorch and can be used by any training framework.
arXiv Detail & Related papers (2021-11-18T18:59:58Z) - Small-Text: Active Learning for Text Classification in Python [23.87081733039124]
small-text is an easy-to-use active learning library for Python.
It offers pool-based active learning for single- and multi-label text classification.
arXiv Detail & Related papers (2021-07-21T19:23:56Z) - fastai: A Layered API for Deep Learning [1.7223564681760164]
fastai is a deep learning library which provides practitioners with high-level components.
It provides researchers with low-level components that can be mixed and matched to build new approaches.
arXiv Detail & Related papers (2020-02-11T21:16:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.