Sig-Networks Toolkit: Signature Networks for Longitudinal Language
Modelling
- URL: http://arxiv.org/abs/2312.03523v2
- Date: Tue, 6 Feb 2024 12:14:19 GMT
- Title: Sig-Networks Toolkit: Signature Networks for Longitudinal Language
Modelling
- Authors: Talia Tseriotou, Ryan Sze-Yin Chan, Adam Tsakalidis, Iman Munire
Bilal, Elena Kochkina, Terry Lyons, Maria Liakata
- Abstract summary: We present an open-source, pip installable toolkit, Sig-Networks, for longitudinal language modelling.
A central focus is the incorporation of Signature-based Neural Network models, which have recently shown success in temporal tasks.
We release the Toolkit as a PyTorch package with an introductory video, Git repositories for preprocessing and modelling including sample notebooks on the modeled NLP tasks.
- Score: 14.619019557308807
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present an open-source, pip installable toolkit, Sig-Networks, the first
of its kind for longitudinal language modelling. A central focus is the
incorporation of Signature-based Neural Network models, which have recently
shown success in temporal tasks. We apply and extend published research
providing a full suite of signature-based models. Their components can be used
as PyTorch building blocks in future architectures. Sig-Networks enables
task-agnostic dataset plug-in, seamless pre-processing for sequential data,
parameter flexibility, automated tuning across a range of models. We examine
signature networks under three different NLP tasks of varying temporal
granularity: counselling conversations, rumour stance switch and mood changes
in social media threads, showing SOTA performance in all three, and provide
guidance for future tasks. We release the Toolkit as a PyTorch package with an
introductory video, Git repositories for preprocessing and modelling including
sample notebooks on the modeled NLP tasks.
Related papers
- ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding [51.509115746992165]
We introduce ARKit LabelMaker, the first large-scale, real-world 3D dataset with dense semantic annotations.
We also push forward the state-of-the-art performance on ScanNet and ScanNet200 dataset with prevalent 3D semantic segmentation models.
arXiv Detail & Related papers (2024-10-17T14:44:35Z) - Comgra: A Tool for Analyzing and Debugging Neural Networks [35.89730807984949]
We introduce comgra, an open source python library for use with PyTorch.
Comgra extracts data about the internal activations of a model and organizes it in a GUI.
It can show both summary statistics and individual data points, compare early and late stages of training, focus on individual samples of interest, and visualize the flow of the gradient through the network.
arXiv Detail & Related papers (2024-07-31T14:57:23Z) - PyPOTS: A Python Toolbox for Data Mining on Partially-Observed Time
Series [0.0]
PyPOTS is an open-source Python library dedicated to data mining and analysis on partially-observed time series.
It provides easy access to diverse algorithms categorized into four tasks: imputation, classification, clustering, and forecasting.
arXiv Detail & Related papers (2023-05-30T07:57:05Z) - Twitmo: A Twitter Data Topic Modeling and Visualization Package for R [0.0]
Twitmo provides a broad range of methods to collect, pre-process, analyze and visualize geo-tagged Twitter data.
One of the innovations of the package is the automatic pooling of Tweets into longer pseudo- documents using hashtags.
arXiv Detail & Related papers (2022-07-08T12:23:20Z) - Interactive Visualization of Protein RINs using NetworKit in the Cloud [57.780880387925954]
In this paper, we consider an example from protein dynamics, specifically residue interaction networks (RINs)
We use NetworKit to build a cloud-based environment that enables domain scientists to run their visualization and analysis on large compute servers.
To demonstrate the versatility of this approach, we use it to build a custom Jupyter-based widget for RIN visualization.
arXiv Detail & Related papers (2022-03-02T17:41:45Z) - PyTorch Geometric Signed Directed: A Software Package on Graph Neural
Networks for Signed and Directed Graphs [20.832917829426098]
PyTorch Geometric Signed Directed (PyGSD) is a software package for signed and directed networks.
PyGSD consists of easy-to-use GNN models, synthetic and real-world data, as well as task-specific evaluation metrics and loss functions.
As an extension library for PyG, our proposed software is maintained with open-source releases, detailed documentation, continuous integration, unit tests and code coverage checks.
arXiv Detail & Related papers (2022-02-22T10:25:59Z) - ESPnet2-TTS: Extending the Edge of TTS Research [62.92178873052468]
ESPnet2-TTS is an end-to-end text-to-speech (E2E-TTS) toolkit.
New features include: on-the-fly flexible pre-processing, joint training with neural vocoders, and state-of-the-art TTS models with extensions like full-band E2E text-to-waveform modeling.
arXiv Detail & Related papers (2021-10-15T03:27:45Z) - Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision
Datasets from 3D Scans [103.92680099373567]
This paper introduces a pipeline to parametrically sample and render multi-task vision datasets from comprehensive 3D scans from the real world.
Changing the sampling parameters allows one to "steer" the generated datasets to emphasize specific information.
Common architectures trained on a generated starter dataset reached state-of-the-art performance on multiple common vision tasks and benchmarks.
arXiv Detail & Related papers (2021-10-11T04:21:46Z) - Temporal Graph Network Embedding with Causal Anonymous Walks
Representations [54.05212871508062]
We propose a novel approach for dynamic network representation learning based on Temporal Graph Network.
For evaluation, we provide a benchmark pipeline for the evaluation of temporal network embeddings.
We show the applicability and superior performance of our model in the real-world downstream graph machine learning task provided by one of the top European banks.
arXiv Detail & Related papers (2021-08-19T15:39:52Z) - SALA: Soft Assignment Local Aggregation for Parameter Efficient 3D
Semantic Segmentation [65.96170587706148]
We focus on designing a point local aggregation function that yields parameter efficient networks for 3D point cloud semantic segmentation.
We explore the idea of using learnable neighbor-to-grid soft assignment in grid-based aggregation functions.
arXiv Detail & Related papers (2020-12-29T20:16:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.