EasyTPP: Towards Open Benchmarking Temporal Point Processes
- URL: http://arxiv.org/abs/2307.08097v3
- Date: Wed, 24 Jan 2024 02:37:10 GMT
- Title: EasyTPP: Towards Open Benchmarking Temporal Point Processes
- Authors: Siqiao Xue, Xiaoming Shi, Zhixuan Chu, Yan Wang, Hongyan Hao, Fan
Zhou, Caigao Jiang, Chen Pan, James Y. Zhang, Qingsong Wen, Jun Zhou,
Hongyuan Mei
- Abstract summary: Temporal point processes (TPPs) have emerged as the most natural and competitive models.
EasyTPP is the first central repository of research assets (e.g., data, models, evaluation programs, documentations) in the area of event sequence modeling.
- Score: 36.759041669027745
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Continuous-time event sequences play a vital role in real-world domains such
as healthcare, finance, online shopping, social networks, and so on. To model
such data, temporal point processes (TPPs) have emerged as the most natural and
competitive models, making a significant impact in both academic and
application communities. Despite the emergence of many powerful models in
recent years, there hasn't been a central benchmark for these models and future
research endeavors. This lack of standardization impedes researchers and
practitioners from comparing methods and reproducing results, potentially
slowing down progress in this field. In this paper, we present EasyTPP, the
first central repository of research assets (e.g., data, models, evaluation
programs, documentations) in the area of event sequence modeling. Our EasyTPP
makes several unique contributions to this area: a unified interface of using
existing datasets and adding new datasets; a wide range of evaluation programs
that are easy to use and extend as well as facilitate reproducible research;
implementations of popular neural TPPs, together with a rich library of modules
by composing which one could quickly build complex models. All the data and
implementation can be found at
https://github.com/ant-research/EasyTemporalPointProcess. We will actively
maintain this benchmark and welcome contributions from other researchers and
practitioners. Our benchmark will help promote reproducible research in this
field, thus accelerating research progress as well as making more significant
real-world impacts.
Related papers
- EBES: Easy Benchmarking for Event Sequences [17.277513178760348]
Event sequences are common data structures in various real-world domains such as healthcare, finance, and user interaction logs.
Despite advances in temporal data modeling techniques, there is no standardized benchmarks for evaluating their performance on event sequences.
We introduce EBES, a comprehensive benchmarking tool with standardized evaluation scenarios and protocols.
arXiv Detail & Related papers (2024-10-04T13:03:43Z) - DiscoveryBench: Towards Data-Driven Discovery with Large Language Models [50.36636396660163]
We present DiscoveryBench, the first comprehensive benchmark that formalizes the multi-step process of data-driven discovery.
Our benchmark contains 264 tasks collected across 6 diverse domains, such as sociology and engineering.
Our benchmark, thus, illustrates the challenges in autonomous data-driven discovery and serves as a valuable resource for the community to make progress.
arXiv Detail & Related papers (2024-07-01T18:58:22Z) - State-Space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era [59.279784235147254]
This survey provides an in-depth summary of the latest approaches that are based on recurrent models for sequential data processing.
The emerging picture suggests that there is room for thinking of novel routes, constituted by learning algorithms which depart from the standard Backpropagation Through Time.
arXiv Detail & Related papers (2024-06-13T12:51:22Z) - Continual Learning with Pre-Trained Models: A Survey [61.97613090666247]
Continual Learning aims to overcome the catastrophic forgetting of former knowledge when learning new ones.
This paper presents a comprehensive survey of the latest advancements in PTM-based CL.
arXiv Detail & Related papers (2024-01-29T18:27:52Z) - Large Models for Time Series and Spatio-Temporal Data: A Survey and
Outlook [95.32949323258251]
Temporal data, notably time series andtemporal-temporal data, are prevalent in real-world applications.
Recent advances in large language and other foundational models have spurred increased use in time series andtemporal data mining.
arXiv Detail & Related papers (2023-10-16T09:06:00Z) - TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series [61.436361263605114]
Time series data are often scarce or highly sensitive, which precludes the sharing of data between researchers and industrial organizations.
We introduce Time Series Generative Modeling (TSGM), an open-source framework for the generative modeling of synthetic time series.
arXiv Detail & Related papers (2023-05-19T10:11:21Z) - Data Augmentation techniques in time series domain: A survey and
taxonomy [0.20971479389679332]
Deep neural networks used to work with time series heavily depend on the size and consistency of the datasets used in training.
This work systematically reviews the current state-of-the-art in the area to provide an overview of all available algorithms.
The ultimate aim of this study is to provide a summary of the evolution and performance of areas that produce better results to guide future researchers in this field.
arXiv Detail & Related papers (2022-06-25T17:09:00Z) - Few-Shot Named Entity Recognition: A Comprehensive Study [92.40991050806544]
We investigate three schemes to improve the model generalization ability for few-shot settings.
We perform empirical comparisons on 10 public NER datasets with various proportions of labeled data.
We create new state-of-the-art results on both few-shot and training-free settings.
arXiv Detail & Related papers (2020-12-29T23:43:16Z) - Leaf Segmentation and Counting with Deep Learning: on Model Certainty,
Test-Time Augmentation, Trade-Offs [2.984934409689467]
Deep supervised learning has been prevalent in recent works proposing better performing models at segmenting and counting leaves.
Despite good efforts from research groups, one of the main challenges for proposing better methods is still the limitation of labelled data availability.
arXiv Detail & Related papers (2020-12-21T17:00:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.